当前位置:网站首页>1 Introduction to spark Foundation
1 Introduction to spark Foundation
2022-07-03 22:12:00 【Ruthless coding machine】
1 Spark What is it?
2 spark Ten years of wind and rain
3 Extended data
4 Spark Four characteristics
5 Spark Framework module
1.6 Spark The mode of operation of
1.7 Spark The role Architecture
8 summary
1 Spark What problem to solve ?
Massive data computing , Offline batch processing and real-time stream calculation can be carried out
2 Spark What are the modules ?
The core SparkCore、SQL Calculation (SparkSQL)、 Flow calculation (SparkStreaming
)、 Figure calculation (GraphX)、 machine learning (MLlib)
3 Spark What are the characteristics ?
Fast 、 Easy to use 、 Strong commonality 、 Multiple modes of operation
4 Hadoop Process based computing and Spark Advantages and disadvantages of thread based approach ?
Hadoop Medium MR Each of them map/reduce task It's all one java Run in process mode , The advantage is that processes are independent of each other , Every task Exclusive process resources , no
There is mutual interference , Easy to monitor , But the problem is task Inconvenient to share data between , The execution efficiency is relatively low . Like multiple map task Reading different data source files requires adding
To each map task in , Cause repeated loading and waste of memory . Thread based computing is to share data and improve execution efficiency ,Spark The minimum execution of threads is adopted
Company , But the disadvantage is that there will be resource competition between threads
5 Spark The mode of operation of ?
Local mode
• Cluster pattern (StandAlone、YARN、K8S)
• Cloud model
6 Spark Operation role of ( contrast YARN)?
• Master: Cluster resource management ( similar ResourceManager)
• Worker: Stand alone resource management ( similar NodeManager)
• Driver: Single task manager ( similar ApplicationMaster)
• Executor: Single task performer ( similar YARN Inside the container Task)
Basic concepts of threads
Thread is CPU The basic dispatching unit of
A process usually contains multiple threads , Multiple threads under a process share the resources of the process
Threads between different processes are not visible to each other
Threads cannot execute independently
One thread can create and undo another thread
边栏推荐
- Nacos common configuration
- Pooling idea: string constant pool, thread pool, database connection pool
- Conditional statements of shell programming
- The latest analysis of R1 quick opening pressure vessel operation in 2022 and the examination question bank of R1 quick opening pressure vessel operation
- 2022 G3 boiler water treatment registration examination and G3 boiler water treatment examination papers
- Yyds dry goods inventory hands-on teach you to create a jigsaw puzzle using the canvasapi
- Global and Chinese market of telematics boxes 2022-2028: Research Report on technology, participants, trends, market size and share
- Rest参考
- Exclusive interview with the person in charge of openkruise: to what extent has cloud native application automation developed now?
- regular expression
猜你喜欢
2022 safety officer-a certificate registration examination and summary of safety officer-a certificate examination
Exclusive interview with the person in charge of openkruise: to what extent has cloud native application automation developed now?
Minio deployment
Collections SQL communes
Buuctf, misc: n solutions
[dynamic planning] counting garlic customers: the log of garlic King (the longest increasing public subsequence)
pivot ROP Emporium
[actual combat record] record the whole process of the server being attacked (redis vulnerability)
Collection | pytoch common loss function disassembly
Blue Bridge Cup Guoxin Changtian MCU -- program download (III)
随机推荐
2022 G3 boiler water treatment registration examination and G3 boiler water treatment examination papers
What indicators should be paid attention to in current limit monitoring?
Yyds dry inventory Chapter 4 of getting started with MySQL: data types that can be stored in the data table
Remember the experience of automatically jumping to spinach station when the home page was tampered with
Netfilter ARP log
DR882-Qualcomm-Atheros-QCA9882-2T2R-MIMO-802.11ac-Mini-PCIe-Wi-Fi-Module-5G-high-power
How does sentinel, a traffic management artifact, make it easy for business parties to access?
BUUCTF,Misc:LSB
Cesium terrain clipping draw polygon clipping
js demo 计算本年度还剩下多少天
Great gods, I want to send two broadcast streams: 1. Load basic data from MySQL and 2. Load changes in basic data from Kafka
Report on the current situation and development trend of ethoxylated sodium alkyl sulfate industry in the world and China Ⓞ 2022 ~ 2027
2022 free examination questions for safety management personnel of hazardous chemical business units and reexamination examination for safety management personnel of hazardous chemical business units
The 14th five year plan and investment feasibility study report of China's industry university research cooperation Ⓧ 2022 ~ 2028
Dahua series books
Is the account opening of Guotai Junan Securities safe and reliable? How to open Guotai Junan Securities Account
DR-NAS26-Qualcomm-Atheros-AR9582-2T-2R-MIMO-802.11-N-5GHz-high-power-Mini-PCIe-Wi-Fi-Module
How PHP drives mongodb
Luogu deep foundation part 1 Introduction to language Chapter 6 string and file operation
pivot ROP Emporium