当前位置:网站首页>Spark Learning Notes (III) -- basic knowledge of spark core
Spark Learning Notes (III) -- basic knowledge of spark core
2022-07-24 14:37:00 【One's cow】
Catalog
Port number Description
4040( Calculation ):Spark View the current Spark-shell Operation task ;
8080( resources ):Standalone In mode ,Spark Master Web Port number ;
7077:Spark Master Internal communication services ;
18080:Spark History server port number ;
8088:Hadoop YARN Check the operation of the task and the port number .
spark Core module
spark SQL、spark Streaming、spark GraphX、spark MLlib

spark core
spark SQL
spark streaming
MLlib
GraphX
spark Core components
Spark The core of is a computing engine .
driver
driver yes spark Drive nodes , Used to perform spark In the task main Method , Responsible for the execution of the actual code .
Simple understanding ,driver Is the program that drives the entire application to run .
Mainly responsible for :
(1) Convert user programs into jobs (job);
(2) stay Executor Scheduling tasks between (task) ;
(3) track Executor Implementation of ;
(4) adopt UI Show how the query runs .
executor
executor It's in a cluster Worker One of the JVM process , Responsible for Spark Run specific tasks in the job , Tasks are independent of each other .
Spark App launch ,Executor The nodes start at the same time , And exists in Spark The life cycle of the application .
If there is Executor The node has failed or crashed ,spark It will schedule the tasks on the faulty node to other Executor Continue to run on the node .
Master&Worker
Master and Worker: there Master It's an entry cheng , Responsible for resource scheduling and allocation , Cluster monitoring, etc ;Worker It's also a process , One Worker Running on a server in the cluster ,Master Allocate resources Worker Parallel processing and calculation of data .
ApplicationMaster
ApplicationMaster A resource container used to request the resource scheduler to execute tasks Container, Run the user's own program tasks job, Monitor the execution of the entire task , Track the status of the entire task , Handle exceptions such as task failure .
Run the architecture
Driver: Manage job task scheduling in the whole cluster .
Executor: Responsible for the actual execution of tasks .

Submit task process
There are generally two ways of deployment execution :Client and Cluster
The main difference between the two models is :Driver The location of the running node of the program .Client Mode will be used for monitoring and scheduling Driver The module executes on the client .Cluster Mode will be used for monitoring and scheduling Driver Modules in Yarn Execution in a cluster .

Yarn Client
(1)Driver Run on the local machine that the task submitted ;
(2)Driver and ResourceManager Communications , Apply to start ApplicationMaster;
(3)ResourceManager Distribute Container, In the right place NodeManager Start the ApplicationMaster;
(4)ApplicationMaster Responsible for providing ResourceManager apply Executor Memory ;
(5)ResourceManager Distribute container;
(6)ApplicationMaster At the designated NodeManager Start the Executor process ;
(7)Executor towards Driver Reverse registration ;
(8)Executor When all registration is completed Driver Start execution ;
(9) perform main function , perform Action operator (Action Operator triggers a Job), Divide according to wide dependency stage, Every stage Generate corresponding TaskSet, take task Distribute to each Executor On the implementation .
Yarn Cluster
(1)Driver Task submitted ;
(2)Driver and ResourceManager Communications , Apply to start ApplicationMaster;
(3)ResourceManager Distribute Container, In the right place NodeManager Start the ApplicationMaster, This ApplicationMaster Namely Driver;
(4)ApplicationMaster Responsible for providing ResourceManager apply Executor Memory ;
(5)ResourceManager Distribute container;
(6)ApplicationMaster At the designated NodeManager Start the Executor process ;
(7)Executor towards Driver Reverse registration ;
(8)Executor When all registration is completed Driver Start execution ;
(9) perform main function , perform Action operator (Action Operator triggers a Job), Divide according to wide dependency stage, Every stage Generate corresponding TaskSet, take task Distribute to each Executor On the implementation .
边栏推荐
- Tensorflow framework of deep learning realizes vgg/rnn network / verification code generation and recognition / text classification
- Jmmert aggregation test report
- Maotai ice cream "bucked the trend" and became popular, but its cross-border meaning was not "selling ice cream"
- The vs compiled application is missing DLL
- TypeError: Cannot read property ‘make‘ of undefined
- Fraud detection cases and Titanic rescued cases
- VSCode如何调试Nodejs
- 字符串——剑指 Offer 58 - II. 左旋转字符串
- 茅台冰淇淋“逆势”走红,跨界之意却并不在“卖雪糕”
- Number of bytes occupied by variables of type char short int in memory
猜你喜欢

Notes on the use of IEEE transaction journal template

Overview of dobesie wavelet (DB wavelet function) in wavelet transform

看完这篇文章,才发现我的测试用例写的就是垃圾

Centos7 installs Damon stand-alone database

VSCode如何调试Nodejs

The server switches between different CONDA environments and views various user processes

LeetCode·每日一题·1184.公交站间的距离·模拟

After reading this article, I found that my test cases were written in garbage

The sliding window of Li Kou "step by step" (209. The smallest sub array, 904. Fruit baskets)

深度学习中的学习率调整策略(1)
随机推荐
TypeError: 'str' object does not support item assignment
字符串——剑指 Offer 58 - II. 左旋转字符串
Atcoder beginer contest 261 f / / tree array
电赛设计报告模板及历年资源
Moving the mouse into select options will trigger the mouseleave event processing scheme
不要灰心,大名鼎鼎的YOLO、PageRank影响力爆棚的研究,曾被CS顶会拒稿
Similarities and differences between nor flash and NAND flash
Regular expression and bypass cases
[oauth2] II. Known changes in oauth2.1
解决 uni-starter 使用本地函数可以登录微信 但是使用云函数登录失败
Extjs4 instance address and Chinese document address
2.4. properties of special profile
Stack and queue - 20. Valid parentheses
自动化渗透扫描工具
Overview of dobesie wavelet (DB wavelet function) in wavelet transform
The sliding window of Li Kou "step by step" (209. The smallest sub array, 904. Fruit baskets)
小熊派 课程导读
exchange
Research Summary / programming FAQs
How vscode debug nodejs