Introduction to Spark - 2
2022-08-03 16:01:00 【@Autowire】
1 Dependency
Wide dependencies: involve a shuffle
One partition of the parent RDD is depended on by multiple partitions of the child RDD
Narrow dependencies: no shuffle
One partition of the parent RDD is depended on by at most one partition of the child RDD
Summary:
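Narrow dependencies: allow parallel execution and cheap fault tolerance (a lost partition can be recomputed from its own parent partition alone)
Wide dependencies: used for stage division (the stage after a shuffle has to wait until the shuffle has finished)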
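The rule above can be illustrated with a minimal sketch in plain Python (no Spark required). The function name `classify_dependency` and the dictionary layout are hypothetical, chosen only for illustration; they are not part of the Spark API. The input maps each parent partition index to the child partition indices that read from it.

```python
from typing import Dict, List


def classify_dependency(parent_to_children: Dict[int, List[int]]) -> str:
    """Return "narrow" if every parent partition feeds at most one
    child partition, otherwise "wide" (a shuffle is needed)."""
    for children in parent_to_children.values():
        if len(set(children)) > 1:
            return "wide"
    return "narrow"


# map-like: parent partition i feeds only child partition i
map_like = {0: [0], 1: [1], 2: [2]}

# coalesce-like: several parent partitions feed one child partition,
# but each parent partition still has only one consumer
coalesce_like = {0: [0], 1: [0], 2: [1]}

# groupByKey-like: records from each parent partition may land in
# any child partition, so each parent has several consumers
shuffle_like = {0: [0, 1], 1: [0, 1], 2: [0, 1]}

print(classify_dependency(map_like))
print(classify_dependency(coalesce_like))
print(classify_dependency(shuffle_like))
```

Note that the coalesce-like case is still narrow: what matters is how many child partitions consume a given parent partition, not how many parents a child reads from.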
2 DAG && Stage
Spark's DAG: the flow chart of how a Spark job/program executes.
The beginning of a DAG: the creation of the first RDD
The end of a DAG: the Action operation
A Spark program has as many DAGs as it has Action operations
Stage: a segment of the DAG, with stage boundaries placed at each shuffle.
A later stage can start only after the stage before it has finished.
The tasks within one stage can run in parallel without waiting for each other.
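As a rough sketch of stage division, the plain-Python snippet below cuts a linear chain of operations into stages at every operation that needs a shuffle. It is a simplification under stated assumptions: the function `split_into_stages` and the `(name, needs_shuffle)` tuples are hypothetical, and a real Spark lineage is a graph that the scheduler walks backwards from the final RDD rather than a flat list.

```python
from typing import List, Tuple


def split_into_stages(lineage: List[Tuple[str, bool]]) -> List[List[str]]:
    """lineage is an ordered list of (operation_name, needs_shuffle).
    A new stage begins at every operation that needs a shuffle."""
    stages: List[List[str]] = [[]]
    for name, needs_shuffle in lineage:
        # Start a new stage at a shuffle boundary, unless the
        # current stage is still empty.
        if needs_shuffle and stages[-1]:
            stages.append([])
        stages[-1].append(name)
    return stages


# A word-count style chain: only reduceByKey needs a shuffle.
lineage = [
    ("textFile", False),
    ("flatMap", False),
    ("map", False),
    ("reduceByKey", True),
    ("collect", False),
]

stages = split_into_stages(lineage)
for i, ops in enumerate(stages):
    print(f"stage {i}: {ops}")
```

Here the three narrow operations are pipelined into the first stage, and the second stage can only begin once the shuffle feeding `reduceByKey` is complete.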
3 Glossary
4 Job Submission Process