当前位置:网站首页>Spark entry learning-2
Spark entry learning-2
2022-08-03 16:01:00 【@Autowire】
1 Dependency
Wide dependencies: with shuffle
One partition of the parent RDD will be depended on by multiple partitions of the child RDD
Narrow dependencies: no shuffle
One partition of the parent RDD will only be depended on by one partition of the child RDD
Summary:
Narrow dependencies: parallelization + fault tolerance
WideDependency: perform stage division (the stage after shuffle needs to wait for shuffle to execute.
2 DAG && Stage
Spark's DAG: is the flow chart of spark task/program execution!
The beginning of DAG: from the creation of RDD
The end of DAG: to the end of Action
There are several DAGs in a Spark program by several Action operations
Stage: It is the stage divided by shuffle in DAG!
The latter stage can be executed only after the previous stage is executed.
Each task in the same stage can be executed in parallel without waiting!
3 Glossary
4 Job Submission Process
边栏推荐
- 红蓝对抗经验分享:CS免杀姿势
- ReentrantLock详解
- Taurus.MVC WebAPI 入门开发教程1:框架下载环境配置与运行(含系列目录)。
- No inner demons, to dry!SQL optimization and diagnosis
- 13、OOM模拟
- The general trend, another key industry related to Sino-US competition, has reached a critical moment
- C#.NET 国密数字信封
- spark入门学习-2
- 高可用版 主数据库数据结构改变 备数据库会自动改变吗
- 劲爆!协程终于来了!线程即将是过去式
猜你喜欢
Fortinet产品导入AWS AMI操作文档
Research on power flow in DC microgrid based on Newton's method (Matlab code implementation)
Internship Road: Documenting Confusion in My First Internship Project
用友YonSuite与旺店通数据集成对接-技术篇2
身为售后工程师的我还是觉得软件测试香,转行成功定薪11.5K,特来分享下经验。
出海季,互联网出海锦囊之本地化
js数组方法总结
【码蹄集新手村600题】将一个函数定义宏
方舟开服工具、服务器教程win
【QT】Qt项目demo:数据在ui界面上显示,鼠标双击可弹窗显示具体信息
随机推荐
一个文件管理系统的软硬件配置清单
高可用版 主数据库数据结构改变 备数据库会自动改变吗
微电网和直流电网中最优潮流(OPF)的凸优化(Matlab代码实现)
JS基础--判断
Small Tools(4) 整合Seata1.5.2分布式事务
30W 2C(JD6606S + FP6652X2)BOM
一通骚操作,我把SQL执行效率提高了10000000倍!
请问下阿里云全托管flink能执行两条flink sql命令么?
【899. 有序队列】
ECCV 2022 | Relational Query-Based Temporal Action Detection Methods
方舟开服教程win
Neural networks, cool?
您的移动端app安全吗
leetcode: 899. Ordered Queue [Thinking Question]
2021年12月电子学会图形化三级编程题解析含答案:分身术
DC-DC 2C (40W/30W) JD6606SX2 power back application
新版本的 MaxCompute 中,SQL支持的 LIMIT OFFSET 的语法是什么功能?
DC-DC 2C(40W/30W) JD6606SX2退功率应用
STM32 GPIO LED和蜂鸣器实现【第四天】
土耳其国防部:联合协调中心将对首艘乌克兰粮船进行安全检查