Getting Started with Spark (2)
2022-08-03 16:01:00 【@Autowire】
1 Dependency



Wide dependencies (with shuffle):
One partition of the parent RDD is depended on by multiple partitions of the child RDD.
Narrow dependencies (no shuffle):
Each partition of the parent RDD is depended on by at most one partition of the child RDD.
Summary:
Narrow dependencies: enable parallel, pipelined computation and cheap fault tolerance (a lost partition can be recomputed from only the parent partitions it depends on).
Wide dependencies: mark the stage boundaries (the stage after a shuffle must wait for the shuffle to complete).
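The narrow/wide rule above can be sketched as a small check. This is a toy model in plain Python, not Spark's API: the function name `classify_dependency` and the dictionary representation (parent partition index mapped to the set of child partition indices that read from it) are assumptions made for illustration.

```python
from typing import Dict, Set


def classify_dependency(parent_to_children: Dict[int, Set[int]]) -> str:
    """Classify a dependency in a toy model (hypothetical helper, not a Spark call).

    Returns "wide" if any parent partition is read by more than one
    child partition, otherwise "narrow".
    """
    for children in parent_to_children.values():
        if len(children) > 1:
            return "wide"
    return "narrow"


# map-like: parent partition i feeds only child partition i
map_like = {0: {0}, 1: {1}, 2: {2}}

# coalesce-like: several parent partitions feed the same child partition;
# each parent still has a single consumer, so this stays narrow
coalesce_like = {0: {0}, 1: {0}, 2: {1}}

# groupByKey-like: every parent partition may send records to every child
shuffle_like = {0: {0, 1}, 1: {0, 1}, 2: {0, 1}}

print(classify_dependency(map_like))       # narrow
print(classify_dependency(coalesce_like))  # narrow
print(classify_dependency(shuffle_like))   # wide
```

Note that the check looks at each parent partition's consumers, not at how many parents a child has, which is why the coalesce-like case is still narrow.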
2 DAG && Stage


Spark's DAG (directed acyclic graph): the execution flow graph of a Spark job/program.
Start of a DAG: the creation of an RDD
End of a DAG: an action
A Spark program has as many DAGs as it has action operations: each action triggers one DAG.
Stage: a segment of the DAG, split at shuffle boundaries.
A later stage can start only after the stages it depends on have finished.
Tasks within the same stage can run in parallel without waiting for one another.
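Stage division can be illustrated with a toy model. The sketch below is plain Python, not Spark's scheduler: it assumes a linear lineage given as a list of `(operation name, needs_shuffle)` pairs, and the helper name `split_into_stages` is hypothetical. The operation names are borrowed from common RDD operations purely as labels.

```python
from typing import List, Tuple


def split_into_stages(lineage: List[Tuple[str, bool]]) -> List[List[str]]:
    """Cut a linear lineage into stages at every shuffle (toy model).

    An operation that needs a shuffle starts a new stage; operations
    without a shuffle stay in the current stage.
    """
    stages: List[List[str]] = [[]]
    for name, needs_shuffle in lineage:
        # Open a new stage at a shuffle, unless the current stage is still empty.
        if needs_shuffle and stages[-1]:
            stages.append([])
        stages[-1].append(name)
    # Drop the placeholder stage if the lineage was empty.
    return [stage for stage in stages if stage]


# A word-count-like lineage; the True flags mark operations assumed to shuffle.
lineage = [
    ("textFile", False),
    ("flatMap", False),
    ("map", False),
    ("reduceByKey", True),
    ("map", False),
    ("sortByKey", True),
]

for index, stage in enumerate(split_into_stages(lineage)):
    print(f"stage {index}: {stage}")
```

With two shuffles in the lineage, the sketch yields three stages, which would run one after another, while the tasks inside each stage could run in parallel across partitions.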
3 Glossary




4 Job Submission Process
