当前位置:网站首页>The spark operator - repartition operator
The spark operator - repartition operator
2022-08-05 06:11:00 【zdaiqing】
Source code
def repartition(numPartitions: Int)(implicit ord: Ordering[T] = null): RDD[T] = withScope {coalesce(numPartitions, shuffle = true)}- The bottom layer calls coalesce() to implement the repartitioning operation
- Mandatory shuffle operation
Repartition and coalesce relationship
- All operators are used to reset the number of partitions
- The bottom layer of repartition calls coalesce() to implement the repartition function
- repartition forces a shuffle operation
- coalesce decides whether to perform the shuffle operation according to the parameters
References
边栏推荐
猜你喜欢

每日一题-两数相加-0711

【UiPath2022+C#】UiPath控制流程概述

错误类型:reflection.ReflectionException: Could not set property ‘xxx‘ of ‘class ‘xxx‘ with value ‘xxx‘

UE5再次更新!扫描或手动建模面部模型可直接转为绑定好的Metahuman

入门文档08 条件插件

【Day8】 RAID磁盘阵列
![[Paper Intensive Reading] The relationship between Precision-Recall and ROC curves](/img/8f/3c9944db96eef623779a5abe68355b.png)
[Paper Intensive Reading] The relationship between Precision-Recall and ROC curves

入门文档11 自动添加版本号

Contextual non-local alignment of full-scale representations

论那些给得出高薪的游戏公司底气到底在哪里?
随机推荐
【UiPath2022+C#】UiPath 练习-数据操作
lvm逻辑卷及磁盘配额
添加新硬盘为什么扫描不上?如何解决?
每日一题-合并两个有序链表-0720
D39_向量
D39_欧拉角与四元数
Image compression failure problem
链表章6道easy总结(leetcode)
不吹不黑,这的确是我看过微服务架构最好的文章!
入门文档03 区分开发与生产环境(生产环境才执行‘热更新’)
TensorFlow ObjecDetectionAPI在win10系统Anaconda3下的配置
spark源码-任务提交流程之-5-CoarseGrainedExecutorBackend
每日一题-无重复字符的最长子串-0712
成功的独立开发者应对失败&冒名顶替综
spark源码-任务提交流程之-2-YarnClusterApplication
【Day1】VMware软件安装
入门文档12 webserve + 热更新
Leetcode刷题——对链表进行插入排序
Dsf5.0 bounced points determine not return a value
Getting Started 11 Automatically add version numbers