Detailed explanation of Flink parallelism and slot
2022-07-07 19:54:00 【A sharp fire rages to the sky】
Source:
https://blog.csdn.net/zuodaoyong/article/details/106178488?spm=1001.2101.3001.6650.1&utm_medium=distribute.pc_relevant.none-task-blog-2%7Edefault%7ECTRLIST%7ERate-1.pc_relevant_default&depth_1-utm_source=distribute.pc_relevant.none-task-blog-2%7Edefault%7ECTRLIST%7ERate-1.pc_relevant_default&utm_relevant_index=2
I. Concepts
1. Task: the set of subtasks in one stage that perform the same function, similar to a TaskSet in Spark.
2. SubTask: the smallest unit of execution of a job. It is a Java class that carries out the concrete computation logic.
3. Slot: the unit of isolation for computing resources. One slot can run multiple subtasks, but those subtasks must come from different stages of the same application.
Note: Flink splits operators into separate tasks in four main situations:
(1) An operator such as keyBy, broadcast, or rebalance causes a shuffle (data redistribution).
(2) The parallelism changes between operators.
(3) New chain: after startNewChain() is called on an operator, that operator is separated from the operators before it.
(4) Disabled chaining: after disableChaining() is called on an operator, the operator is cut off from the operators both before and after it and forms a task of its own. A typical use case is an operator with complex logic that should have the subtasks of a task to itself.
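The four rules above can be sketched as a small model. This is a simplified illustration in plain Java, not Flink's actual planner: the class TaskSplitSketch, the Op fields, and the sample pipeline with its parallelism values are all hypothetical, and only a linear pipeline is considered.

```java
import java.util.Arrays;
import java.util.List;

/** Simplified model of splitting a linear pipeline into tasks (not Flink's real planner). */
class TaskSplitSketch {

    /** One operator in the pipeline; every field here is illustrative. */
    static final class Op {
        final String name;
        final int parallelism;
        final boolean shuffledInput;   // input arrives via keyBy/broadcast/rebalance
        final boolean startNewChain;   // startNewChain() was called on this operator
        final boolean disableChaining; // disableChaining() was called on this operator

        Op(String name, int parallelism, boolean shuffledInput,
           boolean startNewChain, boolean disableChaining) {
            this.name = name;
            this.parallelism = parallelism;
            this.shuffledInput = shuffledInput;
            this.startNewChain = startNewChain;
            this.disableChaining = disableChaining;
        }
    }

    /** True if op must start a new task instead of being chained to prev. */
    static boolean breaksChain(Op prev, Op op) {
        return op.shuffledInput                      // rule (1): shuffle
                || op.parallelism != prev.parallelism // rule (2): parallelism change
                || op.startNewChain                   // rule (3): new chain
                || op.disableChaining                 // rule (4): cut from predecessor
                || prev.disableChaining;              // rule (4): cut from successor
    }

    /** Returns {taskCount, subTaskCount} for the pipeline. */
    static int[] count(List<Op> pipeline) {
        int tasks = 0;
        int subTasks = 0;
        Op prev = null;
        for (Op op : pipeline) {
            if (prev == null || breaksChain(prev, op)) {
                tasks++;
                subTasks += op.parallelism; // one subtask per parallel instance
            }
            prev = op;
        }
        return new int[] {tasks, subTasks};
    }

    /** Hypothetical WordCount-like pipeline; the parallelism values are assumptions. */
    static List<Op> samplePipeline() {
        return Arrays.asList(
                new Op("source", 1, false, false, false),
                new Op("flatMap", 4, false, false, false), // parallelism changes: new task
                new Op("map", 4, false, false, false),     // chained to flatMap
                new Op("sum", 4, true, false, false),      // keyBy in front: new task
                new Op("print", 4, false, false, false));  // chained to sum
    }

    public static void main(String[] args) {
        int[] result = count(samplePipeline());
        System.out.println(result[0] + " tasks, " + result[1] + " subtasks");
    }
}
```

With these assumed values the model yields 3 tasks and 9 subtasks. Marking map with disableChaining would add a fourth task, because map could then be chained neither to flatMap nor to the operator after it.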
II. Slots
Every TaskManager in Flink is a JVM process, and it can execute one or more subtasks in its slots.
The number of slots is usually proportional to the number of CPU cores available on each TaskManager node. A common choice is one slot per CPU core on the node.
The number of slots is set by taskmanager.numberOfTaskSlots in the flink-conf.yaml configuration file.
Note: a single slot cannot execute multiple subtasks of the same task.
Extension: sharing slots with slotSharingGroup(String slotSharingGroup)
Suppose a Flink cluster has 3 nodes: one JobManager and 2 TaskManagers. Each TaskManager has 2 slots, so the cluster has 4 slots in total.
By default, tasks are assigned to the slot sharing group named default.

Take WordCount as an example. When the application runs, its 5 tasks and 14 subtasks all run in the shared slots of the group named "default".
If slotSharingGroup("slot_name") is called on flatMap, flatMap is placed in slots of the group named slot_name.
The operators after it (map, keyBy, print) are also assigned to slots of the group named slot_name.
This leads to a failure. The cluster has 4 slots. One slot of the default group runs the source, and flatMap has a parallelism of 4, so it needs 4 slots in the slot_name group. Only 3 slots are left for slot_name, so resources are insufficient and the job fails to deploy.
The fix is to lower the parallelism to 3, or to remove the slotSharingGroup setting.
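The arithmetic behind this failure can be checked with a short sketch. It is a plain Java model rather than Flink code, and it assumes that each slot sharing group needs as many slots as the highest parallelism inside it, and that the source runs with parallelism 1 (the text only says it occupies one default slot). The class SlotSharingSketch and its method names are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Simplified slot-demand model for slot sharing groups (not Flink's scheduler). */
class SlotSharingSketch {

    /** Slots needed = sum over all groups of the highest parallelism in each group. */
    static int requiredSlots(String[] groups, int[] parallelisms) {
        Map<String, Integer> maxPerGroup = new LinkedHashMap<>();
        for (int i = 0; i < groups.length; i++) {
            maxPerGroup.merge(groups[i], parallelisms[i], Integer::max);
        }
        int total = 0;
        for (int p : maxPerGroup.values()) {
            total += p;
        }
        return total;
    }

    /** True if the cluster offers enough slots for the job. */
    static boolean fits(int taskManagers, int slotsPerTaskManager,
                        String[] groups, int[] parallelisms) {
        return requiredSlots(groups, parallelisms) <= taskManagers * slotsPerTaskManager;
    }

    public static void main(String[] args) {
        // Operators in order: source, flatMap, map, sum, print.
        String[] split = {"default", "slot_name", "slot_name", "slot_name", "slot_name"};
        String[] shared = {"default", "default", "default", "default", "default"};
        int[] parallelism4 = {1, 4, 4, 4, 4};
        int[] parallelism3 = {1, 3, 3, 3, 3};

        // 2 TaskManagers x 2 slots = 4 slots in the cluster.
        System.out.println(fits(2, 2, split, parallelism4));  // false: 1 + 4 = 5 slots needed
        System.out.println(fits(2, 2, split, parallelism3));  // true: 1 + 3 = 4 slots needed
        System.out.println(fits(2, 2, shared, parallelism4)); // true: max(1, 4) = 4 slots needed
    }
}
```

The second and third calls correspond to the two fixes mentioned above: lowering the parallelism to 3, or dropping the separate slot sharing group so that all operators share the default group again.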
Summary:
(1) The default slot sharing group name for Flink tasks is default.
(2) Call slotSharingGroup to set the slot sharing group in which an operator runs.
(3) If one operator changes the slot sharing group name and the operators after it do not set one, they use the same name as the most recent change.
(4) Subtasks with different slot sharing group names cannot run in the same slot.
III. Parallelism
1. Setting the parallelism
(1) Operator level
(2) Execution environment level
(3) Client level
(4) System level (set in the configuration file)
Priority of parallelism settings: Operator level > Execution environment level > Client level > System level
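In Flink these four levels are typically set with setParallelism(...) on an operator, setParallelism(...) on the StreamExecutionEnvironment, the -p option of flink run, and parallelism.default in the configuration file. The priority order above can be modelled as a simple fallback chain. The sketch below is plain Java for illustration only; the class ParallelismPrioritySketch and its resolve method are hypothetical and not part of Flink, and null stands for "not set at this level".

```java
/** Simplified model of the parallelism priority order (not Flink code). */
class ParallelismPrioritySketch {

    /**
     * Returns the value of the highest-priority level that is set.
     * A null argument means the level was not configured.
     */
    static int resolve(Integer operatorLevel, Integer environmentLevel,
                       Integer clientLevel, int systemDefault) {
        if (operatorLevel != null) {
            return operatorLevel;
        }
        if (environmentLevel != null) {
            return environmentLevel;
        }
        if (clientLevel != null) {
            return clientLevel;
        }
        return systemDefault;
    }

    public static void main(String[] args) {
        System.out.println(resolve(8, 4, 2, 1));          // 8: operator level wins
        System.out.println(resolve(null, 4, 2, 1));       // 4: environment level wins
        System.out.println(resolve(null, null, 2, 1));    // 2: client level wins
        System.out.println(resolve(null, null, null, 1)); // 1: system default applies
    }
}
```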