[YARN] Configuring capacity scheduler batch allocation on a CDP cluster
2022-07-06 11:32:00 【kiraraLou】
1. Preface
We are about to upgrade our CDH cluster to a CDP cluster. In a CDH cluster, the YARN service uses the fair scheduler by default, whereas a CDP cluster uses the capacity scheduler. We previously ran into a problem where the fair scheduler's batch allocation was configured poorly: tasks were concentrated on a few nodes, which left the cluster's resource load extremely unbalanced.
To avoid the same problem on the CDP cluster, we investigated in advance whether the capacity scheduler also allocates containers in batches. The investigation turned up some unexpected behavior, which still needs follow-up.
2. Batch allocation in CDH
As mentioned above, from CDH 5.8 (Hadoop 2.6.0) onward the fair scheduler provides the following settings, which speed up container allocation, particularly for small tasks.
Configuration name | Description |
---|---|
yarn.scheduler.fair.max.assign | Maximum assignments: if assignmultiple is true and dynamic.max.assign is false, the maximum number of containers that can be allocated in one heartbeat. |
yarn.scheduler.fair.assignmultiple | Assign multiple: whether multiple containers may be allocated in one heartbeat. |
yarn.scheduler.fair.dynamic.max.assign | If assignmultiple is true, whether to determine dynamically how many resources can be allocated in one heartbeat. When enabled, about half of the unallocated resources on the node are allocated to containers in a single heartbeat. Defaults to true. |
With sensible values for these settings, we can use batch allocation without widening the load difference between nodes.
How to configure this on a CDH cluster is not covered here.
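To make the interaction between the three fair scheduler settings more concrete, here is a minimal Python sketch of how many containers a node might receive in one heartbeat. It is a simplified model, not the scheduler's actual code: it only considers memory, assumes every container is the same size, and the function name and parameters are hypothetical. The exact stopping rule in the real scheduler may differ slightly.

```python
def fair_containers_per_heartbeat(free_mb, container_mb,
                                  assign_multiple=True,
                                  dynamic_max_assign=True,
                                  max_assign=-1):
    """Simplified model of fair scheduler batch allocation for one heartbeat.

    free_mb:       unallocated memory on the node (assumption: memory only)
    container_mb:  size of each requested container (assumption: all equal)
    The remaining arguments mirror yarn.scheduler.fair.assignmultiple,
    yarn.scheduler.fair.dynamic.max.assign and yarn.scheduler.fair.max.assign.
    """
    half_of_free_mb = free_mb // 2
    assigned = 0
    used_mb = 0
    while used_mb + container_mb <= free_mb:
        assigned += 1
        used_mb += container_mb
        if not assign_multiple:
            break  # only one container per heartbeat
        if dynamic_max_assign:
            # Stop once roughly half of the free resources are handed out.
            if used_mb >= half_of_free_mb:
                break
        elif max_assign > 0 and assigned >= max_assign:
            break  # fixed cap reached; a non-positive cap means no limit
    return assigned


# A node with 16 GB free and 1 GB containers:
dynamic = fair_containers_per_heartbeat(16384, 1024)
capped = fair_containers_per_heartbeat(16384, 1024,
                                       dynamic_max_assign=False, max_assign=3)
unlimited = fair_containers_per_heartbeat(16384, 1024,
                                          dynamic_max_assign=False)
print(dynamic, capped, unlimited)
```

In this model, turning off dynamic.max.assign without setting max.assign lets a single heartbeat fill the whole node, which is the kind of concentration described in the preface.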
3. Batch allocation in CDP
The CDP cluster uses the capacity scheduler as its default scheduler. According to the official Apache Hadoop documentation and the Cloudera documentation, the capacity scheduler can also allocate multiple containers in a single NodeManager heartbeat. The relevant settings are as follows:
Configuration name | Description |
---|---|
yarn.scheduler.capacity.per-node-heartbeat.multiple-assignments-enabled | Whether multiple containers may be allocated in one NodeManager heartbeat. Defaults to true. |
yarn.scheduler.capacity.per-node-heartbeat.maximum-container-assignments | If multiple-assignments-enabled is true, the maximum number of containers that can be allocated in one NodeManager heartbeat. The default is -1, which means no limit. |
yarn.scheduler.capacity.per-node-heartbeat.maximum-offswitch-assignments | If multiple-assignments-enabled is true, the maximum number of off-switch containers (containers placed outside the requested node and rack) that can be allocated in one NodeManager heartbeat. The default is 1, meaning only one off-switch container may be assigned per heartbeat. |
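The following Python sketch illustrates how these three limits might combine within one heartbeat. It is a simplified model under stated assumptions, not the capacity scheduler's implementation: resources and reservations are ignored, the pending requests are given as a plain list of locality labels in the order they would be served, and the function name is hypothetical.

```python
def capacity_containers_per_heartbeat(pending_localities,
                                      multiple_assignments_enabled=True,
                                      max_container_assignments=-1,
                                      max_offswitch_assignments=1):
    """Count how many pending requests are served in one heartbeat.

    pending_localities: list of "node-local", "rack-local" or "off-switch"
                        (assumption: every request fits on the node).
    The keyword arguments mirror the three
    yarn.scheduler.capacity.per-node-heartbeat.* settings.
    """
    assigned = 0
    offswitch = 0
    for locality in pending_localities:
        assigned += 1
        if locality == "off-switch":
            offswitch += 1
        if not multiple_assignments_enabled:
            break  # one container per heartbeat
        if offswitch >= max_offswitch_assignments:
            break  # off-switch budget for this heartbeat is used up
        if max_container_assignments != -1 and assigned >= max_container_assignments:
            break  # overall per-heartbeat cap reached
    return assigned


local_only = ["node-local"] * 5
mixed = ["node-local", "off-switch", "node-local"]
print(capacity_containers_per_heartbeat(local_only))
print(capacity_containers_per_heartbeat(mixed))
print(capacity_containers_per_heartbeat(local_only, max_container_assignments=3))
```

In this model, an unlimited maximum-container-assignments lets one node absorb every node-local request in a single heartbeat, while the off-switch limit ends the batch as soon as it is reached.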
How to configure
- In Cloudera Manager, select Clusters > YARN Queue Manager UI service.
- In the YARN Queue Manager window, click the Scheduler Configuration tab.
- In the Scheduler Configuration window, select the Enable Multiple Assignments Per Heartbeat check box to allow multiple containers to be allocated in one NodeManager heartbeat.
- Configure the following NodeManager heartbeat properties:
  - Maximum Container Assignments Per Heartbeat: the maximum number of containers that can be allocated in one NodeManager heartbeat. Setting this value to -1 disables the limit.
  - Maximum Off-Switch Assignments Per Heartbeat: the maximum number of off-switch containers that can be allocated in one NodeManager heartbeat.
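For comparison, on a plain Apache Hadoop cluster the same properties are normally set in capacity-scheduler.xml. The sketch below renders such a fragment from a Python dictionary. The values are illustrative assumptions only (in particular the limit of 10 is not a recommendation), the helper name is hypothetical, and this post does not verify whether a CDP deployment honors a hand-edited file, so the Queue Manager UI remains the documented path there.

```python
from xml.sax.saxutils import escape

# Illustrative values only; choose limits that suit your own cluster.
HEARTBEAT_SETTINGS = {
    "yarn.scheduler.capacity.per-node-heartbeat.multiple-assignments-enabled": "true",
    "yarn.scheduler.capacity.per-node-heartbeat.maximum-container-assignments": "10",
    "yarn.scheduler.capacity.per-node-heartbeat.maximum-offswitch-assignments": "1",
}


def render_properties(settings):
    """Render a dict of settings as a Hadoop-style <configuration> fragment."""
    lines = ["<configuration>"]
    for name, value in settings.items():
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(str(value))}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines)


fragment = render_properties(HEARTBEAT_SETTINGS)
print(fragment)
```

Keeping the settings in one dictionary makes it easy to compare the values you intended with whatever the scheduler reports after a change.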
4. Summary
- The capacity scheduler has heartbeat batch allocation settings similar to those of the fair scheduler.
- In the CDP cluster, the capacity scheduler enables batch allocation by default, with a limit of 100 containers per heartbeat. This value needs to be reduced.
- In our tests so far, the configuration did not seem to take effect. This needs follow-up, ideally with input from someone more experienced.
References
https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/yarn-allocate-resources/topics/yarn-set-user-limits.html
https://docs.cloudera.com/cdp-private-cloud-base/7.1.7/yarn-allocate-resources/topics/yarn-configure-nm-heartbeat.html
https://hadoop.apache.org/docs/r3.1.1/hadoop-yarn/hadoop-yarn-site/CapacityScheduler.html#Reviewing_the_configuration_of_the_CapacityScheduler