当前位置:网站首页>Yarn restart applications record recovery
Yarn restart applications record recovery
2022-07-01 13:12:00 【fanxl12】
Yarn restart applications Record recovery
Yarn restart applications Record recovery
modify yarn-core.xml The configuration file
ResourceManager Restart recovery
take yarn-site.xml Medium
yarn.resourcemanager.recovery.enabledConfiguration item set to true( The default is false)<property> <name>yarn.resourcemanager.recovery.enabled</name> <value>true</value> </property>To configure
yarn.resourcemanager.store.classParameters , This parameter is used to specify RM On which storage medium to save your state before restarting , There are 3 Storage optionsorg.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore
The default value is , File system based storage ( Local storage or HDFS). You can specify yarn.resourcemanager.fs.state-store.uri As a storage path , If you specify this yarn.resourcemanager.fs.state-store.uri You have to set .org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore
be based on ZooKeeper The storage , When you enable RM High availability , You can only choose this way . Because the two one. RM Can be active ( Think you are the real RM), And then brain crack . be based on ZK Storage of can be isolated (fence) Status data prevent brain cracking . You can specify hadoop.zk.address(ZK Node address list ) and yarn.resourcemanager.zk-state-store.parent-path( The root node path of the state data ) Parameters .org.apache.hadoop.yarn.server.resourcemanager.recovery.LeveldbRMStateStore
be based on LevelDB The storage . It is more lightweight than the first two , Occupied storage space and I/O Much smaller , And support better atomic operations . When there are extreme requirements for performance . You can specify yarn.resourcemanager.leveldb-state-store.path As a storage path .<property> <name>yarn.resourcemanager.store.class</name> <value>org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore</value> </property>
To configure yarn.resourcemanager.fs.state-store.uri, If yarn.resourcemanager.store.class yes org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore Need configuration , Configuration here HDFS Storage
<property> <name>yarn.resourcemanager.fs.state-store.uri</name> <value>hdfs://hadoop-master:9010/rmstore</value> </property>Final configuration yarn.resourcemanager.work-preserving-recovery.scheduling-wait-ms, It means from RM After restart, from each NM Sync Container The waiting time of information , After that, new Container. The default value is 10000(10 second ), Generally, it does not need to be changed .
<property> <name>yarn.resourcemanager.work-preserving-recovery.scheduling-wait-ms</name> <value>10000</value> </property>
To configure NodeManager Restart automatic recovery
take yarn-site.xml Medium
yarn.nodemanager.recovery.enabledConfiguration item set to true( The default is false)<property> <name>yarn.nodemanager.recovery.enabled</name> <value>true</value> </property>To configure
yarn.nodemanager.recovery.dirParameters , Appoint NM Before you restart , take Container The status of is written to this local path . The default value is${hadoop.tmp.dir}/yarn-nm-recovery<property> <name>yarn.nodemanager.recovery.dir</name> <value>/opt/topology/db_data/hadoop-data/yarn-nm-recovery</value> </property>To configure
yarn.nodemanager.addressParameters , The parameter is NM Of RPC Address , The default is${yarn.nodemanager.hostname}:0, That is, temporary ports are used randomly . Be sure to specify a fixed port ( Such as 8041), otherwise NM The port will be changed after restart , Can't recover Container The state of<property> <name>yarn.nodemanager.address</name> <value>hadoop-master:45454</value> </property>
边栏推荐
- 基于mysql乐观锁实现秒杀的示例代码
- There are risks in trading
- 【大型电商项目开发】性能压测-压力测试基本概念&JMeter-38
- There are still many things to be done in the second half of the year
- How to play with the reading and writing operations of blocking sockets?
- Tencent always takes epoll, which is annoying
- Which securities company has a low, safe and reliable account opening commission
- How can genetic testing help patients fight disease?
- leetcode:226. 翻转二叉树【dfs翻转】
- La taille de la pile spécifiée est petite, spécifiée à la sortie 328k
猜你喜欢

Feign & Eureka & Zuul & Hystrix 流程
![[encounter Django] - (II) database configuration](/img/13/9512c1e03349092874055771c3433d.png)
[encounter Django] - (II) database configuration
![leetcode:329. The longest incremental path in the matrix [DFS + cache + no backtracking + elegance]](/img/10/acd162c3adf9d6f14fa5a551dc0d25.png)
leetcode:329. The longest incremental path in the matrix [DFS + cache + no backtracking + elegance]

Huawei HMS core joins hands with hypergraph to inject new momentum into 3D GIS
![79. Word search [DFS + backtracking visit + traversal starting point]](/img/d6/a7693b2af435b7cf4562161ca4bd3f.png)
79. Word search [DFS + backtracking visit + traversal starting point]

Fundamentals of number theory and its code implementation

79. 单词搜索【dfs + 回溯visit + 遍历起点】

【开发大杀器】之Idea

Different test techniques
![leetcode:241. Design priority for operation expression [DFS + Eval]](/img/d0/8dedeba7ecedccd25e0e3e96ff3362.png)
leetcode:241. Design priority for operation expression [DFS + Eval]
随机推荐
shell脚本导入存储过程到数据库
When Sqlalchemy deletes records with foreign key constraints, the foreign key constraints do not work. What is the solution?
Example code of second kill based on MySQL optimistic lock
啟動solr報錯The stack size specified is too small,Specify at least 328k
La taille de la pile spécifiée est petite, spécifiée à la sortie 328k
Wang Xing's infinite game ushers in the "ultimate" battle
Introduction to reverse debugging PE structure input table output table 05/07
题目 2612: 蓝桥杯2021年第十二届省赛真题-最少砝码(枚举找规律+递推)
MySQL Replication中的并行复制示例详解
简单的两个圆球loading加载
Vs code set code auto save
Have you ever encountered the problem that flynk monitors the PostgreSQL database and checkpoints cannot be used
codeforces -- 4B. Before an Exam
Mysql间隙锁
SSO and JWT good article sorting
从数据库中更新一条数据,用cdc会同时获得op字段分别为d和c的两条数据吗?我记得之前是只有op为u
leetcode 322. Coin Change 零钱兑换(中等)
【开发大杀器】之Idea
There are risks in trading
Reasons for MySQL reporting 1040too many connections and Solutions