Distributed resource management and task scheduling framework: YARN
2022-07-05 22:22:00 【m0_62295009】
In Hadoop 1.x, MapReduce uses a Master/Slave architecture: one JobTracker manages multiple TaskTrackers. This design is known as MRv1.
Main functions of the JobTracker
Resource management
Task scheduling
Main functions of the TaskTracker
Execute tasks and respond to JobTracker commands
Report heartbeats to the JobTracker
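The heartbeat relationship described above can be sketched as follows. This is a minimal illustration, not Hadoop code: the class name `ToyJobTracker`, its methods, and the 10-second timeout are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ToyJobTracker:
    """Toy master that records the last heartbeat of each worker (illustrative only)."""

    timeout: float = 10.0  # assumed timeout in seconds, not a Hadoop default
    last_seen: dict = field(default_factory=dict)  # tracker id -> last heartbeat time

    def heartbeat(self, tracker_id: str, now: float) -> None:
        # A TaskTracker calls this periodically to say it is still alive.
        self.last_seen[tracker_id] = now

    def live_trackers(self, now: float) -> list:
        # Trackers whose last heartbeat is within the timeout window.
        return sorted(t for t, seen in self.last_seen.items() if now - seen <= self.timeout)

    def lost_trackers(self, now: float) -> list:
        # Trackers that have been silent for longer than the timeout.
        return sorted(t for t, seen in self.last_seen.items() if now - seen > self.timeout)
```

Because a single master holds all of this state, the sketch also hints at why losing the JobTracker stops the whole system.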
Main problems of MRv1
The JobTracker is a single point of failure: if it goes down, the whole system stops working
The JobTracker carries too much load, since it handles both resource management and task scheduling
Only the MapReduce (MR) computing framework is supported, which suits disk-based batch processing
Resources and computation are not well decoupled, so a cluster can use only one computing framework
YARN features
Resource management is decoupled from the computing framework, so the resources of one cluster are shared among the computing frameworks running on top of it
Data is shared consistently within the cluster and no longer needs to be copied and transferred between clusters, which enables sharing and interoperability
The single point of failure is avoided, and cluster resource expansion is handled reasonably
YARN operation flow
ResourceManager (resource management):
ResourceScheduler (resource scheduling)
ApplicationsManager (application management)
NodeManager: resource management on each node
ApplicationMaster: management of one application's tasks
Task: where the job's work actually runs (inside a Container)

(1) The Client submits the job to the ResourceManager, including the ApplicationMaster program and the command to start the ApplicationMaster.
(2) The ResourceManager allocates the first Container and communicates with the corresponding NodeManager, asking it to start the job's ApplicationMaster in that Container.
(3) The NodeManager starts a Container and runs the ApplicationMaster in it.
(4) The ApplicationMaster first registers with the ResourceManager, so that users can query the running status of the job through the ResourceManager. It then requests resources for its tasks from the ResourceManager and monitors the running status of those tasks.
(5) Once the ApplicationMaster obtains the resources, it communicates with the corresponding NodeManagers to start the tasks.
(6) Each NodeManager receives the ApplicationMaster's command and starts the task in a Container.
(7) Each Container reports task status and progress to the ApplicationMaster over RPC, so the ApplicationMaster can restart a task when it fails.
(8) After the job completes, the ApplicationMaster unregisters from the ResourceManager and shuts itself down.
The ResourceManager monitors the NodeManagers and the ApplicationMasters.
Each NodeManager periodically reports its resource usage and running status to the ResourceManager.
The ApplicationMaster monitors its tasks and can ask a NodeManager to restart a task.
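The submission flow above can be walked through with a small in-memory simulation. This is only a sketch under stated assumptions: the classes below are toy stand-ins that borrow the YARN component names, not the Hadoop API. The round-robin placement, the container id format, and the "RUNNING"/"FINISHED" status strings are invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class NodeManager:
    """Toy node that starts containers on request."""

    name: str
    containers: list = field(default_factory=list)  # (container id, what runs in it)

    def start_container(self, what: str) -> str:
        container_id = f"{self.name}/container_{len(self.containers)}"
        self.containers.append((container_id, what))
        return container_id


@dataclass
class ApplicationMaster:
    """Toy per-application master: registers, requests containers, unregisters."""

    app_id: str
    tasks: list
    rm: "ResourceManager"
    log: list = field(default_factory=list)

    def run(self) -> None:
        self.rm.registered[self.app_id] = "RUNNING"   # register with the ResourceManager
        for task in self.tasks:
            node = self.rm.allocate()                  # request resources for one task
            container_id = node.start_container(task)  # ask that NodeManager to start it
            self.log.append(f"{task} -> {container_id}")
        self.rm.registered[self.app_id] = "FINISHED"  # unregister when the job is done


@dataclass
class ResourceManager:
    """Toy cluster master that hands out containers."""

    nodes: list
    registered: dict = field(default_factory=dict)  # app id -> status string
    _next: int = 0

    def allocate(self) -> NodeManager:
        # Round-robin placement is an assumption made for this sketch only.
        node = self.nodes[self._next % len(self.nodes)]
        self._next += 1
        return node

    def submit(self, app_id: str, tasks: list) -> ApplicationMaster:
        node = self.allocate()                  # first Container for the application
        node.start_container(f"AM:{app_id}")    # NodeManager starts the ApplicationMaster
        am = ApplicationMaster(app_id, tasks, self)
        am.run()
        return am
```

Calling `ResourceManager([NodeManager("nm1"), NodeManager("nm2")]).submit("app_1", ["map_0", "map_1", "reduce_0"])` places the ApplicationMaster in the first container and the tasks in later ones. Failure handling and progress reporting over RPC are left out to keep the sketch short.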
An application needs two main kinds of Container:
(1) The Container that runs the ApplicationMaster: the ResourceManager requests it (from its internal resource scheduler) and starts it. When submitting an application, the user can specify the resources that its single ApplicationMaster needs;
(2) The Containers that run the various tasks: the ApplicationMaster requests them from the ResourceManager, and the ApplicationMaster communicates with the NodeManager to start them.
Both kinds of Container may be placed on any node, and their locations are generally random, so the ApplicationMaster may run on the same node as the tasks it manages.
Scheduling policies
FIFO Scheduler: first in, first out
Capacity Scheduler: each queue has its own dedicated lane of capacity, which others do not take over
Fair Scheduler: others may use your idle resources, but when you need them yourself they are taken back, and the jobs that borrowed them lose some of their work
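To make the difference between FIFO and fair allocation concrete, here is a minimal sketch that divides a fixed number of containers among applications. It is not the logic of the real YARN schedulers: the function names are hypothetical, and "fair" is modeled as simple max-min sharing, which is an assumption of this example.

```python
def fifo_allocate(capacity: int, demands: list) -> dict:
    """Grant containers strictly in submission order until capacity runs out.

    demands is a list of (app name, requested containers) in submission order.
    """
    alloc = {}
    for app, want in demands:
        granted = min(want, capacity)
        alloc[app] = granted
        capacity -= granted
    return alloc


def fair_allocate(capacity: int, demands: list) -> dict:
    """Split capacity evenly, handing unused share back to apps that still want more."""
    alloc = {app: 0 for app, _ in demands}
    remaining = dict(demands)  # app -> containers still wanted
    while capacity > 0 and remaining:
        share = max(capacity // len(remaining), 1)  # equal share for this round
        for app in list(remaining):
            if capacity == 0:
                break
            granted = min(share, remaining[app], capacity)
            alloc[app] += granted
            remaining[app] -= granted
            capacity -= granted
            if remaining[app] == 0:
                del remaining[app]  # satisfied apps drop out of later rounds
    return alloc
```

With 10 containers and demands of 8, 4, and 2 for applications a, b, and c, the FIFO version gives a everything it asked for and leaves c with nothing, while the fair version gives a and b 4 each and c its full 2.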
YARN shell commands
View version information: yarn version
Submit a jar package with yarn:
yarn jar jarName mainClassPath -Dk1=v1 -Dk2=v2 inputPath outputPath
List all applications: yarn application -list
Kill a specific application: yarn application -kill app-id
View current YARN resource usage: yarn top
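When these commands are driven from a script, it can help to assemble the argument list in one place. The helper below is a small sketch, assuming the `yarn` binary is on the PATH. The function names are hypothetical, and the jar, class, and path values in the usage line are placeholders.

```python
import shlex


def yarn_jar_command(jar, main_class, input_path, output_path, props=None):
    """Build the argv for `yarn jar`, placing -Dkey=value pairs before the paths."""
    argv = ["yarn", "jar", jar, main_class]
    for key, value in (props or {}).items():
        argv.append(f"-D{key}={value}")
    argv += [input_path, output_path]
    return argv


def yarn_kill_command(app_id):
    """Build the argv for killing one application by id."""
    return ["yarn", "application", "-kill", app_id]


# Placeholder values; print the command line instead of executing it.
print(shlex.join(yarn_jar_command("jarName", "mainClassPath", "inputPath", "outputPath",
                                  {"k1": "v1", "k2": "v2"})))
```

The returned list can be passed to `subprocess.run(argv, check=True)` on a machine where YARN is installed. Passing a list rather than a single string avoids shell quoting problems.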