Azkaban overview
2022-07-05 02:46:00 【A vegetable chicken that is working hard】
What is Azkaban
1. Definition
- A batch workflow task scheduler
2. Explanation
- It is mainly used to run a group of jobs and processes in a specific order within a workflow. Jobs are configured as simple key:value pairs, and the dependencies setting in the configuration defines the dependencies between them
- Azkaban uses job configuration files to establish dependencies between tasks, and provides an easy-to-use web user interface for maintaining and tracking your workflows
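As a rough illustration of the key:value job configuration described above, the sketch below writes two job definitions and packs them into a zip archive, which is the form in which a project is usually uploaded through the web interface. The job names (`load`, `report`), the echo commands, and the helper functions are made up for this example; `type`, `command`, and `dependencies` are the property keys a basic command job uses.

```python
import tempfile
import zipfile
from pathlib import Path

# Hypothetical two-job flow: "load" must finish before "report" starts.
JOBS = {
    "load": {"type": "command", "command": "echo loading data"},
    "report": {
        "type": "command",
        "command": "echo building report",
        "dependencies": "load",  # comma-separated names of upstream jobs
    },
}


def render_job(props):
    """Render one job as key=value lines, the layout of a .job file."""
    return "\n".join(f"{key}={value}" for key, value in props.items()) + "\n"


def build_flow_zip(jobs, out_dir):
    """Write one <name>.job entry per job into flow.zip and return its path."""
    archive = Path(out_dir) / "flow.zip"
    with zipfile.ZipFile(archive, "w") as zf:
        for name, props in jobs.items():
            zf.writestr(f"{name}.job", render_job(props))
    return archive


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        archive = build_flow_zip(JOBS, tmp)
        with zipfile.ZipFile(archive) as zf:
            print(zf.namelist())
            print(zf.read("report.job").decode())
```

Here the `dependencies=load` line in `report.job` is the only thing that links the two jobs; the scheduler derives the execution order from it.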
Why do we need a workflow scheduling system
1. Resolving dependencies between task units
- A complete data analysis system usually consists of a large number of task units (shell scripts, Java programs, MapReduce programs, Hive scripts, etc.)
- The task units have a time order and upstream/downstream dependencies among them
- To organize and run such a complex execution plan well, a workflow scheduling system is needed

2. Scheduled execution
- Without a scheduler, the whole execution process needs manual involvement, and someone has to keep an eye on the progress of each task. Many of our tasks, however, run in the middle of the night, so we write scripts and set up crontab to execute them
- In fact, the whole process resembles a directed acyclic graph (DAG)
- Each subtask is equivalent to a node in one large task. In other words, what we need is a workflow scheduler, and Azkaban is a scheduler that can solve the problems above
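To make the DAG view concrete, here is a minimal sketch that orders a handful of task units by their dependencies using the standard-library `graphlib` module (Python 3.9+). The task names are hypothetical, and this only illustrates the ordering idea, not how Azkaban itself is implemented.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical task units; each key maps to the set of tasks it depends on.
DEPENDENCIES = {
    "import_logs": set(),
    "clean_logs": {"import_logs"},
    "hive_aggregate": {"clean_logs"},
    "mapreduce_join": {"clean_logs"},
    "export_report": {"hive_aggregate", "mapreduce_join"},
}


def execution_order(deps):
    """Return the tasks in an order that runs every dependency first.

    Raises CycleError if the graph contains a cycle, i.e. is not a DAG.
    """
    return list(TopologicalSorter(deps).static_order())


if __name__ == "__main__":
    print(execution_order(DEPENDENCIES))
    try:
        execution_order({"a": {"b"}, "b": {"a"}})
    except CycleError:
        print("cyclic dependencies cannot be scheduled")
```

`hive_aggregate` and `mapreduce_join` depend only on `clean_logs`, so a scheduler is free to run them in either order or in parallel; a plain crontab entry has no way to express that relationship.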
Azkaban features
1. Compatible with any version of Hadoop
2. Easy-to-use web user interface; simple, foolproof operation
3. Modular, pluggable plugin mechanism
4. Authentication and authorization (permission management)
5. Ability to kill and restart workflows
6. Email alerts on failure and success
Common workflow scheduling systems
1. Simple task scheduling
- Implement it directly with crontab
2. Complex task scheduling
- Develop your own scheduling platform or use an off-the-shelf open-source scheduling system such as Oozie or Azkaban
Oozie and Azkaban feature comparison

Azkaban architecture
1. Architecture diagram

2. Explanation
- AzkabanWebServer: the main manager of the whole Azkaban workflow system. It handles user login authentication, project management, scheduled execution of workflows, tracking of workflow execution progress, and related tasks
- AzkabanExecutorServer: responsible for the actual submission and execution of workflows; the servers coordinate task execution through a MySQL database
- Relational database (MySQL): stores most of the execution flow state; both AzkabanWebServer and AzkabanExecutorServer need to access the database
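The idea of two servers coordinating through a shared database can be sketched as a toy model. Everything here is an assumption made for illustration: an in-memory SQLite database stands in for MySQL, and the `executions` table, its columns, the status values, and the function names are invented and do not reflect Azkaban's real schema or protocol.

```python
import sqlite3


def open_store():
    """Create the shared store (in-memory SQLite standing in for MySQL)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE executions ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "flow TEXT NOT NULL, "
        "status TEXT NOT NULL)"
    )
    return conn


def submit_flow(conn, flow):
    """Web-server role: record a flow that is ready to run."""
    cur = conn.execute(
        "INSERT INTO executions (flow, status) VALUES (?, 'READY')", (flow,)
    )
    conn.commit()
    return cur.lastrowid


def run_next(conn):
    """Executor role: take the oldest READY flow and mark it finished."""
    row = conn.execute(
        "SELECT id, flow FROM executions "
        "WHERE status = 'READY' ORDER BY id LIMIT 1"
    ).fetchone()
    if row is None:
        return None
    exec_id, flow = row
    conn.execute("UPDATE executions SET status = 'RUNNING' WHERE id = ?", (exec_id,))
    # A real executor would launch the flow's jobs at this point.
    conn.execute("UPDATE executions SET status = 'SUCCEEDED' WHERE id = ?", (exec_id,))
    conn.commit()
    return flow


def status_of(conn, exec_id):
    """Web-server role: read execution progress back from the shared store."""
    row = conn.execute(
        "SELECT status FROM executions WHERE id = ?", (exec_id,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    store = open_store()
    exec_id = submit_flow(store, "daily_report")
    print(status_of(store, exec_id))
    print(run_next(store))
    print(status_of(store, exec_id))
```

The point of the sketch is only that neither role calls the other directly: both read and write execution state in the database, which is why both servers need access to it.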