Azkaban overview
2022-07-05 02:46:00 【A vegetable chicken that is working hard】
What is Azkaban?
1. Definition
- A batch workflow job scheduler
2. Explanation
- It is mainly used to run a set of jobs and processes in a specific order within a workflow. Jobs are configured with simple key:value pairs, and the dependencies entry in that configuration declares which jobs a job depends on
- Azkaban uses job configuration files to establish dependencies between tasks, and provides an easy-to-use web user interface for maintaining and tracking your workflows
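As a minimal sketch of how key-value job files and the dependencies entry fit together, the Python snippet below embeds two hypothetical job definitions and extracts the dependency map from them. The job names (extract, load) and their commands are made up for illustration, and the parser is a simplification written for this example, not Azkaban's own loader.

```python
# Two hypothetical job definitions in key=value form.
# Job names and commands are illustrative assumptions.
JOB_FILES = {
    "extract.job": """
# extract.job
type=command
command=echo "extract data"
""",
    "load.job": """
# load.job
type=command
command=echo "load data"
dependencies=extract
""",
}


def parse_job(text):
    """Parse key=value lines, skipping blank lines and '#' comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first '=' only, so values may contain '='.
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def job_dependencies(job_files):
    """Map each job name to the list of jobs it depends on."""
    deps = {}
    for filename, text in job_files.items():
        name = filename.rsplit(".", 1)[0]  # drop the ".job" extension
        raw = parse_job(text).get("dependencies", "")
        deps[name] = [d.strip() for d in raw.split(",") if d.strip()]
    return deps


DEPENDENCIES = job_dependencies(JOB_FILES)
print(DEPENDENCIES)  # {'extract': [], 'load': ['extract']}
```

Here load can only start after extract has finished, which is exactly the relationship the dependencies entry expresses.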
Why do we need a workflow scheduling system?
1. Resolving dependencies between task units
- A complete data analysis system usually consists of a large number of task units (shell scripts, Java programs, MapReduce programs, Hive scripts, etc.)
- These task units have ordering constraints and upstream/downstream dependencies on one another
- To organize such a complex execution plan well, a workflow scheduling system is needed to schedule execution

2. Scheduled execution
- Running the whole process by hand requires someone to take part and keep an eye on the progress of each task. But many of our tasks run in the middle of the night, so we write a script and schedule it with crontab
- In fact, the whole process resembles a directed acyclic graph (DAG)
- Each subtask is a node in the larger task. What we need, then, is a workflow scheduler, and Azkaban is a scheduler that can solve the problems above
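To make the DAG view concrete, here is a small sketch that derives a valid run order from task dependencies using a topological sort. It relies on Python's standard graphlib module (Python 3.9+); the task names are hypothetical, and this illustrates the general technique rather than Azkaban's internal scheduling code.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical task graph: each task maps to the set of tasks it must wait for.
TASKS = {
    "shell_ingest": set(),
    "mapreduce_clean": {"shell_ingest"},
    "hive_aggregate": {"mapreduce_clean"},
    "java_report": {"hive_aggregate"},
}


def execution_order(graph):
    """Return one valid run order; raises graphlib.CycleError if the graph has a cycle."""
    return list(TopologicalSorter(graph).static_order())


ORDER = execution_order(TASKS)
print(ORDER)
# ['shell_ingest', 'mapreduce_clean', 'hive_aggregate', 'java_report']
```

A plain crontab entry only knows when to start a script. The dependency graph is what tells a scheduler that a task must not start until everything it depends on has finished.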
Azkaban features
1. Compatible with any version of Hadoop
2. Easy-to-use web user interface that is simple and convenient to operate
3. Modular and pluggable plugin mechanism
4. Authentication and authorization (permission management)
5. Ability to kill and restart workflows
6. Email alerts on failure and success
Common workflow scheduling systems
1. Simple task scheduling
- Implemented directly with crontab
2. Complex task scheduling
- Develop your own scheduling platform or use an off-the-shelf open source scheduling system, such as Oozie or Azkaban
Oozie and Azkaban feature comparison

Azkaban architecture
1. Architecture diagram

2. Explanation
- AzkabanWebServer: the main manager of the whole Azkaban workflow system. It handles user login authentication, project management, scheduled execution of workflows, and tracking of workflow execution progress
- AzkabanExecutorServer: responsible for the actual submission and execution of workflows. They use a MySQL database to coordinate task execution
- Relational database (MySQL): stores most of the execution flow state. Both AzkabanWebServer and AzkabanExecutorServer need to access the database
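The pattern of two components coordinating through shared state in a database can be sketched as a toy model. Everything here is an assumption made for illustration: an in-memory SQLite database stands in for MySQL, and the executions table, its columns, the status values, and the function names are invented and do not reflect Azkaban's real schema or code.

```python
import sqlite3

# In-memory SQLite stands in for MySQL; the table and column names are
# invented for this sketch and are not Azkaban's real schema.
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE executions ("
    "id INTEGER PRIMARY KEY, flow TEXT, status TEXT)"
)


def web_server_submit(conn, flow):
    """Web-server role: record a flow execution request as PENDING."""
    cur = conn.execute(
        "INSERT INTO executions (flow, status) VALUES (?, 'PENDING')", (flow,)
    )
    conn.commit()
    return cur.lastrowid


def executor_run_next(conn):
    """Executor role: pick the oldest PENDING execution and run it."""
    row = conn.execute(
        "SELECT id, flow FROM executions WHERE status = 'PENDING' "
        "ORDER BY id LIMIT 1"
    ).fetchone()
    if row is None:
        return None  # nothing to do
    exec_id, _flow = row
    conn.execute(
        "UPDATE executions SET status = 'RUNNING' WHERE id = ?", (exec_id,)
    )
    # ... the actual work of the flow would happen here ...
    conn.execute(
        "UPDATE executions SET status = 'SUCCEEDED' WHERE id = ?", (exec_id,)
    )
    conn.commit()
    return exec_id


def web_server_status(conn, exec_id):
    """Web-server role: read execution state back, e.g. for display in the UI."""
    row = conn.execute(
        "SELECT status FROM executions WHERE id = ?", (exec_id,)
    ).fetchone()
    return row[0] if row else None


EXEC_ID = web_server_submit(db, "daily_report")
STATUS_BEFORE = web_server_status(db, EXEC_ID)
executor_run_next(db)
STATUS_AFTER = web_server_status(db, EXEC_ID)
print(STATUS_BEFORE, "->", STATUS_AFTER)  # PENDING -> SUCCEEDED
```

Because both roles read and write the same table, neither needs to hold the other's state in memory, which is the point of keeping execution flow state in the relational database.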