当前位置:网站首页>Azkaban overview
Azkaban overview
2022-07-05 02:46:00 【A vegetable chicken that is working hard】
What is? azkaban
1. The term
- Batch workflow task scheduler
2. explain
- It is mainly used to run a group of work and processes in a specific order within a workflow , It's configured through simple key:value Right way , Through the configuration of dependencies To set up dependencies
- Azkaban Use job Configuration files establish dependencies between tasks , And provide an easy to use web User interface to maintain and track your workflow
Why do we need a workflow scheduling system
1. Solve the dependency between task units
- A complete data analysis system is usually composed of a large number of task units (shell Script program ,java Program ,mapreduce Program 、hive Script etc. )
- There are time sequence and pre and post dependence relationships among task units
- In order to well organize such a complex implementation plan , A workflow scheduling system is needed to schedule execution

2. Timing scheduler
- The whole execution process needs manual participation , And keep an eye on the progress of each task . But a lot of our tasks are carried out in the middle of the night , Set it up by writing a script crontab perform
- Actually , The whole process is similar to a directed acyclic graph (DAG)
- Each subtask is equivalent to a node in a large task , That is to say , What we need is a workflow scheduler , and Azkaban It is a scheduler that can solve the above problems
Azkaban characteristic
1. Compatible with any version of hadoop
2. Easy to use Web The user interface , Convenient and simple foolproof operation
3. Modular and pluggable plug-in mechanism
4. authentication / to grant authorization ( The work of authority )
5. Ability to kill and restart workflow
6. Email reminders about failure and success
Common workflow scheduling system
1. Simple task scheduling
- Use it directly crontab Realization
2. Complex task scheduling
- Develop a scheduling platform or use an off the shelf open source scheduling system , such as ooize、azkaban etc.
Ooize and Azkaban Feature comparison

Azkaban The architecture of
1. Architecture diagram

2. explain
- AzkabanWebServer:AzkabanWebServer As a whole Azkaban The main of workflow system managers , It's user login authentication 、 be responsible for project management 、 Scheduled execution of workflow 、 A series of tasks such as tracking the progress of workflow execution
- AzkabanExecutorServer: Be responsible for specific Submission of Workflow 、 perform , They use MySQL Database to coordinate task execution
- Relational database (MySQL): Store most of the execution flow state ,AzkabanWebServer and AzkabanExecutorServer You need to access the database
边栏推荐
- The most powerful new household god card of Bank of communications. Apply to earn 2100 yuan. Hurry up if you haven't applied!
- CAM Pytorch
- [technology development-26]: data security of new information and communication networks
- 数据库和充值都没有了
- Good documentation
- Last words record
- College Students' innovation project management system
- Hmi-31- [motion mode] solve the problem of picture display of music module
- 单项框 复选框
- 2.常见的请求方法
猜你喜欢

丸子百度小程序详细配置教程,审核通过。

Watch the online press conference of tdengine community heroes and listen to TD hero talk about the legend of developers

2021 Li Hongyi machine learning (3): what if neural network training fails

Apache Web page security optimization

Single box check box

Azkaban概述

Design of kindergarten real-time monitoring and control system

this+闭包+作用域 面试题

Azkaban安装部署

Day_ 17 IO stream file class
随机推荐
Qrcode: generate QR code from text
Breaking the information cocoon - my method of actively obtaining information - 3
腾讯云,实现图片上传
The most powerful new household god card of Bank of communications. Apply to earn 2100 yuan. Hurry up if you haven't applied!
2021 Li Hongyi machine learning (3): what if neural network training fails
Erreur de type de datagramme MySQL en utilisant Druid
端口,域名,协议。
丸子百度小程序详细配置教程,审核通过。
Character painting, I use characters to draw a Bing Dwen Dwen
Azkaban概述
Hmi-31- [motion mode] solve the problem of picture display of music module
Flume配置4——自定义MYSQLSource
Idea inheritance relationship
Tencent cloud, realize image upload
Spoon inserts and updates the Oracle database, and some prompts are inserted with errors. Assertion botch: negative time
Problem solving: attributeerror: 'nonetype' object has no attribute 'append‘
看 TDengine 社区英雄线上发布会,听 TD Hero 聊开发者传奇故事
数据库和充值都没有了
Sqoop命令
Why is this an undefined behavior- Why is this an undefined behavior?