当前位置:网站首页>Deploy dolphin scheduler high availability cluster based on rainbow
Deploy dolphin scheduler high availability cluster based on rainbow
2022-07-27 22:11:00 【Rainbow open source】
In this paper, we describe the process by Rainbond Cloud native application management platform One click deployment highly available DolphinScheduler colony , This way is suitable for people who don't know much about Kubernetes、 Users of complex technologies such as containerization , Reduced in Kubernetes Deployment in China DolphinScheduler Threshold .
Apache DolphinScheduler It's a distributed and extensible visualization DAG Workflow task scheduling open source system . Solve data research and development ETL Intricate dependencies , Can't directly monitor task health status and other issues .DolphinScheduler With DAG The streaming approach will Task Assemble , It can monitor the running status of tasks in real time , At the same time, it supports retry 、 Recovery from the specified node failed 、 Suspension and Kill Tasks, etc
Simple and easy to use :DAG Monitoring interface , All process definitions are visual , Customize by dragging tasks DAG, adopt API Mode docking with the third party system , One key deployment
high reliability : A lot of decentralization Master And many Worker, Self support HA function , Use task queue to avoid overload , It will not cause the machine to get stuck
Rich use scenarios : Support pause resume operation . Multi tenant support , Better response to big data usage scenarios . Support more task types , Such as spark, hive, mr, python, sub_process, shell
High scalability : Support custom task type , Scheduler uses distributed scheduling , Scheduling capacity grows linearly with cluster ,Master and Worker Support dynamic online and offline
Prerequisite
- Usable Rainbond Cloud native application management platform , See documentation Rainbond Fast installation
DolphinScheduler Cluster one click deployment
- Dock and visit the built-in open source app store , Search keywords
dolpYou can find it DolphinScheduler application .
- Click on DolphinScheduler On the right side of the
installGo to the installation page , Fill in the corresponding information , Click OK to start the installation , Automatically jump to the application view .
Options | explain |
|---|---|
Team name | User built workspace , Isolate by namespace |
Cluster name | choice DolphinScheduler To which one K8s colony |
Select application | choice DolphinScheduler To which application is deployed , The application contains several related components |
Application version | choice DolphinScheduler Version of , Currently, the optional version is 3.0.0-beta2 |
- Wait a few minutes ,DolphinScheduler The cluster will be installed , And run it .
- Click on the access , Will visit DolphinScheduler-API Components , The default user password is
admin/dolphinscheduler123
API Master Worker Node scaling
DolphinScheduler API、Master、Worker Both support scaling multiple instances , Multiple instances can ensure the high availability of the entire cluster .
With Worker For example , Enter the assembly -> Telescopic , Set the number of instances .
verification Worker node , Get into DolphinScheduler UI -> The monitoring center -> Worker View node information .
The configuration file
API and Worker Service sharing /opt/dolphinscheduler/conf/common.properties , When modifying the configuration, you only need to modify API The configuration file for the service .
How to support Python 3?
Worker The service is installed by default Python3, You can add environment variables when using PYTHON_HOME=/usr/bin/python3
How to support Hadoop, Spark, DataX etc. ?
With Datax For example :
- Installing a plug-in .Rainbond Team view -> plug-in unit -> Install plug-ins from the store -> Search for
Universal data initialization plug-inAnd install . - Open plug-ins . Get into Worker In component -> plug-in unit -> Opening
Universal data initialization plug-in, And modify the configuration- FILE_URL:http://datax-opensource.oss-cn-hangzhou.aliyuncs.com/datax.tar.gz
- FILE_PATH:/opt/soft
- LOCK_PATH:/opt/soft
- Update components , The initialization plug-in will be downloaded automatically
DataxAnd unpack it/opt/softUnder the table of contents .
边栏推荐
- Excalidraw:很好用的在线、免费「手绘」虚拟白板工具
- 每条你收藏的资讯背后,都离不开TA
- 8000 word explanation of OBSA principle and application practice
- Yyds dry goods inventory # solve the real problem of famous enterprises: cycle number comparison
- Station B collapsed. What did the developer responsible for the repair do that night?
- Live broadcast software app development, uniapp scroll view hidden scroll bar
- 怎么还有人问 MySQL 是如何归档数据的呢?
- After sorting (bubble sorting), learn to continuously update other sorting methods
- What is modcount in the source code? What's the effect
- Exception -exception
猜你喜欢

Pythia: Facebook's latest open source visual and language multitasking learning framework

Small change project (two versions) with detailed ideas

Can JVM tuning be done with single core CPU and 1G memory?

关系型数据库的设计思想,20张图给你看的明明白白

@Can component be used on the same class as @bean?

Log4j 漏洞仍普遍存在,并持续造成影响

What is eplato cast by Plato farm on elephant swap? Why is there a high premium?

【海洋科学】海洋气候指数【Climate Indices】数据集

ApacheSpark-命令执行(CVE-2022-33891) 漏洞复现

STM32 project Sharing -- mqtt intelligent access control system (including app control)
随机推荐
What is modcount in the source code? What's the effect
Yyds dry goods inventory # solve the real problem of famous enterprises: cycle number comparison
Why do server programs need to listen first
枚举和注解
2021-11-05类变量和类方法的理解
Interview question: talk about your understanding of AQS
Excalidraw: an easy-to-use online, free "hand drawn" virtual whiteboard tool
MySQL execution process and order
MySQL执行过程及执行顺序
Software testing interview question: what is the focus of unit testing, integration testing, and system testing?
Software testing interview question: what project documents need to be referred to in designing the system test plan?
[question 23] Sudoku game with rotation | DFS (Beijing Institute of Technology / Beijing Institute of Technology / programming methods and practice / primary school)
How to deal with high concurrency deadlock?
MySQL series - database tables, queries, sorting, and data processing functions
Monitor the running of server jar and restart script
How to realize a good knowledge management system?
Mask automatic update description file (mask description file)
Interview questions that big companies need to prepare
Lvs+kept highly available cluster
2021-11-05 understanding of class variables and class methods