Deploy a DolphinScheduler high-availability cluster on Rainbond
2022-07-27 23:05:00 【Rainbond】
This article describes how to deploy a highly available DolphinScheduler cluster with one click through the Rainbond cloud-native application management platform. This approach suits users who are not familiar with complex technologies such as Kubernetes and containerization, and it lowers the barrier to deploying DolphinScheduler on Kubernetes.
Apache DolphinScheduler is a distributed, extensible, open-source workflow task scheduling system with visual DAGs. It addresses the intricate dependencies between ETL jobs in data development and the inability to monitor task health directly. DolphinScheduler assembles tasks into a DAG in a streaming fashion, monitors their running status in real time, and supports operations such as retry, recovery from a specified failed node, pause, and kill.
- Simple and easy to use: a DAG monitoring interface in which all process definitions are visual; DAGs are customized by dragging tasks; third-party systems integrate through the API; deployment takes one click.
- High reliability: a decentralized architecture with multiple Masters and multiple Workers and built-in HA support; task queues prevent overload, so machines do not hang.
- Rich usage scenarios: supports pause and resume operations and multi-tenancy to better handle big data scenarios, along with many task types such as spark, hive, mr, python, sub_process, and shell.
- High scalability: supports custom task types; the scheduler is distributed, so scheduling capacity grows linearly with the cluster, and Masters and Workers can be brought online and offline dynamically.
Prerequisites
- An available Rainbond cloud-native application management platform; see the Rainbond quick installation documentation.
One-click deployment of the DolphinScheduler cluster
- Connect to and open the built-in open-source app store, then search for the keyword `dolp` to find the DolphinScheduler application.

- Click `install` to the right of DolphinScheduler to open the installation page, fill in the required information, and click OK to start the installation. The page then jumps to the application view automatically.
| Option | Description |
|---|---|
| Team name | A user-created workspace, isolated by namespace |
| Cluster name | The K8s cluster to deploy DolphinScheduler to |
| Select application | The application to deploy DolphinScheduler into; an application contains several related components |
| Application version | The DolphinScheduler version; currently the only available version is 3.0.0-beta2 |

- Wait a few minutes for the DolphinScheduler cluster to finish installing and start running.

- Click Access to open the DolphinScheduler-API component. The default username and password are `admin/dolphinscheduler123`.
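
To check the deployment from a script rather than the browser, the default credentials can be submitted to the API component. The following is a minimal sketch and is not taken from the original article: it assumes the API exposes a form-based `POST /dolphinscheduler/login` endpoint that takes `userName` and `userPassword`, and `BASE_URL` is a placeholder for whatever address Rainbond assigns to the DolphinScheduler-API component. It only builds the request; sending it is left to the caller.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder: replace with the access address Rainbond shows for DolphinScheduler-API.
BASE_URL = "http://dolphinscheduler-api.example.com"


def build_login_request(base_url: str, user: str, password: str) -> Request:
    """Build (but do not send) a form-encoded login request.

    Assumption: the login endpoint is POST /dolphinscheduler/login with
    the form fields userName and userPassword.
    """
    body = urlencode({"userName": user, "userPassword": password}).encode("utf-8")
    return Request(
        url=base_url.rstrip("/") + "/dolphinscheduler/login",
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


# Default credentials from the article.
request = build_login_request(BASE_URL, "admin", "dolphinscheduler123")
```

Passing `request` to `urllib.request.urlopen` would send it once `BASE_URL` points at a real deployment.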

Scaling API, Master, and Worker nodes
The DolphinScheduler API, Master, and Worker components all support scaling to multiple instances, which ensures high availability of the whole cluster.
Taking Worker as an example, go to the component -> Scaling and set the number of instances.

To verify the Worker nodes, go to DolphinScheduler UI -> Monitor Center -> Worker and view the node information.

Configuration file
The API and Worker services share `/opt/dolphinscheduler/conf/common.properties`, so to change the configuration you only need to modify the configuration file of the API service.
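
Because only one copy of the file needs changing, a small helper can apply an edit consistently. The sketch below is illustrative only: it assumes plain `key=value` lines, and `example.key` is a placeholder, not a real DolphinScheduler setting.

```python
def set_property(text: str, key: str, value: str) -> str:
    """Return properties text with key set to value.

    Replaces the first uncommented `key=...` line, or appends one if the
    key is absent. Only the plain key=value form is handled.
    """
    lines = text.splitlines()
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Skip comments and lines that are not key=value pairs.
        if stripped.startswith(("#", "!")) or "=" not in stripped:
            continue
        name = stripped.split("=", 1)[0].strip()
        if name == key:
            lines[i] = f"{key}={value}"
            break
    else:
        # Key not found anywhere: append it at the end.
        lines.append(f"{key}={value}")
    return "\n".join(lines) + "\n"


# example.key is a hypothetical setting used only for illustration.
sample = "# comment\nexample.key=old\nother.key=1\n"
updated = set_property(sample, "example.key", "new")
```

In practice the input would be the content of `common.properties` taken from the API service, and the result would be written back there.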
How to support Python 3?
Python 3 is installed in the Worker service by default. To use it, add the environment variable `PYTHON_HOME=/usr/bin/python3`.
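
Before relying on the variable, it can help to confirm that the path really points at a Python 3 interpreter. This is a minimal sketch; the helper name `python_major_version` is made up for illustration, and the demo call uses the interpreter running the script so that it works outside the Worker container too.

```python
import os
import subprocess
import sys


def python_major_version(interpreter: str) -> int:
    """Return the major version reported by the interpreter at the given path."""
    if not (os.path.isfile(interpreter) and os.access(interpreter, os.X_OK)):
        raise FileNotFoundError(f"{interpreter} is not an executable file")
    result = subprocess.run(
        [interpreter, "-c", "import sys; print(sys.version_info[0])"],
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout.strip())


# Demo against the current interpreter; inside the Worker one would pass
# os.environ.get("PYTHON_HOME", "/usr/bin/python3") instead.
major = python_major_version(sys.executable)
```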
How to support Hadoop, Spark, DataX, etc.?
Taking DataX as an example:
- Install the plugin. In the Rainbond team view -> Plugins -> Install plugin from the store, search for `Universal data initialization plug-in` and install it.
- Enable the plugin. Go to the Worker component -> Plugins, enable the `Universal data initialization plug-in`, and modify its configuration:
  - FILE_URL: http://datax-opensource.oss-cn-hangzhou.aliyuncs.com/datax.tar.gz
  - FILE_PATH: /opt/soft
  - LOCK_PATH: /opt/soft
- Update the component. The initialization plugin automatically downloads `DataX` and extracts it to the `/opt/soft` directory.
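
The three settings suggest a download-once pattern: fetch the archive from FILE_URL, unpack it under FILE_PATH, and leave a marker under LOCK_PATH so a restart does not repeat the work. The sketch below illustrates that pattern only. It is not the plugin's actual code, and the marker name `.init.lock` and the function `init_data` are hypothetical.

```python
import os
import tarfile
import tempfile
import urllib.request

LOCK_NAME = ".init.lock"  # hypothetical marker file name


def init_data(file_url: str, file_path: str, lock_path: str) -> bool:
    """Download and unpack a .tar.gz archive once; return True if work was done."""
    lock_file = os.path.join(lock_path, LOCK_NAME)
    if os.path.exists(lock_file):
        return False  # already initialized, skip the download
    os.makedirs(file_path, exist_ok=True)
    os.makedirs(lock_path, exist_ok=True)
    with tempfile.TemporaryDirectory() as tmp:
        archive = os.path.join(tmp, "download.tar.gz")
        urllib.request.urlretrieve(file_url, archive)
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(file_path)
    # Write the marker last so that a failed download is retried next time.
    with open(lock_file, "w") as fh:
        fh.write(file_url + "\n")
    return True
```

With the values from the list above, the call would be `init_data(FILE_URL, "/opt/soft", "/opt/soft")`; a second call returns `False` without downloading again.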
About Rainbond
Rainbond's core is 100% open source and easy to use. You do not need to understand containers or Kubernetes. It supports managing multiple Kubernetes clusters and provides full lifecycle management for enterprise applications.
GitHub: https://github.com/goodrain/rainbond
Official website: https://www.rainbond.com
WeChat group: follow the Rainbond official account to join the technical exchange group.
DingTalk group: search for DingTalk group number 31096419