当前位置:网站首页>Sailing with karmada: multi cluster management of massive nodes
Sailing with karmada: multi cluster management of massive nodes
2022-06-29 04:49:00 【InfoQ】

Construction status of ICBC cloud platform

- The current business cloud scenarios of ICBC are rich and diverse , There are core business applications represented by payment lines such as Spring Festival red envelopes , There is also MySQL、Redis As a representative of the technical support applications , It also includes blockchain 、 Artificial intelligence and other new technology fields .
- At present, ICBC cloud platform has also conducted in-depth customized research and development based on mainstream open source projects , It ensures the overall autonomy and controllability .
- From the perspective of construction , It is also the largest container cloud in the industry , At present, the number has reached 28 ten thousand +.
Typical business requirements and cloud native infrastructure status
- A single cluster requires high reliability . The number of nodes in our overall single cluster is 2000 following , This is to narrow down the impact of cluster failure rate .
- The resource pool grows rapidly with the business . At present, the new business application is fully on the cloud , Stock applications are constantly migrating to the cloud , Now all core applications have been put into the cloud .
- Business level heterogeneous cluster . Business to specific K8s The version depends on , And there are a lot of heterogeneous CNI、CSI、 Including some underlying hardware heterogeneity .
- Multifocal 、 Multicenter 、 The current situation of cloudy construction , ICBC's business includes the general bank cloud 、 Branch cloud , Ecological cloud, etc . In terms of fault domain, the data center construction of two places and three centers , And there are more fine-grained divisions of multiple fault domains within the data center .
Key challenges
- Limited availability , because K8s The cluster itself is also a fault domain , At present, there is no automatic recovery across fault domains .
- Limited resources , The overall application scheduling and elastic scaling are limited by a single cluster .
- The cluster is opaque : The clusters are now heterogeneous 、 Fault domain and other attributes , Business teams need to be aware of the underlying clusters to make their own choices K8s colony , That led to it. K8s The cluster itself is opaque to upper layer applications .
- Repeat configuration . Although our business is configured and entered on the cloud management platform , But the specific configuration needs to be distributed to each cluster , And each cluster shall ensure synchronization .
Design objectives
- In the multi cluster management plane : Cluster management and overall life cycle management of the cluster , And has a unified standard API entrance .
- In terms of resource management , You need to support multiple versions of comprehensive K8s resources , It also requires multi-dimensional resources Override Support .
- In the aspect of cross cluster automatic scheduling , Scheduling needs to be based on fault domain 、 Automatic scheduling of resource allowance, etc , And it can automatically scale across clusters .
- Disaster recovery , Automatic recovery of cross cluster resources is required , And the management plane and the business cluster need to be decoupled .
- Compatibility , For heterogeneous clusters with a large number of stocks, smooth management is required , At the same time, the project itself needs high expansibility and community activity .
Joint innovation
Karmada project

Karmada The core structure of

- Karmada The control surface has its own independent APIserver, To provide K8s Native API as well as Karmada An extension of API
- adopt Karmada Scheduler, Provide targeted [ Fault domain 、 Cluster resources 、K8s Version and cluster startup plug-ins 、 Multi weighted ] Scheduling policy support , And it is convenient for users to customize the extension
- And member The synchronization aspect of the cluster ,Karmada Agent Of pull Working mode , It can effectively disperse and control the surface pressure , Realize the management of super large-scale multi cluster resource pool .
- meanwhile ,Karmada And support ExecutionController As well as the integration KubeEdge Method to implement the public cloud 、 Private cloud, edge and other network environments K8s Direct management of the cluster .
- And with the help of ExecutionSpace The design of the ,Karmada It realizes the access rights and resource isolation between different clusters , To meet the security needs in multi cluster scenarios .
Karmada The core concept

Case study : How to use Karmada Management business

Practical experience
- Support binding and scheduling of multiple resources, This ensures that the business node needs k8s Resources can be scheduled at the same time , It also greatly improves the timeliness of our resource distribution .
- Support k8s Native object, This ensures that we have a large number of current k8s External clients require little modification .
- Karmada At present, we support Pull and Push Mode distribution , Adapted to a variety of scenarios. Especially in a scenario where we have a large number of clusters , Use Pull Mode can greatly reduce Karmada Control plane performance pressure .
The follow-up plan
attach :Karmada Community technology exchange address
边栏推荐
- Software architecture final review summary
- Research Report on the overall scale, major manufacturers, major regions, products and application segmentation of GPS antenna modules in the global market in 2022
- What are the basic usage methods of MySQL
- 开启生态新姿势 | 使用 WordPress 远程附件存储到 COS
- Visitor pattern
- Facade pattern
- JDBC man Han building code
- 泰克DPO4104数字荧光示波器技术参数
- How to change the password of mysql8 created by docker
- Template method pattern
猜你喜欢

Cipher

如何用万用表测试电子部件

Proxy mode (proxy)

From zero to one, I will teach you to build a "search by text and map" search service (I)
![[IOT] description of renaming the official account](/img/54/43189f34b81a7441cd46d5c2066970.png)
[IOT] description of renaming the official account "Jianyi commerce" to "product renweipeng"
![[结构力学] 结点承载下影响线与直接承载下影响线不同的原因](/img/a6/fce0bb29cc5c84bc0ef20501617e06.png)
[结构力学] 结点承载下影响线与直接承载下影响线不同的原因

Quelles sont les méthodes de simulation et de gravure des programmes? (comprend les outils communs et la façon dont ils sont utilisés)
![[CV] wuenda machine learning course notes Chapter 13](/img/83/583d9ef852cf398ff8ed730bda0eab.jpg)
[CV] wuenda machine learning course notes Chapter 13

Visitor pattern

Memo pattern
随机推荐
i-Teams W3: How to build a sound-bottling business
Mvcc principle in MySQL
Actual combat! Another opening method of magic modified swagger and knife4j
How to solve startup failure due to insufficient MySQL memory
Hantai oscilloscope software | Hantai oscilloscope upper computer software ns-scope, add measurement data arbitrarily
没遇到过这三个问题都不好意思说用过Redis
Résultats D - exam de Qinhuangdao au cours des 20 dernières années
汉泰示波器软件|汉泰示波器上位机软件NS-Scope,任意添加测量数据
data management plan
2022-2028 global and Chinese industrial electronic detonator Market Status and future development trend
Experience sharing of system analysts in preparing for exams: phased and focused
[code random entry - hash table] T15, sum of three numbers - double pointer + sort
See how I do it step by step (I)
Cucumber test practice
ROS URDF model is parsed into KDL tree
系统分析师备考经验分享:分阶段、分重点
Microsoft Pinyin IME personal preferences
Research Report on the overall scale, major manufacturers, major regions, product and application segmentation of the gsm-gprs-edge module of the Internet of things in the global market in 2022
使用VS创建静态链接库.lib并使用
Research Report on the overall scale, major manufacturers, major regions, product and application segmentation of disposable hearing aid batteries in the global market in 2022