当前位置:网站首页>Sailing with karmada: multi cluster management of massive nodes
Sailing with karmada: multi cluster management of massive nodes
2022-06-29 04:49:00 【InfoQ】

Construction status of ICBC cloud platform

- The current business cloud scenarios of ICBC are rich and diverse , There are core business applications represented by payment lines such as Spring Festival red envelopes , There is also MySQL、Redis As a representative of the technical support applications , It also includes blockchain 、 Artificial intelligence and other new technology fields .
- At present, ICBC cloud platform has also conducted in-depth customized research and development based on mainstream open source projects , It ensures the overall autonomy and controllability .
- From the perspective of construction , It is also the largest container cloud in the industry , At present, the number has reached 28 ten thousand +.
Typical business requirements and cloud native infrastructure status
- A single cluster requires high reliability . The number of nodes in our overall single cluster is 2000 following , This is to narrow down the impact of cluster failure rate .
- The resource pool grows rapidly with the business . At present, the new business application is fully on the cloud , Stock applications are constantly migrating to the cloud , Now all core applications have been put into the cloud .
- Business level heterogeneous cluster . Business to specific K8s The version depends on , And there are a lot of heterogeneous CNI、CSI、 Including some underlying hardware heterogeneity .
- Multifocal 、 Multicenter 、 The current situation of cloudy construction , ICBC's business includes the general bank cloud 、 Branch cloud , Ecological cloud, etc . In terms of fault domain, the data center construction of two places and three centers , And there are more fine-grained divisions of multiple fault domains within the data center .
Key challenges
- Limited availability , because K8s The cluster itself is also a fault domain , At present, there is no automatic recovery across fault domains .
- Limited resources , The overall application scheduling and elastic scaling are limited by a single cluster .
- The cluster is opaque : The clusters are now heterogeneous 、 Fault domain and other attributes , Business teams need to be aware of the underlying clusters to make their own choices K8s colony , That led to it. K8s The cluster itself is opaque to upper layer applications .
- Repeat configuration . Although our business is configured and entered on the cloud management platform , But the specific configuration needs to be distributed to each cluster , And each cluster shall ensure synchronization .
Design objectives
- In the multi cluster management plane : Cluster management and overall life cycle management of the cluster , And has a unified standard API entrance .
- In terms of resource management , You need to support multiple versions of comprehensive K8s resources , It also requires multi-dimensional resources Override Support .
- In the aspect of cross cluster automatic scheduling , Scheduling needs to be based on fault domain 、 Automatic scheduling of resource allowance, etc , And it can automatically scale across clusters .
- Disaster recovery , Automatic recovery of cross cluster resources is required , And the management plane and the business cluster need to be decoupled .
- Compatibility , For heterogeneous clusters with a large number of stocks, smooth management is required , At the same time, the project itself needs high expansibility and community activity .
Joint innovation
Karmada project

Karmada The core structure of

- Karmada The control surface has its own independent APIserver, To provide K8s Native API as well as Karmada An extension of API
- adopt Karmada Scheduler, Provide targeted [ Fault domain 、 Cluster resources 、K8s Version and cluster startup plug-ins 、 Multi weighted ] Scheduling policy support , And it is convenient for users to customize the extension
- And member The synchronization aspect of the cluster ,Karmada Agent Of pull Working mode , It can effectively disperse and control the surface pressure , Realize the management of super large-scale multi cluster resource pool .
- meanwhile ,Karmada And support ExecutionController As well as the integration KubeEdge Method to implement the public cloud 、 Private cloud, edge and other network environments K8s Direct management of the cluster .
- And with the help of ExecutionSpace The design of the ,Karmada It realizes the access rights and resource isolation between different clusters , To meet the security needs in multi cluster scenarios .
Karmada The core concept

Case study : How to use Karmada Management business

Practical experience
- Support binding and scheduling of multiple resources, This ensures that the business node needs k8s Resources can be scheduled at the same time , It also greatly improves the timeliness of our resource distribution .
- Support k8s Native object, This ensures that we have a large number of current k8s External clients require little modification .
- Karmada At present, we support Pull and Push Mode distribution , Adapted to a variety of scenarios. Especially in a scenario where we have a large number of clusters , Use Pull Mode can greatly reduce Karmada Control plane performance pressure .
The follow-up plan
attach :Karmada Community technology exchange address
边栏推荐
- Annual inventory review of Alibaba cloud's observable practices in 2021
- Research Report on the overall scale, major manufacturers, major regions, products and applications of high temperature film capacitors in the global market in 2022
- 使用VS创建静态链接库.lib并使用
- 如何用万用表测试电子部件
- [performance test] introduction and installation of JMeter
- What if modstart forgets the background user or password?
- [high concurrency] deeply analyze the callable interface
- Notebook access desktop shared disk
- Complete collection of necessary documents for project management: you can't write these 14 project documents yet?
- 笔记本访问台式机的共享磁盘
猜你喜欢

Network device setting / canceling console port login separate password

Untitled

Quelles sont les méthodes de simulation et de gravure des programmes? (comprend les outils communs et la façon dont ils sont utilisés)

泰克TDS3054B示波器技术指标

仿真与烧录程序有哪几种方式?(包含常用工具与使用方式)

Decorator Pattern

What are the ways to simulate and burn programs? (including common tools and usage)

Alibaba microservice component Sentinel

仿真與燒錄程序有哪幾種方式?(包含常用工具與使用方式)

Memo pattern
随机推荐
Le langage C imprime "Love", "Mars hit Earth" et ainsi de suite en utilisant printf, qui est constamment mis à jour
【代码随想录-动态规划】最长公共子序列
[CV] wuenda machine learning course notes Chapter 13
Mysql 中的 mvcc原理
仿真与烧录程序有哪几种方式?(包含常用工具与使用方式)
What if modstart forgets the background user or password?
MySQL subquery
Cloud native annual technology inventory is released! Ride the wind and waves at the right time
Has my future been considered in the cloud native development route?
Collection of common terms used in satellite navigation
[structural mechanics] the reason why the influence line under joint load is different from that under direct load
Agilent digital multimeter software ns multimeter, real-time data acquisition and automatic data saving
Observer pattern
【HackTheBox】dancing(SMB)
See how I do it step by step (I)
什么是匿名内部类,如何使用匿名内部类
[code random entry - hash table] T15, sum of three numbers - double pointer + sort
如何创建 robots.txt 文件?
[wc2021] Fibonacci - number theory, Fibonacci sequence
Research Report on global market segmentation based on condition based maintenance (CBM) overall scale, major enterprises, major regions, products and applications in 2022