当前位置:网站首页>Kusionstack has a sense of open source | it took two years to break the dilemma of "separating lines like mountains"

Kusionstack has a sense of open source | it took two years to break the dilemma of "separating lines like mountains"

2022-06-09 09:26:00 Ant financial sofastack

 picture

writing | Xiaodong ( flower : Yi Shan )

KusionStack person in charge 、 Senior technical expert of ant group

In the field of infrastructure technology , Focus on cloud native networks 、 Operation and maintenance, programming language and other technical work

this paper 2580 word read 6 minute

| Preface |

This article is written in KusionStack On the eve of open source , The author felt it , Reviewed the team from Kusion From the beginning of project development to the present, the project has successfully embarked on the road of open source . It describes what the author and his team have done Kusion The original intention of the project and the achievements of the project so far , It also expresses the author's sincere gratitude to the team , Between the lines there is a genuine feeling .

KusionStack What is it? ?

KusionStack It is an open source programmable cloud native protocol stack !

Kusion The word comes from fusion ( Means fusion ) , We hope to integrate multiple roles of the operation and maintenance system through a one-stop technology stack , Improve the openness of operation and maintenance infrastructure 、 Extensibility , Reduce cost and increase efficiency as a whole .KusionStack By defining the cloud native programmable access layer , Provide including configuration language KCL、 Model interface 、 Automation tools 、 A complete set of solutions including best practices , Connect cloud native infrastructure and business applications , Connect teams that define and use infrastructure , Research and development of serial application life cycle 、 test 、 Integrate 、 Release phases , Serve the construction of cloud native automation system , Accelerate cloud native landing .

 picture

PART. 1

For an ideal operation and maintenance system

2019 In the autumn ,MOSN Our work has been going on for nearly two years , During this period, we gradually completed the form verification of the core link on Alipay . The whole process except MOSN Various technical challenges and difficulties faced by itself , The so-called cloud native technology bonus , In fact, it has been constrained by the efficiency constraints caused by the solidification of the operation and maintenance system .

One day the supervisor came to me for dinner ( The set of ) , He described to me his ideal operation and maintenance system :

He hoped SRE Ability to write requirements in a specialized language , Write code to define the state of the infrastructure , Instead of spending a lot of energy checking 、 Find out 、 On the cycle of repair . The infrastructure team supports different demands by providing open programmable languages and tools SRE The team , Reach a higher whole ROI.

I immediately realized that this was related to Hashicorp Of Terraform The likeness of God ( later Hashicorp stay 2021 Listing on year-end , With more than 150 The market value of billion dollars has become the highest open source by far IPO) . On the other hand , differ IaaS Delivery scenarios , Ants face a large number of larger scale 、 More complex cloud primitives PaaS scene , It reminds me of Google Internal use of special language 、 Tools and other technologies are open Borg[1] Practice of operation and maintenance capability [2], At that time, I felt it was both interesting and challenging [3].

At the dinner table, we talked about some ideas and some uncertain challenges , He asked me if I wanted to try one , It doesn't matter if it doesn't work . I didn't think much about it , I promised before I finished my meal .

 picture

PART. 2

Long study 、 Exploration and practice

Every walk is like a mountain .

No experience in language design and development , I have no experience in open automation system design , At the beginning of the project , We are in a difficult situation .

After a long period of study 、 After the repeated cycle of exploration and practice , There is still no big improvement in the project , What is more difficult is that we not only have to face the complex and coupled scenes and problems inside ants , And go through 「 This highly engineered approach depends on whether ants have a living soil 」 The question of .

It rains all night when the house leaks , During this period, I experienced some personnel changes regrettably and helplessly , At the same time, for various reasons , The project once fell into various difficulties . Whole 2020 year , We are in the unknown 、 tangle 、 Have no choice but to spend ……

Thank you Lingxi 、 Ting Jian and my supervisor , Thank you for not giving up this project , Still stick with me .

 picture

PART. 3

A painful and happy incubation journey

By preaching continuously 、 Communication and communication , We are gradually working in the infrastructure technology team and SRE The team has found more friends with consensus .

At the same time, technically , We are also free from confusion , It really started Kusion project , Also successfully from PoC Transition to MVP The stage of .

Final , We use “ Non-standard ” Application as a pointcut , Started a painful and happy incubation journey .

Thank you, Mr. zero 、 Qinghe 、 Wavelet 、 Li Feng 、 There is no limit 、 Xiang ye 、 Dayuan …… I can't list them all here , Thank you for your persistence in making this idea a reality .

 picture

PART. 4

Breakthrough and progress

Skip the various explorations and practices in the middle , Looking back on this journey , In this more than one year, we have combined compilation technology 、 Operation and maintenance and platform technology , Successfully established a system based on Kusion Operation and maintenance system of programmable technology stack .

In business scenarios , The project covers from IaaS To SaaS A large number of O & M scenarios , Up to now, a total of 800+ application , Cover 9 individual BG,21 individual BU, Among them, typical cases are delivered to O & M for efficiency improvement 90% above , This is also the first time that ant has incorporated a large number of heterogeneous applications into a complete set of operation and maintenance technology stack .

In ant, we explored deeply based on cloud native container and micro service technology DevOps、CICD practice , The cloud native technology system of ants has been improved , Gradually release the cloud native efficiency bonus , At the same time, a near 300 Human virtual operation and maintenance R & D team .

Participants from different functions and different teams come together to solve their own problems , Contributed to 3W+ commit and 35W+ Line code , Some participants spontaneously become Kusion Research and development of . I think the accumulation of these engineers' cultural concepts and domain knowledge has brought value far beyond the operation and maintenance business itself .

 picture

Besides ,Kusion It has also become a programmable baseline product 、 Cloud native O & M products 、 Basic technologies of new generation O & M products such as multi cloud delivery products , Become part of the ant O & M architecture upgrade .

Remain true to our original aspiration , We hope to promote the rationalization of the cooperative relationship with the operation and maintenance participants through technical means 、 Automation based on open technology stack , And the precipitation and accumulation of operation and maintenance data and knowledge , In order to achieve the continuous improvement of the overall collaborative operation and maintenance efficiency .

meanwhile , Because there are many internal operation and maintenance scenarios and the links are complex , Each link needs to know the operation and maintenance business best SRE Be closely involved in , And platform 、 Application R & D collaboration , Finally, all links are united to form a complete operation and maintenance system , In this way, open technology will become more and more important .

Platform R & D 、SRE、 Code written by multiple roles such as application R & D is a kind of data precipitation , It is also a precipitation of business knowledge , Based on these data and knowledge , There will be more possibilities in the future .

PART. 5

Embark on the road of open source

After a period of internal exploration , We hope that KusionStack Open source to the technology community . Because we are aware of the problems we face , Other companies 、 The team is also facing . With open source , We hope that these work results of the team can be helpful to more people .

Of course , It is also limited by its own ability, energy and resources , We hope to have more friends to participate , Work with us to improve KusionStack, Whether you work in cloud native 、 Operation and maintenance automation 、 In programming languages or compilers , We all look forward to and welcome you to join us .

PART. 6

Looking forward to growing with you

This experience is extremely valuable to me , It is not just that they have tried new exploration and achieved breakthroughs in new technology fields and technological upgrading of ants , More valuable is , I also have a period with a group of people 95 After the small partners together will realize the idea of the magic process .

some time , Kusion Our circle of friends is no longer confined to the ants , Open source oriented , We are looking forward to having more community friends in KusionStack Grow up with us !

Learn more about …

KusionStack Star once :
https://github.com/KusionStack

KusionStack The open source , I hope it can help you , Also hope to improve with more friends KusionStack. Welcome to cloud native 、 Operation and maintenance automation 、 programing language 、 Students who are interested in compilers participate in community co construction , Explore and make breakthroughs in upgrading new technology fields , Realize more new ideas .

Click the end of the article ** Read the original ** Direct to the project address .

【 Reference link 】

[1]《Large-scale cluster management at Google with Borg》:https://pdos.csail.mit.edu/6.824/papers/borg.pdf

[2]Configuration Specifics:https://sre.google/workbook/configuration-specifics/

[3]《Borg, Omega, and Kubernetes》:https://queue.acm.org/detail.cfm?id=2898444

【 Recommended reading this week 】

 picture

KCL: Declarative cloud native configuration policy language

 picture

Wonderful review |KusionStack Open source ~

 picture

【GLCC】 Programming summer camp College student enrollment officially begins !

 picture

Ant group Service Mesh Review and Prospect of progress |SOFAStack 4th anniversary

 picture

原网站

版权声明
本文为[Ant financial sofastack]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/160/202206090901564959.html