当前位置:网站首页>[operation and maintenance thinking] how to do a good job in cloud operation and maintenance services?
[operation and maintenance thinking] how to do a good job in cloud operation and maintenance services?
2020-11-09 15:20:00 【Jiawei Technology】
Do you need operation and maintenance after going to the cloud ? Of course. . Going to the cloud can really simplify part of the operation and maintenance work , For example, for the daily operation and maintenance of the server , But because of the nature of cloud computing ( Open and use 、 Flexible expansion, etc ) And fast iteration of cloud products , Compared with the traditional operation and maintenance of stable , Cloud operation and maintenance more reflects the sensitive state characteristics . How to do the cloud operation and maintenance services well ? In addition to doing a good job in the cloud operation and maintenance basic service guarantee system, there is no problem , We should do a good job in value-added core operation and maintenance services . meanwhile , Cloud computing technology is changing day by day , The ability of operation and maintenance personnel also needs to keep pace with the times . The cloud age is reshaping IT Operation and maintenance .
The difference between cloud operation and maintenance and traditional operation and maintenance
Cloud computing is in full swing , From the Internet industry to manufacturing 、 Finance 、 traffic 、 Medical and other traditional industries are constantly infiltrating and integrating , Promote the transformation and upgrading of traditional industries , Enterprises are enjoying the huge dividend released by cloud computing . however , Is everything going to happen when you get into the cloud ? Of course not. , To enjoy the cloud computing bonus , You need to use the cloud well , Cloud operation and maintenance services are essential .
What is the difference between cloud operation and maintenance and traditional operation and maintenance ? The construction goal of cloud and traditional data center is consistent , They are all for the enterprise IT service ; The operation and maintenance responsibilities are all to guarantee IT The quality of service , Around service level agreements SLA Carry out various operation and maintenance activities . But because of the nature of cloud computing , Cloud operation and maintenance is quite different from traditional operation and maintenance :
The operation and maintenance objects are different
Traditional operation and maintenance , The starting point of contact is mostly hardware , Server, such as 、 Network devices 、 Storage equipment and wind, fire, water and electricity , In the era of cloud computing , Operation and maintenance personnel have been unable to see any physical equipment , The object of cloud operation and maintenance is more soft .
The operation and maintenance requirements are different
Because of the nature of Cloud Computing , Rapid deployment of applications on cloud platform 、 Quick update 、 Real time monitoring and other aspects put forward higher requirements for cloud operation and maintenance ; At the same time for flexible expansion 、 The operation and maintenance services with cloud characteristics such as cloud native also put forward higher requirements .
The operation and maintenance form is different
Cloud vendors' products are constantly and rapidly iterating , Compared with the traditional operation and maintenance of stable , Cloud operation and maintenance is more in a sensitive state , Operation and maintenance personnel need to constantly update their knowledge base along with the upgrade and iteration of cloud products .
The key points of operation and maintenance services on the cloud
Do a good job in cloud operation and maintenance services , You need to be clear about the content and focus of the service , And the way the service is done , The cloud operation and maintenance service system is shown in the figure below :
01 Cloud maintenance basic services
The basic services of cloud operation and maintenance are mainly reflected in three aspects : Monitoring alarm 、 Safety operation and maintenance 、 Daily problem handling .
Monitoring is the most important part of the whole operation and maintenance and even the whole product life cycle , Early warning and fault detection in advance , After the event, provide detailed data for tracking and positioning problems . Monitoring involves both infrastructure such as hosts CPU、 Memory 、 disk IOPS、 Network traffic monitoring , It's also about applications APM monitor , At the same time, whether the operation and maintenance personnel confirm to receive the alarm information 、 Whether the alarm problem is being handled 、 It is equally important to track and manage the process and results of processing .
Safety operation and maintenance includes safety reinforcement 、 Vulnerability scanning 、 Patch fix 、 Security architecture optimization . Security reinforcement is to check the safety baseline and reinforce the safety baseline configuration , Vulnerability scanning for application security scanning , Patch fix for operating system 、 middleware 、 Database for patch updates or bug fixes , Security architecture optimization improves the security capability of existing architecture and security products .
Daily problem handling , Including the installation and configuration of various resource products on the cloud 、 Lifting fitting 、 Backup service 、 Timely handling of technical problems .
02 Cloud operation and maintenance core services
The basic service of cloud operation and maintenance ensures that the system can run smoothly , For enterprises , This is not enough . What enterprises need is more standard 、 More standard 、 Lower cost 、 Lower risk 、 Better scalability to ensure the operation of the system , For operation and maintenance personnel , More efforts need to be devoted to the core aspects of operation and maintenance services .
The core services of cloud operation and maintenance are also reflected in three aspects : Cloud best practice standards 、 Cloud cost optimization 、 Cloud depth inspection and optimization .
Cloud best practices mean that deployment on the cloud is configured according to optimal standards . Including architecture best practices 、 Best practices for cloud resource selection and configuration 、 Best practices for cloud management . For any enterprise on the cloud , They want to use the cloud according to best practice standards .
such as , Here is the hybrid Cloud Architecture , It's designed to best practice standards .
Enterprise local IDC And the public cloud through a dedicated line or VPN Get through , Deploy applications across availability zones in the public cloud , Distribute user requests through load balancing , Form a high availability Architecture , Use cloud database as much as possible , On the Internet 、 host 、 application 、 Deploy cloud security products at different levels such as databases to ensure application and data security in an all-round way , file 、 The database is regularly backed up through the cloud native backup service , Use the fortress machine 、 Access control services 、 Cloud monitoring and other products assist operation and maintenance management .
Again , In terms of cloud resources , Five traditional cloud resources ( Calculation 、 Storage 、 The Internet 、 database 、 Security ) And the selection and allocation of other resources also need to establish standards . For example, planning a VPC There are still more than one. VPC, How to plan and design the subnet segment of switching machine 、 Naming rules for virtual machines 、 Virtual machine selection standard 、 Security group port opening principles, etc , Only match according to best practice standards , Operation and maintenance will be more efficient in the future , Scalability is better .
Best practices for cloud management include backup 、 disaster 、 account number 、 Authority and so on .
Another core service for cloud operation and maintenance , It is to help enterprises save costs . According to the professional advisory body RightScale According to a research report from , Today's enterprise users spend as much as 30%.
How to do a good job in cloud cost optimization operation and maintenance services ? It involves the establishment of cloud cost optimization system , Establish a set of pre engagement planning to in-process analysis , Then to the continuous cycle of cost optimization system after evaluation improvement .
The system focuses on implementation , For example, analyze how to reduce the waste of cloud resources , You can see from the figure below 5 Dimensions are examined and analyzed :
Cloud depth inspection and optimization , By regular ( Like once a quarter ) On Architecture 、 resources 、 Configuration and cost of comprehensive health inspection , Find out the possible unreasonable allocation and weak links , Adjust and optimize , Improve the efficiency of resource utilization 、 Reduce the probability of system failure .
Cloud deep inspection must inspect all cloud resources , The patrol object includes the overall architecture 、 Security 、 cost 、 The Internet 、 Calculation 、 Storage 、 database 、 monitor 、 account number 、 Backup .
High level on the problems found 、 in 、 Low grade , High level issues are already running on the system 、 Usability 、 Function has an impact , Suggest immediate action . The level is medium 、 Low issues suggest action or attention .
03 Use tool platform to assist cloud operation and maintenance
After the workload and complexity of operation and maintenance reach a certain level , We need an automated tool platform to help , For example, automatic inspection 、 Automated release 、 Automatic processing of alarm events, etc , Improve efficiency at the same time reduce the error rate of manual processing .
in addition , Many large and medium-sized enterprises will choose multi cloud strategy when making cloud strategic planning , How to manage multi cloud in a unified way , For example, the whole life cycle management of various cloud platform resources is carried out in a unified interface 、 How to manage the application and approval process of resources 、 How to carry out dose analysis 、 Cost analysis, etc , At this time, you need a multi cloud management platform .
Further thinking about cloud operation and maintenance services
Cloud operation and maintenance is a tedious task 、 Demanding work , In addition to routine daily operation and maintenance services , Sometimes we need to do a good job in special operation and maintenance support services . For example, after the highly concurrent e-commerce business of enterprises is put into the cloud , Similar to 618、 double 11 Wait for the promotion period , Often also need to do a good job in high concurrent escort special support work , For example, full link pressure test 、 Database tuning 、 Disaster recovery drills and so on , Ensure that the system is running smoothly during the event .
Cloud computing technology is changing day by day , Containers 、 There is no server 、 Microservices 、IOT、 Artificial intelligence and other emerging technologies bring more advantages and convenience at the same time , It also puts forward higher requirements for cloud operation and maintenance personnel , The ability of operation and maintenance personnel needs to keep pace with the times . The cloud age is reshaping IT Operation and maintenance .
版权声明
本文为[Jiawei Technology]所创,转载请带上原文链接,感谢
边栏推荐
- 中国程序员 VS 日本程序员,满屏的羞羞!
- 布客·ApacheCN 编程/后端/大数据/人工智能学习资源 2020.11
- I do digital transformation in traditional industries (1)
- H5 official account listens to events in the closed browser (left fork).
- To me, multithreading transaction must be a pseudo proposition!
- CAD2016软件安装教程
- AutoCAD2020 完整版安装图文教程、注册激活破解方法
- SEO建设者,有哪些说不出的苦?
- 听说你一夜之间变了户籍,依萍如洗的打工人该如何自救?
- 岗位内推 | 微软亚洲研究院智能多媒体组招聘计算机视觉算法实习生
猜你喜欢
什么是网站【新四化】?
OpenYurt 深度解读:如何构建 Kubernetes 原生云边高效协同网络?
To me, multithreading transaction must be a pseudo proposition!
C language (circular list) to achieve the basic function of snake
5分钟GET我使用Github 5 年总结的这些骚操作!
Spark Learning (3) -- memory management and performance tuning
高德全链路压测——语料智能化演进之路
融云完成数亿人民币 D 轮融资,将持续打造全球云通信能力
中国程序员 VS 日本程序员,满屏的羞羞!
Guest interview: Wang Jian
随机推荐
CAD2016软件安装教程
布客·ApacheCN 编程/后端/大数据/人工智能学习资源 2020.11
Flink的安装和测试
深入分析商淘多用户商城系统如何从搜索着手打造盈利点
Booker · apachecn programming / back end / big data / AI learning resources 2020.11
Cad2016 download autocad2016 download installation detailed tutorial CAD Download
Leetcode algorithm (1)
Android byte beat one side, was hanged by the interviewer! Fortunately, we got the offer from three sides
.NET报表生成器Stimulsoft Reports.Net 发布最新版v2020.5!
低功耗蓝牙单芯片为物联网助力
HomeLede 2020.11.08 v5.4.72内核 UPnP+NAS+多拨+网盘+DNS优化+帕斯沃 无缝集成+软件包
MES system is different from traditional management in industry application
03. Priority link model
程序员过高工资导致加班?应该降低程序员工资?网友:放过其他苦逼的程序员吧
移动安全加固助力 App 实现全面、有效的安全防护
岗位内推 | 微软亚洲研究院智能多媒体组招聘计算机视觉算法实习生
Simple use of AE (after effects)
基于Chef InSpec的基础设施测试
How can you be a big data worker with an annual salary of 40W if you don't work hard?
Put method of immutablemap