当前位置:网站首页>What's the point of monitoring the involution of the system?
What's the point of monitoring the involution of the system?
2022-08-05 06:15:00 【LinkSLA】
The volume of monitoring, not one day practice
Monitoring is the starting point and difficulty of operation and maintenance.The system needs to perform these functions:
Full stack monitoring;
Association Analysis;
Concatenation across system calls;
Real-time alarm and automatic disposal;
System performance analysis.
Two scenarios of operation and maintenance: abnormal detection and early warning.In other words, good monitoring mainly serves two scenarios: experience and emergency.What is good monitoring?
![]()
1. The alarm is timely, and the designated user is notified as soon as possible to solve the fault in time and prevent the fault from spreading.
2. The alarms are accurate and cannot be falsely reported, missed or repeated, and accurate information is pushed to users.
3. The monitoring objects should be comprehensive, from the dynamic ring of the computer room, hardware equipment, operating system, application components to business systems to carry out full-stack monitoring.
4. The operation and maintenance must be closed-loop, from the alarm trigger to the convergence as an event/work order, and the receipt, processing and termination of the work order are completed within the time specified in the SLA.
Traditional O&M relies on manpower to monitor system operating status, performance indicators, and online and change services.
With the advancement of digitization, servers, software modules, and access data have proliferated, the number and complexity of IT systems have increased, and the number of monitoring projects has become difficult to deal with, and accidents cannot be accurately located.change.
Attack: Data Standards and Value Output
Rome was not built in a day, and the operation and maintenance platform has also experienced the development process of manual tooling, tool platformization, and platform intelligence.The more prominent advantages of intelligent operation and maintenance are data standards and delivery value.
To fully mine the data value of operation and maintenance, find all problems, pinpoint problems, and reduce problems.Multi-layer monitoring, breaking the island of operation and maintenance, full-stack monitoring objects, including:

01
Hardware
Hardware equipment is the basis for monitoring operation and maintenance. Monitoring includes: computer room dynamic ring, server, network equipment, storage equipment, etc.
02
Virtualization
For example: vsphere, powerVM, hyper-V, docker, K8S, etc.
03
Operating System
Support Windows, Linux, AIX and other operating systems.
04
App Components
Supports common commercial and open source components including databases and middleware.
05
Business Systems
Supports monitoring each component of the business system as a logical monitoring object through BPV (Business Process View).
Monitoring of full-stack objects improves operation and maintenance efficiency, and solves the problems of inaccurate alarms, difficult problem location, and difficult root cause location.Identify and locate problems proactively, quickly, and accurately.
1. Quality Assurance
Including abnormal detection, fault diagnosis, fault prediction, fault self-healing.
2. Cost management
Metric monitoring, anomaly detection, resource optimization, capacity planning, performance optimization.
3. Efficiency improvement
Smart changes, machine learning algorithms, security.
LinkSLA started algorithm research and selection of specific implementation scenarios in 2018. It has made breakthroughs in single-point application in full-stack monitoring, anomaly detection, and log anomaly detection, and has achieved remarkable results, bringing more data value to business.Provide decision-making basis for enterprise development.
边栏推荐
- To TrueNAS PVE through hard disk
- Remembering my first CCF-A conference paper | After six rejections, my paper is finally accepted, yay!
- js动态获取屏幕宽高度
- NIO工作方式浅析
- Spark source code-task submission process-6.1-sparkContext initialization-create spark driver side execution environment SparkEnv
- IP数据包格式(ICMP协议与ARP协议)
- ALC实验
- Account and Permission Management
- idea 常用快捷键
- 监控系统的内卷,有什么讲究?
猜你喜欢

Remembering my first CCF-A conference paper | After six rejections, my paper is finally accepted, yay!

dsf5.0新建页面访问时重定向到首页的问题

Image compression failure problem

I/O性能与可靠性

Three modes of vim

VRRP原理及命令

入门文档01 series按顺序执行

spark算子-map vs mapPartitions算子

技术分享杂七杂八技术
![[Day6] File system permission management, file special permissions, hidden attributes](/img/ec/7fb3fa671fac8abf389844c0f4fbe7.png)
[Day6] File system permission management, file special permissions, hidden attributes
随机推荐
Why can't I add a new hard disk to scan?How to solve?
lvm逻辑卷及磁盘配额
idea 常用快捷键
Hugo builds a personal blog
LeetCode面试题
传输层协议(TCP3次握手)
入门文档11 自动添加版本号
IP address and subnet division
入门文档05-2 使用return指示当前任务已完成
Introductory document 05-2 use return instructions the current task has been completed
ACL 和NAT
618,你也许可以清醒亿点点
[Day6] File system permission management, file special permissions, hidden attributes
增长:IT运维发展趋势报告
Spark source code-task submission process-6.2-sparkContext initialization-TaskScheduler task scheduler
Mongodb查询分析器解析
Getting Started Doc 08 Conditional Plugins
spark operator-textFile operator
Call the TensorFlow Objection Detection API for object detection and save the detection results locally
Getting Started 04 When a task depends on another task, it needs to be executed in sequence