当前位置:网站首页>Prometheus监控容器、pod、邮件告警
Prometheus监控容器、pod、邮件告警
2022-08-03 05:26:00 【养了一只皮卡丘】
Cadvisor 进行收集,通过 Prometheus 作为数据源,利用 Grafana 进行展示。
环境说明:
已做工作可以参考上一篇文章Prometheus、Grafan基于docker部署
| 主机名 | IP | 部署功能 |
|---|---|---|
| master | 192.168.143.140 | Grafan 容器 Prometheus 容器 node_exporter |
| node1 | 192.168.143.141 | cadvisor容器 node_exporter |
node1主机上 用此命令运行容器google/cadvisor官方镜像
docker run \
--volume=/:/rootfs:ro \
--volume=/var/run:/var/run:ro \
--volume=/sys:/sys:ro \
--volume=/var/lib/docker/:/var/lib/docker:ro \
--volume=/dev/disk/:/dev/disk:ro \
--publish=8080:8080 \
--detach=true \
--name=cadvisor \
--privileged \
--device=/dev/kmsg \
google/cadvisor
[[email protected] ~]# docker run \
> --volume=/:/rootfs:ro \
> --volume=/var/run:/var/run:ro \
> --volume=/sys:/sys:ro \
> --volume=/var/lib/docker/:/var/lib/docker:ro \
> --volume=/dev/disk/:/dev/disk:ro \
> --publish=8080:8080 \
> --detach=true \
> --name=cadvisor \
> --privileged \
> --device=/dev/kmsg \
> google/cadvisor





在 master 主机上配置prometheus.yml文件
使prometheus能够接受到node1采集的信息
[[email protected] ~]# vim /opt/prometheus.yml
# my global config
global:
scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
# scrape_timeout is set to the global default (10s).
# Alertmanager configuration
alerting:
alertmanagers:
- static_configs:
- targets:
# - alertmanager:9093
# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
# - "first_rules.yml"
# - "second_rules.yml"
# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
# The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
- job_name: "prometheus"
# metrics_path defaults to '/metrics'
# scheme defaults to 'http'.
static_configs:
- targets: ["192.168.143.140:9100"]
- job_name: "Linux Server"
static_configs:
- targets:
- 192.168.143.141:9100
- 192.168.143.142:9100
//新增配置
- job_name: "cadvisor Service "
static_configs:
- targets: ["192.168.143.141:8080"]
//重启docker,也可以docekr restart prometheus
[[email protected] ~]# systemctl restart docker
//master上面查看,监控状态发现有新的节点

//发现原来的模板监控不了,此时添加新的模板
添加新的模板



边栏推荐
猜你喜欢
随机推荐
树——前序
ZEMAX | 如何倾斜和偏心序列光学元件
PHP二维数组保留键值去重
关于芯片你了解吗?
find命令、sort命令、uniq命令
数组与字符串8-最长回文子串
二、Exception和Error有什么区别?
ZEMAX | 探索 OpticStudio中的序列模式
ZEMAX | 绘图分辨率结果对光线追迹的影响
使用JSP实现简单的登录注册功能,并且使用Session跟踪用户登录信息
2. What is the difference between Exception and Error?
VI和VIM编辑指令
servlet学习(七)ServletContext
SQLMAP介绍及使用
IP数据包的格式(1)
【C语言】斐波那契数列
常见的电容器有哪些?唯样商城
Delightful Nuxt3 Tutorial (2): Build a Blog Quickly and Easily
Phase Vocoder的补充完善,Matlab音频变速不变调、变调不变速
增强光学系统设计 | Zemax 全新 22.2 版本产品现已发布!









