2.Prometheus监控入门之监控配置说明

时间：2022-09-30 11:17:02人气：次作者：快盘下载我要评论

[TOC]

0x00 组件介绍

Prometheus

描述: 如果我们采用prometheus提供的二进制可执行文件进行搭建prometheus服务器,可以按照以下流程进行操作运行，二进制Release下载地址: https://github.com/prometheus/prometheus/releases

简单流程:

# (1) 下载二进制可执行文件
wget https://github.com/prometheus/prometheus/releases/download/v2.26.0/prometheus-2.26.0.linux-amd64.tar.gz
tar -zxvf prometheus-2.26.0.linux-amd64.tar.gz  -C /usr/local/
cd /usr/local/prometheus-2.26.0.linux-amd64

# (2) 后台启动并修改开放端口
nohup ./prometheus --config.file=prometheus.yml --web.enable-lifecycle  --web.listen-address=:30090 & 

# (3) 查看启动状态
ps -ef | grep prometheus 
lsof -i:19908

# (4) 查看启动的命令行参数
./prometheus -h

# (5) 强行关闭 Prometheus
lsof -i:19908
kill -9 pid

# (6) 补充系统服务进行启动Prometheus
sudo tee /usr/lib/systemd/system/prometheus.service <<'EOF'
[Unit]
Description=Prometheus Server Systemd
Documentation=https://prometheus.io
After=network.target

[Service]
Type=simple
StandardError=journal
ExecStart=/usr/local/prometheus/prometheus --config.file=/usr/local/prometheus/prometheus.yml  --web.listen-address=:9090 --storage.tsdb.path=/app/prometheus_data --storage.tsdb.retention.time=7d --web.enable-lifecycle --web.enable-admin-api	
Restart=on-failure
RestartSec=3s

[Install]
WantedBy=multi-user.target
EOF

sudo systemctl daemon-reload && systemctl restart prometheus.service

# (7) 自动发现日志文件配置
mkdir /etc/prometheus/ && touch /etc/prometheus/WeiyiGeek_linux_nodes.yml

启动参数: 运行后我们可以访问http://192.168.12.107:30090/classic/flags查看到自定义或者默认的启动参数。

Command-Line Flags - 参数名称	参数值	参数说明
alertmanager.notification-queue-capacity	10000
alertmanager.timeout
config.file	/etc/prometheus/prometheus.yml	指定 prometheus.yml配置文件
enable-feature
log.format	logfmt	设置打印日志的格式，若有自动化日志提取工具可以使用这个参数规范日志打印的格式logger:stderr
log.level	info
query.lookback-delta	5m
query.max-concurrency	20
query.max-samples	50000000
query.timeout	2m
rules.alert.for-grace-period	10m
rules.alert.for-outage-tolerance	1h
rules.alert.resend-delay	1m
scrape.adjust-timestamps	true
storage.exemplars.exemplars-limit	0
storage.remote.flush-deadline	1m
storage.remote.read-concurrent-limit	10
storage.remote.read-max-bytes-in-frame	1048576
storage.remote.read-sample-limit	50000000
storage.tsdb.allow-overlapping-blocks	false
storage.tsdb.max-block-duration	1d12h
storage.tsdb.min-block-duration	2h
storage.tsdb.no-lockfile	false	如果用k8s的deployment 管理要设置为tue
storage.tsdb.path	data/	指定tsdb数据存储路径(容器中默认是/prometheus/data)
storage.tsdb.retention	0s
storage.tsdb.retention.size	0B	[EXPERIMENTAL]要保留的最大存储块字节数,最旧的数据将首先被删除默认为0或禁用。
storage.tsdb.retention.time	0s	指定数据存储时间即何时删除旧数据(推荐7d)
storage.tsdb.wal-compression	true	启用压缩预写日志（WAL）,根据您的数据您可以预期WAL大小将减少一半而额外的CPU负载却很少
storage.tsdb.wal-segment-size	0B
web.config.file
web.console.libraries	console_libraries
web.console.templates	consoles
web.cors.origin	.*
web.enable-admin-api	false	是否启用 admin api 的访问权限(TSDB管理API)
web.enable-lifecycle	true	是否启用 API，启用API后，可以通过 API指令完成 Prometheus 的停止、热加载配置文件等
web.external-url
web.listen-address	0.0.0.0:9090	监听地址和提供服务端口
web.max-connections	512
web.page-title	Prometheus Time Series Collection and Processing Server
web.read-timeout	5m
web.route-prefix	/
web.user-assets

Tips ：当我们启用了 API 时我们可以使用它来看当前prometheus.yml中的配置和利用POST请求重载配置。

# 查看当前配置,如果使修改后的prometheus.yml配置生效可参照下面得方式(在也不用重启容器了)
curl http://192.168.12.107:30090/api/v1/status/config

# 方式1.如果在本机二进制可执行时可以通过使用SIGHUP来重载Prometheus而不用重启(ctrl+c)
kill -SIGHUP prometheus
# 方式2.Yes, sending SIGHUP to the Prometheus process or an HTTP POST request to the /-/reload endpoint 
curl -X POST http://192.168.12.107:30090/-/reload
  # level=info ts=2021-05-08T06:37:53.793Z caller=main.go:944 msg="Loading configuration file" filename=/etc/prometheus/prometheus.yml
  # level=info ts=2021-05-08T06:37:53.799Z caller=main.go:975 msg="Completed loading of configuration file" filename=/etc/prometheus/prometheus.yml totalDuration=6.025725ms remote_storage=2.732µs web_handler=617ns query_engine=1.456µs scrape=124.866µs scrape_sd=55.119µs notify=1.099µs notify_sd=1.377µs rules=4.424043ms