yjph83

prometheus 监控相关（非docker方式）

https://www.gitbook.com/book/songjiayang/prometheus/details （Prometheus 实战）

https://github.com/1046102779/prometheus （Prometheus 非官方中文手册）

http://www.bubuko.com/infodetail-2004088.html （基于prometheus监控k8s集群）

http://www.cnblogs.com/sfnz/p/6566951.html （安装prometheus+grafana监控mysql redis kubernetes等，非docker安装）

http://blog.csdn.net/wenwst/article/details/76624019 (Kubernetes 1.6 部署prometheus和grafana数据持久）)

https://github.com/jason-riddle/monitor-k8s-with-prom （Kubernetes 上prometheus监控相关）

https://github.com/kayrus/prometheus-kubernetes （prometheus-kubernetes）

https://github.com/prometheus/node_exporter （prometheus/node_exporter）

http://dockone.io/article/2579 （ Prometheus在Kubernetes下的监控实践）

http://www.ywnds.com/?p=9656 ( 使用Prometheus+Grafana监控MySQL实践)

https://github.com/prometheus/prometheus/releases （prometheus 下载列表）

https://github.com/prometheus/node_exporter/releases/ （node_exporter下载列表）

https://laily.net/article/Prometheus%20%E5%88%9D%E4%BD%93%E9%AA%8C%281%29%20-%20%E5%AE%89%E8%A3%85 (Prometheus 初体验(1) - 安装)

http://blog.csdn.net/u010871982/article/details/77838592?locationNum=2&fps=1 (prometheus简单入门)

https://www.robustperception.io/scaling-and-federating-prometheus/ (prometheus federate)

http://dbaplus.cn/news-72-1462-1.html (360基于Prometheus的在线服务监控实践)

1、prometheus安装

[root@localhost prometheus]# wget https://github.com/prometheus/prometheus/releases/download/v1.7.1/prometheus-1.7.1.linux-amd64.tar.gz

[root@localhost prometheus]# mkdir /opt/prometheus

[root@localhost prometheus]# tar -zxvf prometheus-1.7.1.linux-amd64.tar.gz -C /opt/prometheus --strip-components=1

[root@localhost prometheus]# cd /opt/prometheus/

[root@localhost prometheus]# cp prometheus.yml prometheus.yml.back

[root@localhost prometheus]# vim prometheus.yml #注意 yaml 文件不允许有 tab 符，一律得使用空格

# 全局配置

global:

scrape_interval: 15s #默认 15秒到目标处抓取数据

# 这个标签是在本机上每一条时间序列上都会默认产生的，主要可以用于联合查询、远程存储、Alertmanger时使用。

external_labels:

monitor: 'codelab-monitor'

# 这里就表示抓取对象的配置

# 设置抓取自身数据

scrape_configs:

# job name 这个配置是表示在这个配置内的时间序例，每一条都会自动添加上这个{job_name:"prometheus"}的标签。

- job_name: 'prometheus'

# 重写了全局抓取间隔时间，由15秒重写成5秒。

scrape_interval: 5s

static_configs:

- targets: ['localhost:9090']

启动：

nohup ./prometheus --config.file=prometheus.yml &

或

nohup /opt/ prometheus-1.7.1.linux-amd64/prometheus &

这时浏览器中页面访问http://localhost:9090/ ，可以看到Prometheus的graph页面。

http://www.cnblogs.com/vovlie/p/Prometheus_install.html （参考）

可直接加载Prometheus配置而不停止服务方式让配置生效，在调试过程中，每次修改配置后执行该操作让配置生效更方便：

# curl -X POST http://localhost:9090/-/reload

# netstat -antl|grep 9090 #查看是否启动成功！

如果我们要采用进程方式管理它，则需要创建脚本：

可以创建一个用户名来启动：

[root@localhost config]# useradd prometheus

[root@localhost ~]# vim /etc/systemd/system/prometheus.service

[Unit]

Description=Prometheus Server

Documentation=https://prometheus.io/docs/introduction/overview/

Deion=prometheus

After=network.target

[Service]

Type=simple

User=prometheus

ExecStart=/usr/local/prometheus/prometheus \ #prometheus安装目录

-config.file=/usr/local/prometheus/prometheus.yml \ #prometheus安装目录下的prometheus.yml

-storage.local.path=/home/prometheusdata

Restart=on-failure

[Install]

WantedBy=multi-user.target

说明： -storage.local.path=/home/prometheusdata 指定的存储目录必须要让创建的prometheus用户有权限

保存退出后，此时可以用命令启动 systemctl start prometheus

# systemctl enable Prometheus.service

# systemctl restart Prometheus.service

2、Grafana 安装

[root@localhost prometheus]# wget https://s3-us-west-2.amazonaws.com/grafana-releases/release/grafana-4.5.0-1.x86_64.rpm

[root@localhost prometheus]# yum install initscripts fontconfig -y

[root@localhost prometheus]# rpm -Uvh grafana-4.5.0-1.x86_64.rpm

warning: grafana-4.5.0-1.x86_64.rpm: Header V4 RSA/SHA1 Signature, key ID 24098cb6: NOKEY

error: Failed dependencies:

urw-fonts is needed by grafana-4.5.0-1.x86_64

安装发现报错;所以采用如下命令重新安装：

[root@localhost prometheus]# yum localinstall grafana-4.5.0-1.x86_64.rpm

[root@localhost prometheus]# service grafana-server start #启动服务

Starting grafana-server (via systemctl): [ OK ]

[root@localhost prometheus]# netstat -anp|grep 3000

查看到3000 端口已经OK；

页面http://localhost:3000 ，默认账号、密码admin/admin

http://docs.grafana.org/installation/rpm/ （gragana 官方文档）

可以将Grafana设置为系统服务

#mkdir-p/var/run/grafana

#chowngrafana.grafana/var/run/grafana

#vim/etc/sysconfig/grafana-server,

添加:PID_FILE_DIR=/var/run/grafan

#vim/etc/systemd/system/grafana.service

[Unit]

Description=GrafanaServices

Documentation=https://github.com/grafana/grafana

After=network.target

[Service]

EnvironmentFile=/etc/sysconfig/grafana-server

User=grafana

Group=grafana

Type=simple

WorkingDirectory=/usr/share/grafana

RuntimeDirectory=grafana

RuntimeDirectoryMode=0750

ExecStart=/usr/sbin/grafana-server\

--config=${CONF_FILE} \

--pidfile=${PID_FILE_DIR}/grafana-server.pid \

cfg:default.paths.logs=${LOG_DIR} \

cfg:default.paths.data=${DATA_DIR} \

cfg:default.paths.plugins=${PLUGINS_DIR}

LimitNOFILE=10000

TimeoutStopSec=20UMask=0027

[Install]

WantedBy=multi-user.target

#以上配置文件中的变量${CONF_FILE}读取的是/etc/sysconfig/grafana-server中的内容

#配置文件变更后必须先reload

# systemctl daemon-reload

# systemctl restart grafana.service

# systemctl enable grafana.service

Prometheus 和 Grafana 的对接如下：

https://prometheus.io/docs/visualization/grafana/ （prometheus和grafana对接文档）

替换grafana的dashboards

Grafana 并没有太多的配置好的图表模板，除了 Percona 开源的一些外，很多需要自行配置。

[root@localhost prometheus]# yum install git -y

[root@localhost prometheus]# git clone https://github.com/percona/grafana-dashboards.git

Cloning into 'grafana-dashboards'...

remote: Counting objects: 1308, done.

remote: Compressing objects: 100% (31/31), done.

remote: Total 1308 (delta 32), reused 40 (delta 21), pack-reused 1256

Receiving objects: 100% (1308/1308), 6.39 MiB | 1.67 MiB/s, done.

Resolving deltas: 100% (982/982), done.

[root@localhost prometheus]# cp -r grafana-dashboards/dashboards /var/lib/grafana/

[root@localhost prometheus]# vim /etc/grafana/grafana.ini

修改如下：

[dashboards.json]
enabled = true
path = /var/lib/grafana/dashboards

[root@localhost prometheus]# service grafana-server restart

或用如下命令重启：

[root@localhost prometheus]# systemctl restart grafana-server

3、node_exporter 安装

[root@localhost prometheus]# wget https://github.com/prometheus/node_exporter/releases/download/v0.14.0/node_exporter-0.14.0.linux-amd64.tar.gz

[root@localhost prometheus]# tar -zxvf node_exporter-0.14.0.linux-amd64.tar.gz

[root@localhost local]# mv /home/prometheus/node_exporter-0.14.0.linux-amd64 ./node_exporter-0.14.0

[root@localhost local]# cd node_exporter-0.14.0/

[root@localhost node_exporter-0.14.0]# nohup ./node_exporter &

查看进程是否OK

[root@localhost node_exporter-0.14.0]# ps -ef|grep node_exporter

root 24760 24106 0 14:39 pts/1 00:00:00 ./node_exporter

root 24766 24106 0 14:39 pts/1 00:00:00 grep --color=auto node_exporter

node_exporter 也可做成服务进程启动，

[root@localhost ~]# vim /etc/systemd/system/node_exporter.service

提供的node exporter 的 systemd 脚本如下：

[Unit]

Deion=node_exporter

Description=Prometheus node exporter

After=local-fs.target network-online.target network.target

Wants=local-fs.target network-online.target network.target

[Service]

Type=simple

User=prometheus #用户prometheus

ExecStart=/usr/local/prometheus/node_exporter/node_exporter

Restart=on-failure

[Install]

WantedBy=multi-user.target

# systemctl enable node_export.service

# systemctl restart node_export.service

4、alertManager 安装

http://blog.csdn.net/y_xiao_/article/details/50818451

Prometheus Alertmanager报警组件

http://www.jianshu.com/p/239b145e2acc (Prometheus Alertmanager报警组件)

Alertmanager报警模块

https://github.com/prometheus/alertmanager ）（alertmanager gighub）

Alert template:

https://prometheus.io/blog/2016/03/03/custom-alertmanager-templates/ （自定义的alertmanager 模板）

Sending alert notifications to multiple destinations

https://www.robustperception.io/sending-alert-notifications-to-multiple-destinations/ (发送提醒到多目的地)

Alert tree:

https://prometheus.io/webtools/alerting/routing-tree-editor/ (Routing tree editor)

[root@localhost prometheus]# wget https://github.com/prometheus/alertmanager/releases/download/v0.9.1/alertmanager-0.9.1.linux-amd64.tar.gz

[root@localhost prometheus]# tar -zxvf alertmanager-0.9.1.linux-amd64.tar.gz

[root@localhost prometheus]# mv alertmanager-0.9.1.linux-amd64 /opt/alertmanager

[root@localhost prometheus]# cd /opt/alertmanager

[root@localhost prometheus]# nohup ./alertmanager -config.file=simple.yml &

重启prometheus 服务：

# ./prometheus -config.file=prometheus.yml -alertmanager.url http://localhost:9093

也可以通过加载配置文件方式而不重启Alertmanager服务：

# curl -XPOST http://localhost:9093/-/reload

# 设置Alertmanager 系统服务

# vim /etc/systemd/system/alertmanager.service

[Unit]

Description=Prometheus Alertmanager.

Documentation=https://github.com/prometheus/alertmanager

After=network.target

[Service]

EnvironmentFile=-/etc/alertmanager/template

User=root

ExecStart=/opt/alertmanager/alertmanager \

-config.file=/opt/alertmanager/simple.yml \

-storage.path=/home/alertmanager \

$ALERTMANAGER_OPTS

ExecReload=/bin/kill -HUP $MAINPID

Restart=on-failure

[Install]

WantedBy=multi-user.target

最后执行：

# systemctl enable alertmanager.service

# systemctl restrart alertmanager.service

访问Alertmanager页面：http://ip:9093/#/alerts

配置 Alertmanager

报警分两部分，报警条件规则文件默认放在Prometheus安装目录下，文件名为 alert.rules。具体通知内容，例如邮件地址和通知人员设置在Alertmanager安装目录下的simply.yml文件，以下是一些基础和常用配置，阈值和时间根据自己需求进行修改。

#alert.rules:

ALERT node_down

IF up == 0 AND job="node"

FOR 5m

ANNOTATIONS {

summary = "Node is down",

description = "Node has been unreachable for more than 5 minutes.",

severity = "warning"

}

ALERT snmp_down

IF up == 0 AND job="snmp"

FOR 5m ANNOTATIONS {

summary = "SNMP is down",

description = "SNMP has been unreachable for more than 5 minutes.",

severity = "warning"

}

ALERT fs_at_80_percent

IF hrStorageUsed{hrStorageDescr=~"/.+"} / hrStorageSize >= 0.8

FOR 15m

ANNOTATIONS {

summary = "File system {{$labels.hrStorageDescr}} is at 80%",

description = "{{$labels.hrStorageDescr}} has been at 80% for more than 15 Minutes.",

severity = "warning"

}

ALERT fs_at_90_percent

IF hrStorageUsed{hrStorageDescr=~"/.+"} / hrStorageSize >= 0.9

FOR 15m

ANNOTATIONS {

summary = "File system {{$labels.hrStorageDescr}} is at 90%",

description = "{{$labels.hrStorageDescr}} has been at 90% for more than 15 Minutes.",

severity = "average"

}

ALERT disk_load_mostly_random_reads

IF rate(diskIOReads{diskIODevice=~"sd[a-z]+"}[5m]) > 20 AND

rate(diskIONReadX{diskIODevice=~"sd[a-z]+"}[5m]) / rate(diskIOReads{diskIODevice=~"sd[a-z]+"}[5m]) < 10000

FOR 15m

ANNOTATIONS { summary = "Disk {{$labels.diskIODevice}} reads are mostly random.",

description = "{{$labels.diskIODevice}} reads have been mostly random for the past 15 Minutes.",

severity = "info"

}

ALERT disk_load_mostly_random_writes

IF rate(diskIOWrites{diskIODevice=~"sd[a-z]+"}[5m]) > 20 AND

rate(diskIONWrittenX{diskIODevice=~"sd[a-z]+"}[5m]) / rate(diskIOWrites{diskIODevice=~"sd[a-z]+"}[5m]) < 10000

FOR 15m

ANNOTATIONS {

summary = "Disk {{$labels.diskIODevice}} writes are mostly random.",

description = "{{$labels.diskIODevice}} writes have been mostly random for the past 15 Minutes.",

severity = "info"

}

ALERT disk_load_high

IF diskIOLA1{diskIODevice=~"s|vd[a-z]+"} > 30

FOR 15m

ANNOTATIONS {

summary = "Disk {{$labels.diskIODevice}} is at 30%",

description = "{{$labels.diskIODevice}} Load has exceeded 30% over the past 15 Minutes.",

severity = "warning"

}

ALERT cpu_load_high

IF ssCpuIdle < 70

FOR 15m

ANNOTATIONS {

summary = "CPU is at 30%",

description = "CPU Load has constantly exceeded 30% over the past 15 Minutes.",

severity = "warning"

}

ALERT linux_load_high

IF laLoad1 > 50

FOR 15m

ANNOTATIONS {

summary = "Linux Load is at 40",

description = "Linux Load has constantly exceeded 40 over the past 15 Minutes.",

severity = "average"

}

ALERT if_operstatus_changed

IF delta(ifOperStatus[15m]) != 0

ANNOTATIONS {

summary = "Port {{$labels.ifDescr}} changed status",

description = "Port {{$labels.ifDescr}} went up or down in the past 15 Minutes",

severity = "info"

}

ALERT if_traffic_at_30_percent

IF ifSpeed > 10000000 AND

ifOperStatus == 1 AND

rate(ifInOctets[5m]) > ifSpeed * 0.3

FOR 15m

ANNOTATIONS {

summary = "Port {{$labels.ifDescr}} is at 30%",

description = "Port {{$labels.ifDescr}} has had at least 30% traffic over the past 15 Minutes.",

severity = "warning"

}

ALERT if_traffic_at_70_percent

IF ifSpeed > 10000000 AND

ifOperStatus == 1 AND rate(ifInOctets[5m]) > ifSpeed * 0.7

FOR 15m

ANNOTATIONS {

summary = "Port {{$labels.ifDescr}} is at 70%",

description = "Port {{$labels.ifDescr}} has had at least 70% traffic over the past 15 Minutes.",

severity = "average"

}

# CPU告警

ALERT cpu_overload

IF node_load1 >= 0.8

FOR 3m

LABELS { severity = "all" }

ANNOTATIONS {

summary = "Instance {{ $labels.instance }} cpu_load1 over 80% for 3 minutes",

description = "{{ $labels.instance }} of job {{ $labels.job }} cpu_load1 over 80% for 3 minutes.",

}

# 内存告警

ALERT memory_overload

IF (node_memory_MemTotal-node_memory_MemFree)/node_memory_MemTotal >= 0.8

FOR 3m

LABELS { severity = "all" }

ANNOTATIONS {

summary = "Instance {{ $labels.instance }} memory_load over 80% for 3 minutes",

description = "{{ $labels.instance }} of job {{ $labels.job }} memory_load over 80% for 3 minutes.",

}

---------------------------------------------------

# simply.yml

主要分三部分,Global部分设置发送邮件服务器信息，route设置规则和报警时间间隔等，receivers设置接收人。

global:

#设置发送邮件的地址和smtp信息

smtp_smarthost:'smtp.abc.com'

smtp_from:'[email protected]'

smtp_auth_username:'prometheus'

smtp_auth_password:'abcd’

route:receiver:'team-X-mails'group_by:['alertname']group_wait:30s

group_interval:5m

repeat_interval:6h

inhibit_rules:

-source_match:

severity:'critical'

target_match:

severity:'warning'

#Applyinhibitionifthealertnameisthesame.

equal:['alertname']

receivers:

-name:'team-X-mails'

email_configs:

-to:'[email protected]'

send_resolved:true

#设置完毕后需要重新加载配置文件

5、cadvisor 安装配置

docker run -d --restart=always --volume=/:/rootfs:ro --volume=/var/run:/var/run:rw --volume=/sys:/sys:ro --volume=/var/lib/docker/:/var/lib/docker:ro --volume=/dev/disk/:/dev/disk:ro --publish=8090:8080 --detach=true --name=cadvisor google/cadvisor:latest

在浏览器中：http://ip:8090 就可以访问了

# 监控cAdvisor报警条件：

# vim containers.rules

ALERT cAdvisor_down

IF absent(container_memory_usage_bytes{name="cadvisor"})

FOR 1m

LABELS { severity = "critical" }

ANNOTATIONS {

summary= "cAdvisor containers down",

description= "cAdvisor container is down for more than 1 minutes."

}

ALERT cAdvisor_high_cpu

IF sum(rate(container_cpu_usage_seconds_total{name="cadvisor"}[1m])) / count(node_cpu{mode="system"}) * 100 > 10

FOR 5m

LABELS { severity = "warning" }

ANNOTATIONS {

summary= "cAdvisor high CPU usage",

description= "cAdvisor CPU usage is {{ humanize $value}}%."

}

ALERT cAdvisor_high_memory

IF sum(container_memory_usage_bytes{name="cadvisor"}) > 1200000000 FOR 5m

LABELS { severity = "warning" }

ANNOTATIONS {

summary = "cAdvisor high memory usage",

description = "cAdvisor memory consumption is at {{ humanize $value}}.",

}

你可能感兴趣的:(prometheus,node_exporter,grafana,alertmanager,cadvisor)

主流行架构 rainbowcheng 架构架构
nexus，gitlab,svn,jenkins,sonar,docker，apollo，catteambition，axure，蓝湖，禅道,WCP；redis，kafka，es，zookeeper，dubbo，shardingjdbc，mysql，InfluxDB，Telegraf，Grafana，Nginx，xxl-job，Neo4j,NebulaGraph是一个高性能的,NOSQL图形数据库
【监控告警】02-Promtheus的学习之路 Kearey. 监控告警微服务网关学习方法
prometheus采用的是拉模式为主，推模式为辅的方式采集数据。Prometheus作为一个指标系统天生就不是精确的——由于指标本身就是稀疏采样的，事实上所有的图表和警报都是”估算”，我们也就不必太纠结于图表和警报的对应性，能够帮助我们发现问题解决问题就是一个好监控系统。当然，有时候我们也得证明这个警报确实没问题，那可以看一眼`ALERTS`指标。`ALERTS`是Prometheus在警报计算
prometheus中step或resolution的含义 iceman1952 prometheus
prometheus官方文档对resolution的解释真是语焉不详，只有下面寥寥几句话Queryingexamples|PrometheusSubqueryReturnthe5-minuterateofthehttp_requests_totalmetricforthepast30minutes,witharesolutionof1minute.rate(http_requests_total[
Prometheus运维六 PromQL查询语言详解及操作安顾里 Prometheus 监控类大数据 kubernetes 运维 linux
海阔凭鱼跃，天高任鸟飞Prometheus官网：https://prometheus.io/文章目录1.什么是PromQL?2.PromQL的基本使用2.1时间序列选择器2.1.1瞬时向量选择器2.2区间向量选择器2.2.1范围向量选择器2.2.2时间位移操作2.2.3使用聚合操作2.3标量和字符串3.PromQL操作符4.内置常用函数5.HTTPAPI操作PromQL6.使用建议1.什么是Pro
基于Prometheus和Grafana的现代服务器监控体系构建 golove666 运维 prometheus grafana 服务器
构建一个基于Prometheus和Grafana的现代服务器监控体系涉及多个步骤。以下是大体的流程和步骤说明：1.Prometheus监控系统Prometheus是一个开源的系统监控和报警工具，专门设计用于抓取时间序列数据。1.1Prometheus的安装Docker安装Prometheusdockerrun-d--name=prometheus-p9090:9090prom/prometheus
压测服务器并使用 Grafana 进行可视化豆瑞瑞 grafana
简介仓库代码GitCode-全球开发者的开源社区,开源代码托管平台参考Welcome!-TheApacheHTTPServerProjectGrafana|查询、可视化、警报观测平台https://prometheus.io/docs/introduction/overview/
Java服务端中的性能监控：Prometheus与Grafana的集成微赚淘客系统@聚娃科技 java prometheus grafana
Java服务端中的性能监控：Prometheus与Grafana的集成大家好，我是微赚淘客返利系统3.0的小编，是个冬天不穿秋裤，天冷也要风度的程序猿！在构建和维护Java服务端应用时，性能监控是确保系统稳定性和性能的重要环节。Prometheus与Grafana是当前最流行的性能监控工具组合之一，能够提供强大的数据采集、存储和可视化功能。本文将介绍如何在Java服务端中集成Prometheus与
使用Docker部署Jmeter+InfluxDB+Grafana 搭建性能监控平台 Geraint丶 docker jmeter
前言之前写过一篇《linux下性能测试监控平台InfluxDB+Grafana+Jmeter的搭建》，后来在应用中发现，在linux下部署多个原生服务组合使用时移植性较差，每次更换一台linux机器都需要重新搭建所有的服务，在安装和修改配置文件的过程中很容易出现各种各样的问题，而且排查问题非常的耗费时间。Docker部署方便，没有那么多的环境参数配置，隔离性好，更重要是可移植性强，可以完美避开li
【云原生】Prometheus 服务自动发现使用详解小码农叔叔微服务链路追踪与监控 Prometheus服务发现 prometheus服务发现普罗米修斯服务自动发现普罗米修斯文件自动发现普罗米修斯基于服务自动发现 Prometheus prometheus
目录一、前言二、Prometheus常规服务监控使用现状2.1Prometheus监控架构图2.2Prometheus服务自动发现的解决方案三、Prometheus服务自动发现介绍3.1什么是Prometheus服务自动发现3.2Prometheus自动服务发现策略3.3Prometheus自动服务发现应用场景3.4Prometheus自动服务发现原理四、Prometheus基于文件的服务发现4.
Prometheus与Grafana在DevOps中的应用与最佳实践范范0825 prometheus grafana devops
Prometheus与Grafana在DevOps中的应用与最佳实践随着DevOps文化和实践的普及，监控和可视化工具已成为DevOps工具链中不可或缺的部分。Prometheus和Grafana是其中最受欢迎的开源监控解决方案之一，它们的结合能够为系统和应用程序提供全面的监控、告警和可视化展示。本篇文章将详细探讨Prometheus和Grafana在DevOps中的应用场景、最佳实践，以及如何构
prometheus基于文件的服务发现嘟嘟嘟嘟嘟 prometheus prometheus 服务发现
之间讲到，prometheus监控的对象就来自于他的配置文件里面的targets，如果要新增被监控对象，就继续往targets里面加。但这个缺点是，每次修改完后都得重启prometheus。有没有什么办法，能在不重启的情况下增加target呢？有，那就是prometheus的服务自动发现今天咱们讲一个最常用的方式，基于文件的服务发现（File-Based-Service-Discovery）1将默
Prometheus的consul自动发现 HB199753 监控类
目录前言一、概述1、简介2、引入consul的好处3、Prometheus支持的多种服务发现机制二、Prometheus的服务发现机制1、基于文件的服务发现2、基于Consul的服务发现三、Consul的服务发现1、docker安装2、docker-compose安装3、基于docker的consul集群4、使用接口注册服务5、修改prometheus使用consul服务发现6、验证总结前言使用P
Prometheus-Alertmanger 告警实例：端口监控企微通知 Richie-Hao #Prometheus prometheus
文章目录Prometheus-Alertmanger告警实例之：端口监控企微告警安装blackbox_exporter插件设置端口监控配置告警消息通知模板rule告警规则重启alertmanager和prometheusPrometheus-Alertmanger告警实例之：端口监控企微告警安装blackbox_exporter插件wgethttps://github.com/prometheus
zabbix4.0安装+grafana数据展示——cent7.3 运维实战课程 grafana zabbix linux 运维
zabbix4.0安装+grafana数据展示——cent7.3如果对运维课程感兴趣，可以在b站上搜索我的账号：运维实战课程，可以关注我，学习更多免费的运维实战技术视频Zabbix_server:192.168.43.166被监控端：192.168.43.xxlnmp工作过程：用户请求nginx，当请求静态页面，nginx直接返回给用户，当请求动态页面,如php程序文件，nginx会调用php-f
银河麒麟V10 SP1 x86 安装Grafana 人间小苦瓜_ grafana kylin 服务器 linux 运维
目录前言一、下载解压安装包二、安装步骤1.创建grafana用户及数据存放目录2.修改配置文件3.把grafana-server添加到systemd中4.启停并设置开机启动5.访问测试前言虽然说prometheus能展示一些图表，但对比Grafana，那只是个过家家。接下来我们需要在同一个服务器上安装Grafana服务，用来展示prometheus收集到的数据一、下载解压安装包wgethttps:
“Jmeter-InfluxDB-Grafana“常见错误有哪些如何解决？神即道道法自然如来 jmeter grafana
常见错误：1.网络不同，检查网络IP是否写对，端口号有没有放开（Centos7端口号命令），防火墙是否关闭firewall-cmd--add-port=3000/tcp--permanentfirewall-cmd--add-port=3000/udp--permanentfirewall-cmd--reload2.Jmeter里面的influxDB地址里面的db=jmeter，和在influxd
在azure上搭建k8s+prometheus+grafana+ingress-controller Y.G Bingo 大数据 K8S k8s prometheus grafana nginx
申请一个AKS集群在本地实现对AKS的控制安装kubectl连接到aks(可以直接点击aks概述中的连接获取命令)使用azurecli获取aks的配置信息（比如获取commercial-yanhuibin-test的k8s配置）azaccountset--subscription32285749-d4c9-4337-b6bb-1709935abc16azaksget-credentials--re
Grafana仪表盘设计最佳实践：如何创建有效的监控面板范范0825 grafana 信息可视化
Grafana仪表盘设计最佳实践：如何创建有效的监控面板引言Grafana是一个开源的数据可视化和监控平台，它提供了丰富的仪表盘功能，用于展示和分析各种数据源（如Prometheus、InfluxDB、Elasticsearch等）。有效的仪表盘设计能够帮助团队迅速识别和解决问题，提高系统的可靠性和性能。本文将深入探讨如何设计高效的Grafana仪表盘，涵盖最佳实践和实际应用。1.了解需求和目标1
双vip高可用的MySQL集群 Hi，你好啊数据库 mysql 数据库高可用
文章目录项目介绍项目架构项目环境项目步骤环境准备Ansible服务器部署1、安装Ansible2、配置免密登录3、修改Ansible的主机清单Prometheus部署1、下载软件包2、二进制安装PrometheusServer3、通过服务管理Prometheus4、安装node_exporter5、安装mysqld_exporter6、添加被监控的服务器部署MySQL集群（基于GTID的半同步）1
Laravel Prometheus Exporter 教程郁俪晟Gertrude
LaravelPrometheusExporter教程laravel-prometheus-exporterAprometheusexporterforLaravel项目地址:https://gitcode.com/gh_mirrors/la/laravel-prometheus-exporter项目介绍LaravelPrometheusExporter是一个专为Laravel框架设计的开源工具，
基于Prometheus和Grafana的现代服务器监控体系构建不会代码的小林服务器
在当今的IT基础设施中，监控是确保系统性能和稳定性的关键组成部分。Prometheus和Grafana是两个广受欢迎的开源工具，它们可以共同构建一个功能全面、可视化强的监控系统。Prometheus是一个开源的监控系统和时间序列数据库，适用于记录实时的度量指标。它不仅提供了多维数据模型和强大的PromQL查询语言，还支持服务发现和HTTP拉取模型。这些特性使得Prometheus特别适合在微服务和
【Grafana】Nginx代理Grafana实现不开启匿名自动登录 shen12138 grafana nginx 运维
Grafana中匿名功能很好用，此方法适用于不能开启匿名访问的另类实现，并且解决了匿名无法切换Domain的问题。一、Grafana配置生成apikey修改root_url=%(protocol)s://%(domain)s:%(http_port)s/grafana1/修改serve_from_sub_path=true二、Nginxserver{listen80;#server_nameexa
APISIX apisix-dashboard prometheus grafana整合显示仪表盘（linux同理）超级无敌宇宙CV战士 prometheus grafana linux
本地环境：windows11，docker26.1.4，apisix版本3.9，curl8.7.1运行apisix1.1下载运行项目apisixgitclonehttps://github.com/apache/apisix.git其中项目中：apisix-docker\example\docker-compose.yml最新版本(3.9)的配置文件中没有apisix-dashboard相关的启动
基于Prometheus和Grafana的现代服务器监控体系构建小绵羊不怕大灰狼 prometheus grafana
1.安装PrometheusPrometheus是一个开源的监控系统和时间序列数据库，适用于记录实时的度量指标。•下载并安装Prometheus：•前往Prometheus官方网站下载适用于您操作系统的版本。•解压并配置prometheus.yml文件，定义抓取目标（targets），如服务器、应用程序等。•配置Prometheus：•编辑prometheus.yml文件，添加您要监控的服务器地址
k8s pod container内存指标说明 yifeiliu338 k8s kubernetes 容器云原生
一、问题描述我司平台研发的devops平台底层采用k8s实现，k8s自带cadvisor进行集群指标收集，根据官网，我们选用了container_memory_working_set_bytes（容器的工作集使用量）作为内存使用量的观察项，但随着后续使用过程中发现该指标上升到一定大小后就会维持不变，并不像应用实际内存使用量，没出现波动；来自kubernetes对该问题的讨论（讨论了5年多了）：ht
深入理解 Prometheus 数据模型与指标监控勤劳兔码农 prometheus
深入理解Prometheus数据模型与指标监控Prometheus作为一款开源的系统监控和报警工具，其核心在于其独特的数据模型和强大的指标监控能力。为了更好地利用Prometheus，我们需要深入理解其数据模型的构成、数据的收集方式以及如何定义和使用指标监控。本指南将详细探讨Prometheus的数据模型、指标类型、数据收集机制和查询语言（PromQL），帮助你构建对Prometheus的全面理解
InfluxDB和OpenTSDB两种时序数据库应用场景 CodeMaster_37714848 opentsdb 时序数据库数据库
InfluxDB概述：InfluxDB是一个开源的高性能时序数据库，专门用于处理大量的时间序列数据。它由InfluxData开发，支持高写入吞吐量和灵活的查询。特点：高性能写入和查询：设计上注重高写入速度和低延迟查询。SQL-like查询语言：使用类似SQL的InfluxQL或Flux查询语言，简化了复杂查询的编写。数据压缩：提供高效的数据压缩机制，减少存储需求。集成和工具：支持与Grafana等
k8s Prometheus 条纹布鲁斯 kubernetes prometheus 云原生
一、部署Prometheuskubectlcreatenskube-ops#创建prometheus-cm.yamlapiVersion:v1kind:ConfigMapmetadata:name:prometheus-confignamespace:kube-opsdata:prometheus.yml:|global:scrape_interval:15s#表示prometheus抓取指标数据
Prometheus与Grafana入门：从安装到基础监控的完整指南勤劳兔码农 prometheus grafana
Prometheus与Grafana入门：从安装到基础监控的完整指南Prometheus和Grafana是现代监控系统的黄金组合。Prometheus作为一个开源的监控系统和时间序列数据库，以其强大的指标收集和查询能力广泛应用于云原生环境。而Grafana则是一个用于数据可视化和监控的开源平台，能够将Prometheus收集的数据以图表的形式展现出来，帮助用户更直观地理解系统的运行状态。本指南将从
二、Prometheus常用exporter安装详解 Spring雷监控日志管理企业运维实战 Doker运维实战 prometheus elasticsearch linux 运维
目录一、node_exporter1.安装配置2.节点添加3.状态查询二、elasticsearch_exporter1.安装配置2.节点添加3.状态查询三、redis_exporter1.安装配置2.节点添加3.状态查询四、rabbitmq_exporter1.安装配置2.节点添加3.状态查询五、kafka_exporter1.安装配置2.节点添加3.状态查询六、GrafanaDashboard
web前段跨域nginx代理配置刘正强 nginx cms Web
nginx代理配置可参考server部分 server { listen 80; server_name localhost;
spring学习笔记 caoyong spring
一、概述 a>、核心技术 : IOC与AOP b>、开发为什么需要面向接口而不是实现接口降低一个组件与整个系统的藕合程度，当该组件不满足系统需求时，可以很容易的将该组件从系统中替换掉，而不会对整个系统产生大的影响 c>、面向接口编口编程的难点在于如何对接口进行初始化,(使用工厂设计模式)
Eclipse打开workspace提示工作空间不可用 0624chenhong eclipse
做项目的时候，难免会用到整个团队的代码，或者上一任同事创建的workspace， 1.电脑切换账号后，Eclipse打开时，会提示Eclipse对应的目录锁定，无法访问，根据提示，找到对应目录，G:\eclipse\configuration\org.eclipse.osgi\.manager，其中文件.fileTableLock提示被锁定。解决办法，删掉.fileTableLock文件，重
Javascript 面向对面写法的必要性？一炮送你回车库 JavaScript
现在Javascript面向对象的方式来写页面很流行，什么纯javascript的mvc框架都出来了：ember 这是javascript层的mvc框架哦,不是j2ee的mvc框架我想说的是，javascript本来就不是一门面向对象的语言，用它写出来的面向对象的程序，本身就有些别扭，很多人提到js的面向对象首先提的是：复用性。那么我请问你写的js里有多少是可以复用的，用fu
js array对象的迭代方法换个号韩国红果果 array
1.forEach 该方法接受一个函数作为参数，对数组中的每个元素使用该函数 return 语句失效 function square(num) { print(num, num * num); } var nums = [1,2,3,4,5,6,7,8,9,10]; nums.forEach(square); 2.every 该方法接受一个返回值为布尔类型
对Hibernate缓存机制的理解归来朝歌 session 一级缓存对象持久化
在hibernate中session一级缓存机制中，有这么一种情况：问题描述：我需要new一个对象，对它的几个字段赋值，但是有一些属性并没有进行赋值，然后调用 session.save()方法，在提交事务后，会出现这样的情况： 1：在数据库中有默认属性的字段的值为空 2：既然是持久化对象，为什么在最后对象拿不到默认属性的值？通过调试后解决方案如下：对于问题一，如你在数据库里设置了
WebService调用错误合集 darkranger webservice
Java.Lang.NoClassDefFoundError: Org/Apache/Commons/Discovery/Tools/DiscoverSingleton 调用接口出错，一个简单的WebService import org.apache.axis.client.Call;import org.apache.axis.client.Service; 首先必不可
JSP和Servlet的中文乱码处理 aijuans Java Web
JSP和Servlet的中文乱码处理前几天学习了JSP和Servlet中有关中文乱码的一些问题，写成了博客，今天进行更新一下。应该是可以解决日常的乱码问题了。现在作以下总结希望对需要的人有所帮助。我也是刚学，所以有不足之处希望谅解。一、表单提交时出现乱码：在进行表单提交的时候，经常提交一些中文，自然就避免不了出现中文乱码的情况，对于表单来说有两种提交方式：get和post提交方式。所以
面试经典六问 atongyeye 工作面试
题记：因为我不善沟通，所以在面试中经常碰壁，看了网上太多面试宝典，基本上不太靠谱。只好自己总结，并试着根据最近工作情况完成个人答案。以备不时之需。以下是人事了解应聘者情况的最典型的六个问题： 1 简单自我介绍关于这个问题，主要为了弄清两件事，一是了解应聘者的背景，二是应聘者将这些背景信息组织成合适语言的能力。我的回答：(针对技术面试回答，如果是人事面试，可以就掌
contentResolver.query()参数详解百合不是茶 android query()详解
收藏csdn的博客,介绍的比较详细,新手值得一看 1.获取联系人姓名一个简单的例子，这个函数获取设备上所有的联系人ID和联系人NAME。 [java] view plain copy public void fetchAllContacts() {
ora-00054:resource busy and acquire with nowait specified解决方法 bijian1013 oracle 数据库 kill nowait
当某个数据库用户在数据库中插入、更新、删除一个表的数据，或者增加一个表的主键时或者表的索引时，常常会出现ora-00054:resource busy and acquire with nowait specified这样的错误。主要是因为有事务正在执行（或者事务已经被锁），所有导致执行不成功。 1.下面的语句
web 开发乱码征客丶 spring Web
以下前端都是 utf-8 字符集编码一、后台接收 1.1、 get 请求乱码 get 请求中，请求参数在请求头中；乱码解决方法： a、通过在web 服务器中配置编码格式：tomcat 中，在 Connector 中添加URIEncoding="UTF-8"； 1.2、post 请求乱码 post 请求中，请求参数分两部份， 1.2.1、url？参数，
【Spark十六】： Spark SQL第二部分数据源和注册表的几种方式 bit1129 spark
Spark SQL数据源和表的Schema case class apply schema parquet json JSON数据源准备源数据 {"name":"Jack", "age": 12, "addr":{"city":"beijing&
JVM学习之:调优总结 -Xms -Xmx -Xmn -Xss BlueSkator -Xss -Xmn -Xms -Xmx
堆大小设置JVM 中最大堆大小有三方面限制：相关操作系统的数据模型（32-bt还是64-bit）限制；系统的可用虚拟内存限制；系统的可用物理内存限制。32位系统下，一般限制在1.5G~2G；64为操作系统对内存无限制。我在Windows Server 2003 系统，3.5G物理内存，JDK5.0下测试，最大可设置为1478m。典型设置： java -Xmx355
jqGrid 各种参数详解(转帖) BreakingBad jqGrid
jqGrid 各种参数详解分类：源代码分享个人随笔请勿参考解决开发问题 2012-05-09 20:29 84282人阅读评论(22) 收藏举报 jquery 服务器 parameters function ajax string
读《研磨设计模式》-代码笔记-代理模式-Proxy bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.lang.reflect.InvocationHandler; import java.lang.reflect.Method; import java.lang.reflect.Proxy; /* * 下面
应用升级iOS8中遇到的一些问题 chenhbc ios8 升级iOS8
1、很奇怪的问题，登录界面，有一个判断，如果不存在某个值，则跳转到设置界面，ios8之前的系统都可以正常跳转，iOS8中代码已经执行到下一个界面了，但界面并没有跳转过去，而且这个值如果设置过的话，也是可以正常跳转过去的，这个问题纠结了两天多，之前的判断我是在 -(void)viewWillAppear:(BOOL)animated 中写的，最终的解决办法是把判断写在 -(void
工作流与自组织的关系？ comsci 设计模式工作
目前的工作流系统中的节点及其相互之间的连接是事先根据管理的实际需要而绘制好的，这种固定的模式在实际的运用中会受到很多限制，特别是节点之间的依存关系是固定的，节点的处理不考虑到流程整体的运行情况，细节和整体间的关系是脱节的，那么我们提出一个新的观点，一个流程是否可以通过节点的自组织运动来自动生成呢？这种流程有什么实际意义呢？这里有篇论文，摘要是：“针对网格中的服务
Oracle11.2新特性之INSERT提示IGNORE_ROW_ON_DUPKEY_INDEX daizj oracle
insert提示IGNORE_ROW_ON_DUPKEY_INDEX 转自：http://space.itpub.net/18922393/viewspace-752123 在 insert into tablea ...select * from tableb中，如果存在唯一约束，会导致整个insert操作失败。使用IGNORE_ROW_ON_DUPKEY_INDEX提示，会忽略唯一
二叉树:堆 dieslrae 二叉树
这里说的堆其实是一个完全二叉树,每个节点都不小于自己的子节点,不要跟jvm的堆搞混了.由于是完全二叉树,可以用数组来构建.用数组构建树的规则很简单: 一个节点的父节点下标为: (当前下标 - 1)/2 一个节点的左节点下标为: 当前下标 * 2 + 1 &
C语言学习八结构体 dcj3sjt126com c
为什么需要结构体，看代码 # include <stdio.h> struct Student //定义一个学生类型，里面有age, score, sex, 然后可以定义这个类型的变量 { int age; float score; char sex; } int main(void) { struct Student st = {80, 66.6,
centos安装golang dcj3sjt126com centos
#在国内镜像下载二进制包 wget -c http://www.golangtc.com/static/go/go1.4.1.linux-amd64.tar.gz tar -C /usr/local -xzf go1.4.1.linux-amd64.tar.gz #把golang的bin目录加入全局环境变量 cat >>/etc/profile<
10.性能优化-监控-MySQL慢查询 frank1234 性能优化 MySQL慢查询
1.记录慢查询配置 show variables where variable_name like 'slow%' ; --查看默认日志路径查询结果：--不用的机器可能不同 slow_query_log_file=/var/lib/mysql/centos-slow.log 修改mysqld配置文件：/usr /my.cnf[一般在/etc/my.cnf，本机在/user/my.cn
Java父类取得子类类名 happyqing java this 父类子类类名
在继承关系中，不管父类还是子类，这些类里面的this都代表了最终new出来的那个类的实例对象，所以在父类中你可以用this获取到子类的信息！ package com.urthinker.module.test; import org.junit.Test; abstract class BaseDao<T> { public void
Spring3.2新注解@ControllerAdvice jinnianshilongnian @Controller
@ControllerAdvice，是spring3.2提供的新注解，从名字上可以看出大体意思是控制器增强。让我们先看看@ControllerAdvice的实现： @Target(ElementType.TYPE) @Retention(RetentionPolicy.RUNTIME) @Documented @Component public @interface Co
Java spring mvc多数据源配置 liuxihope spring
转自：http://www.itpub.net/thread-1906608-1-1.html 1、首先配置两个数据库 <bean id="dataSourceA" class="org.apache.commons.dbcp.BasicDataSource" destroy-method="close&quo
第12章 Ajax（下） onestopweb Ajax
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
BW / Universe Mappings blueoxygen BO
BW Element OLAP Universe Element Cube Dimension Class Charateristic A class with dimension and detail objects (Detail objects for key and desription) Hi
Java开发熟手该当心的11个错误 tomcat_oracle java 多线程工作单元测试
#1、不在属性文件或XML文件中外化配置属性。比如，没有把批处理使用的线程数设置成可在属性文件中配置。你的批处理程序无论在DEV环境中，还是UAT（用户验收测试）环境中，都可以顺畅无阻地运行，但是一旦部署在PROD 上，把它作为多线程程序处理更大的数据集时，就会抛出IOException，原因可能是JDBC驱动版本不同，也可能是#2中讨论的问题。如果线程数目可以在属性文件中配置，那么使它成为
推行国产操作系统的优劣 yananay windows linux 国产操作系统
最近刮起了一股风，就是去“国外货”。从应用程序开始，到基础的系统，数据库，现在已经刮到操作系统了。原因就是“棱镜计划”，使我们终于认识到了国外货的危害，开始重视起了信息安全。操作系统是计算机的灵魂。既然是灵魂，为了信息安全，那我们就自然要使用和推行国货。可是，一味地推行，是否就一定正确呢？先说说信息安全。其实从很早以来大家就在讨论信息安全。很多年以前，就据传某世界级的网络设备制造商生产的交