石头-豆豆

二进制部署Prometheus及监控服务并实现监控告警演示

二进制部署Prometheus及监控服务

一、部署prometheus

1、下载
版本：2.33.3

https://github.com/prometheus/prometheus/releases/download/v2.33.3/prometheus-2.33.3.linux-amd64.tar.gz

2、下载完后解压即可使用

[root@k8s-03 src]# tar -zxvf prometheus-2.33.3.linux-amd64.tar.gz 
[root@k8s-03 src]# ls
prometheus-2.33.3.linux-amd64  prometheus-2.33.3.linux-amd64.tar.gz
[root@k8s-03 src]# mv prometheus-2.33.3.linux-amd64 /usr/local/prometheus
[root@k8s-03 src]# cd /usr/local/prometheus/
[root@k8s-03 prometheus]# ls
console_libraries  consoles  LICENSE  NOTICE  prometheus  prometheus.yml  promtool
[root@k8s-03 prometheus]# pwd
/usr/local/prometheus

3、设置prometheus开机启动

[root@k8s-03 prometheus]# cat /usr/lib/systemd/system/prometheus.service 
[Unit]
Description=prometheus
[Service]
ExecStart=/usr/local/prometheus/prometheus --config.file=/usr/local/prometheus/prometheus.yml  --web.listen-address=:9091 
ExecReload=/bin/kill -HUP $MAINPID
KillMode=process
Restart=on-failure
[Install]
WantedBy=multi-user.target

注：通过启动脚本加 --web.listen-address=:9091 参数的方式自定义prometheus端口，
以下演示。

[root@k8s-03 prometheus]# netstat -ntlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name    
tcp        0      0 127.0.0.1:25            0.0.0.0:*               LISTEN      1332/master         
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1069/sshd           
tcp6       0      0 ::1:25                  :::*                    LISTEN      1332/master         
tcp6       0      0 :::9091                 :::*                    LISTEN      2488/prometheus     
tcp6       0      0 :::22                   :::*                    LISTEN      1069/sshd           
[root@k8s-03 prometheus]# cat /usr/lib/systemd/system/pro
proc-sys-fs-binfmt_misc.automount  proc-sys-fs-binfmt_misc.mount      prometheus.service                 
[root@k8s-03 prometheus]# cat /usr/lib/systemd/system/pro
proc-sys-fs-binfmt_misc.automount  proc-sys-fs-binfmt_misc.mount      prometheus.service                 
[root@k8s-03 prometheus]# cat /usr/lib/systemd/system/prometheus.service 
[Unit]
Description=prometheus
[Service]
ExecStart=/usr/local/prometheus/prometheus --config.file=/usr/local/prometheus/prometheus.yml  --web.listen-address=:9091
ExecReload=/bin/kill -HUP $MAINPID
KillMode=process
Restart=on-failure
[Install]
WantedBy=multi-user.target

网页访问：

4、设置开机自启

systemctl daemon-reload
systemctl start prometheus.service
systemctl enable prometheus.service  #开机启动

5、查看是否已启动

[root@k8s-03 prometheus]# netstat -ntlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name    
tcp        0      0 127.0.0.1:25            0.0.0.0:*               LISTEN      1332/master         
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1069/sshd           
tcp6       0      0 ::1:25                  :::*                    LISTEN      1332/master         
tcp6       0      0 :::9090                 :::*                    LISTEN      2302/prometheus     
tcp6       0      0 :::22                   :::*                    LISTEN      1069/sshd

6、默认配置文件

[root@k8s-03 prometheus]# cat prometheus.yml 
# my global config
global:
  scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
    - static_configs:
        - targets:
          # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
  # - "first_rules.yml"
  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
  # The job name is added as a label `job=` to any timeseries scraped from this config.
  - job_name: "prometheus"

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    static_configs:
      - targets: ["localhost:9090"]

7、热加载prometheus配置文件

[root@prometheus prometheus]# ps -ef|grep prometheus
root 1081 1 0 13:25 ? 00:00:10 /opt/monitor/prometheus/prometheus --config.file=/opt/monitor/prometheus/prometheus.yml
root 3123 2619 0 14:10 pts/0 00:00:00 grep --color=auto prometheus
[root@prometheus prometheus]# kill -HUP 1081

8、通过web验证prometheus是否启动
ip+9090端口

二、安装node-exporter

1、下载 node-exporter

wget https://github.com/prometheus/node_exporter/releases/download/v1.3.1/node_exporter-1.3.1.linux-amd64.tar.gz

2、解压缩

[root@k8s-03 src]# tar -zxvf node_exporter-1.3.1.linux-amd64.tar.gz -C /usr/local/
[root@k8s-03 src]# cd /usr/local/
[root@k8s-03 local]# ls
bin  etc  games  include  lib  lib64  libexec  node_exporter-1.3.1.linux-amd64  prometheus  sbin  share  src
[root@k8s-03 local]# mv node_exporter-1.3.1.linux-amd64/ node_exporter

3、启动node-exporter

[root@k8s-03 node_exporter]# cat /usr/lib/systemd/system/node_exporter.service 
[Unit]
Description=node_exporter
[Service]
ExecStart=/usr/local/node_exporter/node_exporter  --collector.systemd --collector.systemd.unit-include=(docker|sshd|nginx).service
ExecReload=/bin/kill -HUP $MAINPID
KillMode=process
Restart=on-failure
[Install]
WantedBy=multi-user.target

4、加载配置并启动

systemctl daemon-reload
systemctl start node_exporter.service
systemctl enable node_exporter.service #设置开机启动

5、浏览器验证
ip地址+端口

6、prometheus设置抓取目标 node_exporter
加入以下三行

  - job_name: "node_exporter"
    static_configs:
      - targets: ["localhost:9100"]

prometheus.yml 完整文档

# my global config
global:
  scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
    - static_configs:
        - targets:
          # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
  # - "first_rules.yml"
  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
  # The job name is added as a label `job=` to any timeseries scraped from this config.
  - job_name: "prometheus"

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    static_configs:
      - targets: ["localhost:9090"]
  - job_name: "node_exporter"
    static_configs:
      - targets: ["localhost:9100"]

热加载prometheus

[root@k8s-03 prometheus]# ps -ef|grep prometheus
root       2564      1  0 15:13 ?        00:00:08 /usr/local/prometheus/prometheus --config.file=/usr/local/prometheus/prometheus.yml --web.listen-address=:9090
root      12384   2215  0 15:59 pts/0    00:00:00 grep --color=auto prometheus
root@k8s-03 prometheus]# kill -HUP 2564

浏览器验证

三、部署alertmanager

1、下载alertmanager二进制包

[root@k8s-03 src]# wget https://github.com/prometheus/alertmanager/releases/download/v0.23.0/alertmanager-0.23.0.linux-amd64.tar.gz

2、解压二进制包

[root@k8s-03 src]# tar -zxvf alertmanager-0.23.0.linux-amd64.tar.gz -C /usr/local/
alertmanager-0.23.0.linux-amd64/
alertmanager-0.23.0.linux-amd64/alertmanager.yml
alertmanager-0.23.0.linux-amd64/LICENSE
alertmanager-0.23.0.linux-amd64/NOTICE
alertmanager-0.23.0.linux-amd64/alertmanager
alertmanager-0.23.0.linux-amd64/amtool
[root@k8s-03 src]# cd /usr/local/
[root@k8s-03 local]# ls
alertmanager-0.23.0.linux-amd64  bin  etc  games  include  lib  lib64  libexec  node_exporter  prometheus  sbin  share  src
[root@k8s-03 local]# mv alertmanager-0.23.0.linux-amd64/ alertmanager
[root@k8s-03 local]# ls
alertmanager  bin  etc  games  include  lib  lib64  libexec  node_exporter  prometheus  sbin  share  src
[root@k8s-03 local]# cd alertmanager/
[root@k8s-03 alertmanager]# ls
alertmanager  alertmanager.yml  amtool  LICENSE  NOTICE
[root@k8s-03 alertmanager]# pwd
/usr/local/alertmanager

3、添加systemd管理

[root@prometheus alertmanager]# cat /usr/lib/systemd/system/alertmanager.service

[Unit]
Description=alertmanager
[Service]
ExecStart=/usr/local/alertmanager/alertmanager --config.file=/usr/local/alertmanager/alertmanager.yml
ExecReload=/bin/kill -HUP $MAINPID
KillMode=process
Restart=on-failure
[Install]
WantedBy=multi-user.target

4、加载配置并启动设置开机自启

systemctl daemon-reload
systemctl start alertmanager.service
systemctl enable alertmanager.service

5、检查alertmanager端口是否启动

root@k8s-03 alertmanager]# netstat -ntlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name    
tcp        0      0 127.0.0.1:25            0.0.0.0:*               LISTEN      1332/master         
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1069/sshd           
tcp6       0      0 ::1:25                  :::*                    LISTEN      1332/master         
tcp6       0      0 :::9090                 :::*                    LISTEN      2564/prometheus     
tcp6       0      0 :::9093                 :::*                    LISTEN      12462/alertmanager  
tcp6       0      0 :::9094                 :::*                    LISTEN      12462/alertmanager  
tcp6       0      0 :::9100                 :::*                    LISTEN      12322/node_exporter 
tcp6       0      0 :::22                   :::*                    LISTEN      1069/sshd

浏览器访问alertmanager 界面
ip+9093

6、修改prometheus 告警地址

修改 alertmanager 地址

# Alertmanager configuration
alerting:
  alertmanagers:
  - static_configs:
    - targets: ['localhost:9093']

prometheus 完整配置

[root@k8s-03 prometheus]# cat prometheus.yml 
# my global config
global:
  scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
  - static_configs:
    - targets: ['localhost:9093']

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
  # - "first_rules.yml"
  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
  # The job name is added as a label `job=` to any timeseries scraped from this config.
  - job_name: "prometheus"

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    static_configs:
      - targets: ["localhost:9090"]
  - job_name: "node_exporter"
    static_configs:
      - targets: ["localhost:9100"]

四、安装钉钉告警

1、下载钉钉告警插件

[root@k8s-03 src]# wget https://github.com/timonwong/prometheus-webhook-dingtalk/releases/download/v2.0.0/prometheus-webhook-dingtalk-2.0.0.linux-amd64.tar.gz

2、解压压缩包

[root@k8s-03 src]# tar -zxvf prometheus-webhook-dingtalk-2.0.0.linux-amd64.tar.gz -C /usr/local/
[root@k8s-03 local]# mv prometheus-webhook-dingtalk-2.0.0.linux-amd64/ prometheus-webhook-dingtalk

3、设置钉钉告警模板

[root@k8s-03 templates]# pwd
/usr/local/prometheus-webhook-dingtalk/templates
[root@k8s-03 templates]# cat webhook.tmpl 
{{- define "webhook.tmpl" }}
{{- range $i, $alert := .Alerts.Firing -}}
[报警项]:{{ index $alert.Labels "alertname" }}
[实例]:{{ index $alert.Labels "instance" }}
[job]:{{ index $alert.Labels "job" }}
[报警内容]:{{ index $alert.Annotations "summary" }}
[开始时间]:{{ $alert.StartsAt.Format "2006-01-02 15:04:05" }}
====================
{{- end }}
{{- end }}

4、修改钉钉机器人告警配置

我这里使用加签机器人（建钉钉机器人可以勾选关键词、验签、IP地址）

templates:
  - templates/webhook.tmpl 
## Targets, previously was known as "profiles"
targets:
  webhook1:     #加签的机器人
    url: https://oapi.dingtalk.com/robot/send?access_token=953d580a587dfb790df0bcfd70*******7d534c3a88
    # secret for signature
    secret: SEC6f8a6137e0c*******************221bac7009c52
  webhook2:     #不加签的机器人
    url: https://oapi.dingtalk.com/robot/send?access_token=xxxxxxxxxxxx
  webhook_legacy:
    url: https://oapi.dingtalk.com/robot/send?access_token=xxxxxxxxxxxx
    # Customize template content
    message:
      # Use legacy template
      title: '{{ template "legacy.title" . }}'
      text: '{{ template "legacy.content" . }}'
  webhook_mention_all:   #@所有人钉钉
    url: https://oapi.dingtalk.com/robot/send?access_token=xxxxxxxxxxxx
    mention:
      all: true
  webhook_mention_users:  #@指定用户钉钉
    url: https://oapi.dingtalk.com/robot/send?access_token=xxxxxxxxxxxx
    mention:
      mobiles: ['156xxxx8827', '189xxxx8325']

创建 webhook-dingtalk系统服务启动文件
vim /usr/lib/systemd/system/webhook-dingtalk.service

[Unit]
Description=prometheus-webhook-dingtalk
Documentation=https://github.com/timonwong/prometheus-webhook-dingtalk
After=network.target

[Service]
User=prometheus
Group=prometheus
ExecStart=/usr/local/prometheus-webhook-dingtalk/prometheus-webhook-dingtalk  --config.file=/usr/local/prometheus-webhook-dingtalk/config.yml
Restart=on-failure

[Install]
WantedBy=multi-user.target

启动服务报错：

 Failed at step USER spawning /usr/local/prometheus-webhook-dingtalk/prometheus-webhook-dingtalk: No such process

解决办法：
命令的方式后台启动

nohup /usr/local/prometheus-webhook-dingtalk/prometheus-webhook-dingtalk  --config.file=/usr/local/prometheus-webhook-dingtalk/config.yml &

5、curl测试发信到钉钉(复制下面第二第三项)

#先传统模式测试一下是否能收到消息
curl 'https://oapi.dingtalk.com/robot/send?access_token=0df42dc863ec08274b3f3226ca1fc6cd3a85564343' \
-H 'Content-Type: application/json' \
-d '{"msgtype": "text", 
    "text": {
         "content": "shooter钉钉机器人群消息测试"
    }
  }'
 
#测试prometheus-webhook-dingtalk    （带验签 webhook1）
curl 'http://localhost:8060/dingtalk/webhook1/send' \
-H 'Content-Type: application/json' \
-d '{"msgtype": "text", 
    "text": {
         "content": "shooter钉钉机器人群消息测试"
    }
  }'
 
 
curl 'http://localhost:8060/dingtalk/webhook1/send' \
   -H 'Content-Type: application/json' \
   -d '{"msgtype": "ding.link.text","text": {"ding.link.content": "'"咸鱼我来了"'"}}'

钉钉接收到消息说明成功了。(先不管消息为空的问题，这是因为接收参数问题)

五、修改alertmanager配置(钉钉告警版)

1、修改alertmanager.yml

global:
  resolve_timeout: 5m

#templates:
#  - '/opt/monitor/alertmanager/template/*.tmpl'

route:
  group_by: ['alertname']
  group_wait: 30s
  group_interval: 1m
  repeat_interval: 2m
  receiver: 'web.hook'
receivers:
- name: 'web.hook'
  webhook_configs:
  - url: 'http://localhost:8060/dingtalk/webhook1/send'
    send_resolved: true
inhibit_rules:
  - source_match:
      alertname: 'ApplicationDown'
      severity: 'critical'
    target_match:
      severity: 'warning'
    equal: ['alertname',"target","job","instance"]

2、重启alertmanager服务

systemctl restart alertmanager

六、设置prometheus告警规则

1、在 prometheus目录下新建rules文件夹，在文件夹下创建一个first_rules.yml规则文件
并设置告警规则 node节点不在线告警。

[root@k8s-03 rules]# pwd
/usr/local/prometheus/rules
[root@k8s-03 rules]# ls
first_rules.yml
[root@k8s-03 rules]# cat first_rules.yml 
groups:
    - name: 主机状态-监控告警
      rules:
      - alert: 主机状态
        expr: up == 0
        for: 1m
        labels:
          status: 非常严重
        annotations:
          summary: "{{$labels.instance}}:服务器宕机"
          description: "{{$labels.instance}}:服务器延时超过5分钟"

2、修改prometheus设置告警规则文件路径

rule_files:

“/usr/local/prometheus/rules/first_rules.yml”
去掉注释，并设置正确的规则文件路径。

# my global config
global:
  scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
# Alertmanager configuration
alerting:
  alertmanagers:
  - static_configs:
    - targets: ['localhost:9093']


# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
   - "/usr/local/prometheus/rules/first_rules.yml"
  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
  # The job name is added as a label `job=` to any timeseries scraped from this config.
  - job_name: "prometheus"

    # metrics_path defaults to '/metrics'
    # scheme defaults to 'http'.

    static_configs:
      - targets: ["localhost:9090"]
  - job_name: "node_exporter"
    static_configs:
      - targets: ["localhost:9100"]

3、热加载prometheus配置

ps -ef|grep prometheus
kill -HUP 2564

4、prometheus浏览器查看rules

5、关闭node_exporter服务看效果

systemctl stop node_exporter

alertmanager 浏览器查看是否有告警！

钉钉告警！

6、重启 node_exporter

[root@k8s-03 prometheus]# systemctl start node_exporter.service

linux 中路由解决方案1
在Linux的路由表中，当存在多条默认路由（0.0.0.0）且它们的Metric值相同时，内核会根据其他因素决定优先使用哪条路由。在你的例子中，eth1和wlan0的Metric值均为1024，但系统优先选择eth1，可能原因如下：可能原因分析接口优先级（基于接口索引或名称顺序）Linux内核可能会根据网络接口的创建顺序或接口索引号（ifindex）决定优先级。通常，先初始化的接口（如eth1）会
ubuntu 6.8.0 安装xenomai3.3 ZPC8210 ROS ubuntu linux 运维
通过以下步骤来获取和准备Linux内核6.8.0的源码，并应用Xenomai补丁：1.下载Linux内核6.8.0源码你可以从TheLinuxKernelArchives下载Linux内核6.8.0的源码。以下是具体步骤：访问内核官方网站：打开TheLinuxKernelArchives。找到对应版本的内核：在网站中找到内核6.8.0的下载链接。通常在v6.x目录下。下载源码：下载linux-6.
交叉编译Python-3.6.0到aarch64/aarch32 —— 支持sqlite3
参考https://datko.net/2013/05/10/cross-compiling-python-3-3-1-for-beaglebone-arm-angstrom/平台主机：ubuntu14.0464bit开发板：qemu+aarch64（参考：http://www.cnblogs.com/pengdonglin137/p/6442583.html）工具链：aarch64-linux-
跨平台ZeroMQ：在Rust中使用zmq库的完整指南涵树_fx 架构设计 Rust 实战 rust 开发语言后端
“消息就像神经元间的电信号，而ZeroMQ就是那个让系统思考的神经网络”——某个深夜调试zmq的程序员当你需要轻量级、高性能的进程间通信时，ZeroMQ就像代码世界里的瑞士军刀。今天我们一起探索如何在Rust生态中使用这把利器，感受它如何在不同操作系统间架起通信的桥梁。安装ZeroMQ：三大操作系统的通关秘籍Linux(Debian/Ubuntu)sudoaptupdatesudoaptinsta
在Linux环境下从0私有化部署Dify
在Linux环境下从0搭建Dify准备工作系统环境私有化部署下载Dify代码ZIP包启动Dify启动Docker容器访问Dify本地环境服务器环境准备工作因工作需要私有化部署公司内部的知识库，研究了一下准备采用Dify+RAG的方式实现，以下是具体步骤。系统环境服务器配置：官方建议2核4G以上；Liunx版本：RockyLinuxrelease9.4；Docker版本：28.1.1；Dify版本：
嵌入式Linux内核镜像生成过程飘逸轻舞 linux arm开发运维嵌入式
嵌入式Linux内核镜像生成过程嵌入式Linux系统的核心组件是内核，它是操作系统的核心部分，负责管理硬件资源、提供系统调用接口以及驱动设备等功能。在嵌入式系统中，将内核编译成镜像文件是部署系统的关键步骤之一。本文将介绍嵌入式Linux的内核镜像生成过程，并提供相应的源代码示例。获取Linux内核源代码首先，我们需要获取Linux内核的源代码。可以从Linux官方网站（www.kernel.org
Linux 启动过程流程图--ARM版进击的程序汪 linux arm开发运维
以下是ARM版本Linux启动过程的超详细树状图，涵盖硬件上电到应用程序交互的全流程，并包含关键函数调用链及源码位置，适用于系统开发与调试场景：ARMLinux启动全流程（含函数调用链）ARMLinux启动流程（函数级调用链）│├───**1.硬件上电与BootROM阶段**│││├───硬件复位与初始化││├───CPU进入Reset异常向量（ARM异常向量表基址0x0或0xffff0000）│
Markdown 安装使用教程小奇JAVA面试安装使用教程 markdown
一、Markdown简介Markdown是一种轻量级标记语言，语法简洁、易读易写，广泛用于编写博客、文档、README文件等。它可以导出为HTML、PDF等格式，兼容各种平台如GitHub、Typora、VSCode等。二、Markdown编辑器推荐2.1桌面端编辑器平台特点TyporaWindows/macOS/Linux所见即所得，简洁高效VSCode+插件跨平台强大可扩展，开发者首选Mark
linux下启动svn服务器,linux下svn服务器安装配置与启动
1.采用源文件编译安装。源文件共两个，为：subversion-1.6.1.tar.gz(subversion源文件)subversion-deps-1.6.1.tar.gz(subversion依赖文件)注意文件版本必须一致,否则很容易产生各种奇怪的问题.2.上传以上两个文件到服务器上，解压。解压命令为：tarxfvzsubversion-1.6.1.tar.gztarxfvzsubversio
Pushgateway扩展Prometheus监控 ivwdcwso 运维与云原生 prometheus k8s 云原生
Pushgateway是Prometheus生态系统中的一个重要组件,它允许我们将短期作业或批处理任务的指标推送到Prometheus中。本文将详细介绍如何安装、配置和使用Pushgateway来扩展Prometheus监控。1.Pushgateway简介Pushgateway主要用于解决以下场景:短期作业无法被Prometheus直接抓取批处理任务需要推送指标防火墙后的应用需要主动推送指标它作为
Prometheus系列01-Prometheus的单机版二进制部署 tinychen777 Devops linux 监控程序 centos
作为CNCF中最成功的开源项目之一，Prometheus已经成为了云原生监控的代名词，被广泛应用在Kubernetes和OpenShift等项目中，同时有很多第三方解决方案也会集成Prometheus。随着Kubernetes在容器调度和管理上确定领头羊的地位，Prometheus也成为Kubernetes容器监控的标配。考虑到k8s系统的复杂性和上手难度较高，本文将从最简单最基础的部分开始循序渐
【Prometheus】cAdvisor工作原理介绍码上淘金 prometheus
cAdvisor（ContainerAdvisor）是Google开源的容器监控工具，专注于实时采集和暴露容器级别的资源使用数据。其底层实现基于Linux内核的多项技术，结合高效的事件驱动架构，实现对容器资源的细粒度监控。以下从核心机制、数据采集原理和架构实现三方面详细解析：一、核心依赖技术cAdvisor的监控能力建立在Linux内核提供的底层机制之上：cgroups（控制组）资源隔离与统计：c
subversion安装、备份、安全认证实践笔记——宋轶聪 etune subversion svn apache tortoisesvn 工作存储
在windows上配置svn的方法在linux10.117.100.130上安装svnsvn库的导入导出查看svn服务器版本SVN备份策略Svn服务配置和维护常用命令linux下启动和停止win下启动和停止svn把svn加为系统服务配置apache通过http访问svnsvn命令行====================================在windows上的配置方法=========
【Prometheus】通过tar包部署单机版Prometheus 和 Pushgateway
在ECS（ElasticComputeService）机器上通过tar包部署Prometheus和Pushgateway，并配置Prometheus采集Pushgateway的数据，是一个常见的监控部署任务。以下是详细的步骤说明：环境准备操作系统：Linux（如CentOS、Ubuntu）已安装tar命名已开通ECS实例的相应端口（9090forPrometheus,9091forPushgate
内核必须懂(七): Linux四级页表(x64) weixin_34310127 操作系统
目录前言Intel四级页表实操寻址获取cr3获取PGD获取PUD获取PMD获取PTE获取内容最后前言Linux四级页表的作用主要就是地址映射,将逻辑地址映射到物理地址.很多时候,有些地方想不明白就可以查看实际物理地址进行分析.Intel四级页表其实很多设计的根源或者说原因都来自于CPU的设计,OS很多时候都是辅助CPU.Linux的四级页表就是依据CPU的四级页表来设计的.这里主要说的就是Inte
Linux内存管理和寻址详解 *烟雨 linux 驱动开发网络
1.概念内存管理模式段式：内存分为了多段，每段都是连续的内存，不同的段对应不用的用途。每个段的大小都不是统一的，会导致内存碎片和内存交换效率低的问题。页式：内存划分为多个内存页进行管理，如在Linux系统中，每一页的大小为4KB。由于分了页后，就不会产生细小的内存碎片。但是仍然也存在内存碎片问题。段页式：段式和页式结合。地址类型划分逻辑地址：程序所使用的地址，通常是没被段式内存管理映射的地址，称为
从小白到进阶：解锁linux与c语言高级编程知识点嵌入式开发的任督二脉（1） small_wh1te_coder 嵌入式 linux c 嵌入式硬件算法 c 汇编面试 linux
【硬核揭秘】Linux与C高级编程：从入门到精通，你的全栈之路！第一部分：初识Linux与环境搭建，玩转软件包管理——嵌入式开发的第一道“坎”嘿，各位C语言的“卷王”们！你可能已经习惯了在Windows或macOS上敲代码，用IDE点点鼠标就能编译运行。但当你踏入嵌入式开发的大门，尤其是涉及到那些跑着Linux系统的“大家伙”（比如树莓派、工控机、智能路由器），你就会发现，一个全新的世界在你面前展
Linux报错解决——导入了gcc版本，但是还是显示原来的gcc版本的解决办法 William.csj 报错解决 Ubuntu linux 运维服务器
一、问题描述我想要切换gcc版本，于是我用sudo安装了gcc-11，接着我在终端运行了：exportCC=/usr/bin/gcc-11exportCXX=/usr/bin/g++-11运行gcc--version还是显示：gcc(Ubuntu13.3.0-6ubuntu2~24.04)13.3.0二、原因分析即使你exportCC=/usr/bin/gcc-11，但gcc--version还是
Linux下Redis安装配置全攻略（2024最新版）「已注销」 linux redis 运维
手残党也能搞定的Redis安装指南还在为Linux安装Redis发愁？（别问我怎么知道的）今天这个保姆级教程绝对能让你爽到飞起！从零开始到完全可用只要10分钟，连小白都能轻松上手！（信我，真的）环境准备（超级重要）先确认你的Linux发行版（敲黑板！）：#查看系统信息cat/etc/os-release推荐系统：Ubuntu20.04/22.04LTSCentOS7/8RockyLinux8/9安
数据仓库技术及应用（Hive 产生背景与架构设计，存储模型与数据类型）娟恋无暇数据仓库笔记 hive
1.Hive产生背景传统Hadoop架构存在的一些问题：MapReduce编程必须掌握Java，门槛较高传统数据库开发、DBA、运维人员学习门槛高HDFS上没有Schema的概念，仅仅是一个纯文本文件Hive的产生：为了让用户从一个现有数据基础架构转移到Hadoop上现有数据基础架构大多基于关系型数据库和SQL查询Facebook诞生了Hive2.Hive是什么官网：https://hive.ap
python profile_python程序之profile分析
操作系统：CentOS7.3.1611_x64python版本：2.7.5问题描述1、Python开发的程序在使用过程中很慢，想确定下是哪段代码比较慢；2、Python开发的程序在使用过程中占用内存很大，想确定下是哪段代码引起的；解决方案使用profile分析分析cpu使用情况可以使用profile和cProfile对python程序进行分析，这里主要记录下cProfile的使用，profile参
Linux基础第5天-：Vim编译器的常用指令今天也好累 linux vim 运维笔记学习编辑器服务器
光标移动（了解）行间移动gg键：移动光标到第一行（命令模式下）G键：移动光标到最后一行（命令模式下）:n：移动到第n行，:6（移动到第6行）（末行模式下）列间移动$键：移动光标到当前行的行尾（最后一列），一般可以使用Shift+$（命令模式下）0键：移动光标到当前行的行首（第一列）（命令模式下）方向键↑↓上下实现行间移动，←→左右实现列间移动删除（重点）列删除x键：删除当前光标所在出的一个字符（命
Java面试八股文(2023最新)--Linux面试题月月崽面试 linux 运维服务器
目录1.什么是Linux内核2.Linux的体系结构.4.基本命令5.如何查看最近1000行日志6.如何查端口号是否被占用7.查看当前所有已经使用的端口情况8.什么是硬链接和软链接?1.什么是Linux内核Linux系统的核心是内核,内核控制着计算机系统上的软硬件,在必要时分配硬件,并根据需要执行软件.系统内存管理应用程序管理硬件设备管理文件系统管理2.Linux的体系结构.Linux体系结构可以
Linux 工作环境配置
终端shell如果是pc就安装iterm2，如果是远程服务器就跳过该步骤调整字体，主题；熟悉呼出和tab切换快捷键安装完成后，在/bin目录下会多出一个zsh的文件。修改默认终端，执行：【chsh-s/bin/zsh】chsh需要su权限，没有的话可以在bashrc中加入【exec/bin/zsh】此时可以安装autojump了，https://blog.csdn.net/liujan511536
webpack+vite前端构建工具 -答疑
webpack答疑1输入webpack命令，执行的是全局版本还是本地版本的webpack当在命令行窗口输入webpack命令时，其执行优先级可通过以下步骤明确判断：1.1【全局安装优先机制】执行原理：系统会按照环境变量PATH的顺序逐级查找可执行文件路径比对：全局安装路径：npminstall-gwebpack会安装在类似/usr/local/bin（Mac/Linux）或C:\Users\用户名
Linux: perf: debug问题一例，cpu使用率上升大约2%；多线程如何细化cpu及perf数据分析 mzhan017 kernel 系统性能 linux 服务器网络
文章目录前提面临的问题内核级别函数的差别继续debug总结根据pid前提一个进程安置在一个CPU上，新功能上线之后，固定量的业务打起来，占用的CPU是42%。之前没有新功能的情况下，CPU占用是40%。差了大约2%。而且这个进程里的线程数非常多，有50多个线程。从差距看变化不大，没有别的办法，只能使用perf来抓取数据来看。但是使用perf也要面临很多的问题。面临的问题面临的问题有一堆：两次per
K8s系列之：Kubernetes 的 OLM 快乐骑行^_^ Ansible Docker K8S 服务器相关知识总结 K8s系列 Kubernetes OLM
K8s系列之：Kubernetes的OLM什么是Kubernetes的OLM什么是Kubernetes中的OperatorOLM的功能OLM的核心组件OLM优势OLM的工作原理OLM与OperatorHub的关系OLM示例场景什么是CRDoperator和CRD的关系为什么需要CRD和OperatorCRD定义资源类型DebeziumServer如何使用debeziumoperatorDebezi
K8s系列之：Kubernetes 的 RBAC (Role-Based Access Control) 快乐骑行^_^ Ansible Docker K8S 服务器相关知识总结 K8s系列 Kubernetes RBAC Role-Based Access Control
K8s系列之：Kubernetes的RBACRole-BasedAccessControl认识RBACRBAC的关键概念RoleClusterRoleRoleBindingClusterRoleBindingRBAC的工作机制RBAC配置过程RBAC示例场景RBAC的优点总结认识RBACRBAC（基于角色的访问控制）是Kubernetes中的一种权限管理机制，用于控制用户或服务账户对Kuberne
11 DPDK 探索大页内存原理
在分析dpdk大页内存的源码之前，有必要对linux内存管理的原理以及大页内存的原理有个了解，缺少这些底层基础知识，分析dpdk大页内存的源码将举步维艰。这篇文章详细介绍下linux内存管理以及大页内存的方方面面，为分析dpdk大页内存源码扫除障碍。一、linux内存管理原理1、mmu内存管理的引入在没有引入mmu内存管理单元时，对于32位操作系统，每个进程都有2的32次方的地址空间(4G)。如果
vscode remote-ssh 拓展免密访问 linux虚拟机
前置步骤，在linux安装好ssh并且win可以使用密码登录linuxsudoaptinstallopenssh-server-y在win上检查密钥是否存在检查公钥和私钥cat~/.ssh/id_rsa.pubcat~/.ssh/id_rsa如果不存在，重新生成ssh-keygen-trsa-b4096重新执行cat~/.ssh/id_rsa.pub将公钥的内容粘贴到linux下~/.ssh/au
scala的option和some 矮蛋蛋编程 scala
原文地址： http://blog.sina.com.cn/s/blog_68af3f090100qkt8.html 对于学习 Scala 的 Java™ 开发人员来说，对象是一个比较自然、简单的入口点。在本系列前几期文章中，我介绍了 Scala 中一些面向对象的编程方法，这些方法实际上与 Java 编程的区别不是很大。我还向您展示了 Scala 如何重新应用传统的面向对象概念，找到其缺点
NullPointerException Cb123456 android BaseAdapter
java.lang.NullPointerException: Attempt to invoke virtual method 'int android.view.View.getImportantForAccessibility()' on a null object reference 出现以上异常.然后就在baidu上
PHP使用文件和目录天子之骄 php文件和目录读取和写入 php验证文件 php锁定文件
PHP使用文件和目录 1.使用include()包含文件 (1)：使用include()从一个被包含文档返回一个值 (2)：在控制结构中使用include() include_once()函数需要一个包含文件的路径，此外，第一次调用它的情况和include()一样，如果在脚本执行中再次对同一个文件调用，那么这个文件不会再次包含。在php.ini文件中设置
SQL SELECT DISTINCT 语句何必如此 sql
SELECT DISTINCT 语句用于返回唯一不同的值。 SQL SELECT DISTINCT 语句在表中，一个列可能会包含多个重复值，有时您也许希望仅仅列出不同（distinct）的值。 DISTINCT 关键词用于返回唯一不同的值。 SQL SELECT DISTINCT 语法 SELECT DISTINCT column_name,column_name F
java冒泡排序 3213213333332132 java 冒泡排序
package com.algorithm; /** * @Description 冒泡 * @author FuJianyong * 2015-1-22上午09:58:39 */ public class MaoPao { public static void main(String[] args) { int[] mao = {17,50,26,18,9,10
struts2.18 +json,struts2-json-plugin-2.1.8.1.jar配置及问题！ 7454103 DAO spring Ajax json qq
struts2.18 出来有段时间了！（貌似是稳定版）闲时研究下下！貌似 sruts2 搭配 json 做 ajax 很吃香！实践了下下！不当之处请绕过！呵呵网上一大堆 struts2+json 不过大多的json 插件都是 jsonplugin.34.jar strut
struts2 数据标签说明 darkranger jsp bean struts servlet Scheme
数据标签主要用于提供各种数据访问相关的功能，包括显示一个Action里的属性，以及生成国际化输出等功能数据标签主要包括： action ：该标签用于在JSP页面中直接调用一个Action，通过指定executeResult参数，还可将该Action的处理结果包含到本页面来。 bean ：该标签用于创建一个javabean实例。如果指定了id属性，则可以将创建的javabean实例放入Sta
链表.简单的链表节点构建 aijuans 编程技巧
/*编程环境WIN-TC*/ #include "stdio.h" #include "conio.h" #define NODE(name, key_word, help) \ Node name[1]={{NULL, NULL, NULL, key_word, help}} typedef struct node { &nbs
tomcat下jndi的三种配置方式 avords tomcat
jndi(Java Naming and Directory Interface，Java命名和目录接口)是一组在Java应用中访问命名和目录服务的API。命名服务将名称和对象联系起来，使得我们可以用名称访问对象。目录服务是一种命名服务，在这种服务里，对象不但有名称，还有属性。 tomcat配置
关于敏捷的一些想法 houxinyou 敏捷
从网上看到这样一句话：“敏捷开发的最重要目标就是：满足用户多变的需求，说白了就是最大程度的让客户满意。” 感觉表达的不太清楚。感觉容易被人误解的地方主要在“用户多变的需求”上。第一种多变，实际上就是没有从根本上了解了用户的需求。用户的需求实际是稳定的，只是比较多，也比较混乱，用户一般只能了解自己的那一小部分，所以没有用户能清楚的表达出整体需求。而由于各种条件的，用户表达自己那一部分时也有
富养还是穷养，决定孩子的一生 bijian1013 教育人生
是什么决定孩子未来物质能否丰盛？为什么说寒门很难出贵子，三代才能出贵族？真的是父母必须有钱，才能大概率保证孩子未来富有吗？-----作者：@李雪爱与自由事实并非由物质决定，而是由心灵决定。一朋友富有而且修养气质很好，兄弟姐妹也都如此。她的童年时代，物质上大家都很贫乏，但妈妈总是保持生活中的美感，时不时给孩子们带回一些美好小玩意，从来不对孩子传递生活艰辛、金钱来之不易、要懂得珍惜
oracle 日期时间格式转化征客丶 oracle
oracle 系统时间有 SYSDATE 与 SYSTIMESTAMP； SYSDATE：不支持毫秒，取的是系统时间； SYSTIMESTAMP：支持毫秒，日期，时间是给时区转换的，秒和毫秒是取的系统的。日期转字符窜：一、不取毫秒： TO_CHAR(SYSDATE, 'YYYY-MM-DD HH24:MI:SS') 简要说明， YYYY 年 MM 月
【Scala六】分析Spark源代码总结的Scala语法四 bit1129 scala
1. apply语法 FileShuffleBlockManager中定义的类ShuffleFileGroup，定义： private class ShuffleFileGroup(val shuffleId: Int, val fileId: Int, val files: Array[File]) { ... def apply(bucketId
Erlang中有意思的bug bookjovi erlang
代码中常有一些很搞笑的bug，如下面的一行代码被调用两次（Erlang beam） commit f667e4a47b07b07ed035073b94d699ff5fe0ba9b Author: Jovi Zhang <[email protected]> Date: Fri Dec 2 16:19:22 2011 +0100 erts:
移位打印10进制数转16进制-2008-08-18 ljy325 java 基础
/** * Description 移位打印10进制的16进制形式 * Creation Date 15-08-2008 9:00 * @author 卢俊宇 * @version 1.0 * */ public class PrintHex { // 备选字符 static final char di
读《研磨设计模式》-代码笔记-组合模式 bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.util.ArrayList; import java.util.List; abstract class Component { public abstract void printStruct(Str
利用cmd命令将.class文件打包成jar chenyu19891124 cmd jar
cmd命令打jar是如下实现：在运行里输入cmd，利用cmd命令进入到本地的工作盘符。(如我的是D盘下的文件有此路径 D:\workspace\prpall\WEB-INF\classes) 现在是想把D:\workspace\prpall\WEB-INF\classes路径下所有的文件打包成prpall.jar。然后继续如下操作： cd D: 回车 cd workspace/prpal
[原创]JWFD v0.96 工作流系统二次开发包 for Eclipse 简要说明 comsci eclipse 设计模式算法工作 swing
JWFD v0.96 工作流系统二次开发包 for Eclipse 简要说明 &nb
SecureCRT右键粘贴的设置 daizj secureCRT 右键粘贴
一般都习惯鼠标右键自动粘贴的功能，对于SecureCRT6.7.5 ，这个功能也已经是默认配置了。老版本的SecureCRT其实也有这个功能，只是不是默认设置，很多人不知道罢了。菜单： Options->Global Options ...->Terminal 右边有个Mouse的选项块。 Copy on Select Paste on Right/Middle
Linux 软链接和硬链接 dongwei_6688 linux
1.Linux链接概念Linux链接分两种，一种被称为硬链接（Hard Link），另一种被称为符号链接（Symbolic Link）。默认情况下，ln命令产生硬链接。【硬连接】硬连接指通过索引节点来进行连接。在Linux的文件系统中，保存在磁盘分区中的文件不管是什么类型都给它分配一个编号，称为索引节点号(Inode Index)。在Linux中，多个文件名指向同一索引节点是存在的。一般这种连
DIV底部自适应 dcj3sjt126com JavaScript
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml&q
Centos6.5使用yum安装mysql——快速上手必备 dcj3sjt126com mysql
第1步、yum安装mysql [root@stonex ~]# yum -y install mysql-server 安装结果： Installed: mysql-server.x86_64 0:5.1.73-3.el6_5 &nb
如何调试JDK源码 frank1234 jdk
相信各位小伙伴们跟我一样，想通过JDK源码来学习Java，比如collections包，java.util.concurrent包。可惜的是sun提供的jdk并不能查看运行中的局部变量，需要重新编译一下rt.jar。下面是编译jdk的具体步骤： 1.把C:\java\jdk1.6.0_26\sr
Maximal Rectangle hcx2013 max
Given a 2D binary matrix filled with 0's and 1's, find the largest rectangle containing all ones and return its area. public class Solution { public int maximalRectangle(char[][] matrix)
Spring MVC测试框架详解——服务端测试 jinnianshilongnian spring mvc test
随着RESTful Web Service的流行，测试对外的Service是否满足期望也变的必要的。从Spring 3.2开始Spring了Spring Web测试框架，如果版本低于3.2，请使用spring-test-mvc项目（合并到spring3.2中了）。 Spring MVC测试框架提供了对服务器端和客户端（基于RestTemplate的客户端）提供了支持。 &nbs
Linux64位操作系统（CentOS6.6）上如何编译hadoop2.4.0 liyong0802 hadoop
一、准备编译软件 1.在官网下载jdk1.7、maven3.2.1、ant1.9.4，解压设置好环境变量就可以用。环境变量设置如下：（1）执行vim /etc/profile （2）在文件尾部加入: export JAVA_HOME=/home/spark/jdk1.7 export MAVEN_HOME=/ho
StatusBar 字体白色 pangyulei status
[[UIApplication sharedApplication] setStatusBarStyle:UIStatusBarStyleLightContent]; /*you'll also need to set UIViewControllerBasedStatusBarAppearance to NO in the plist file if you use this method
如何分析Java虚拟机死锁 sesame java thread oracle 虚拟机 jdbc
英文资料： Thread Dump and Concurrency Locks Thread dumps are very useful for diagnosing synchronization related problems such as deadlocks on object monitors. Ctrl-\ on Solaris/Linux or Ctrl-B
位运算简介及实用技巧（一）：基础篇 tw_wangzhengquan 位运算
http://www.matrix67.com/blog/archives/263 去年年底写的关于位运算的日志是这个Blog里少数大受欢迎的文章之一，很多人都希望我能不断完善那篇文章。后来我看到了不少其它的资料，学习到了更多关于位运算的知识，有了重新整理位运算技巧的想法。从今天起我就开始写这一系列位运算讲解文章，与其说是原来那篇文章的follow-up，不如说是一个r
jsearch的索引文件结构 yangshangchuan 搜索引擎 jsearch 全文检索信息检索 word分词
jsearch是一个高性能的全文检索工具包，基于倒排索引，基于java8，类似于lucene，但更轻量级。 jsearch的索引文件结构定义如下： 1、一个词的索引由=分割的三部分组成：第一部分是词第二部分是这个词在多少