Nagios监控搭建与配置详细步骤

(一)安装Nagios   (Nagios服务器为:192.168.6.6    Nagios客户端为: 192.168.2.33)

1.基础支持套件:gcc glibc glibc-common gd gd-devel xinetd openssl-devel httpd php   注:php和httpd均用源码包安装,安装配置方法此处不在详述

# yum install -y gcc glibc glibc-common gd gd-devel xinetd openssl-devel

2.创建Nagios账户和组

#useradd -m nagios
#groupadd nagcmd
#usermod -a -G nagcmd nagios
#usermod -a -G nagcmd apache

3.编译安装

#tar xvf nagios-3.5.1.tar.gz
#cd nagios-3.5.1
#./configure prefix=/usr/local/nagios --with-command-group=nagcmd --with-nagios-user=nagios --with-nagios-group=nagios
#make all
#make install
#make install-init        (生成init启动脚本)
#make install-config      (生成一些模板配置文件)
#make install-commandmode (设置相应的权限)
#make install-webconf     (生成Apache配置文件nagios.conf)

4.为Nagios设置Web验证的密码

#/usr/local/apache/bin/htpasswd -c /usr/local/nagios/etc/htpasswd.user nagiosadmin

5.设置Nagios的开机启动

chkconfig --add nagios
chkconfig nagios on

6.安装Nagios的插件nagios-plugin

#tar zxvf nagios-plugins-1.4.16.tar.gz
#cd nagios-plugins-1.4.16
#./configure --prefix=/usr/local/nagios --with-nagios-user=nagios --with-nagios-group=nagios
 --with-apt-get-command --with-ping6-command --with-ping-command --with-mysql
 --with-gnutls --enable-extra-opts
#make
#make install


7.此时完成初步安装,可以监控查看本机的一些服务,检测配置文件并启动nagios

#/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg

Nagios Core 3.5.1
Copyright (c) 2009-2011 Nagios Core Development Team and Community Contributors
Copyright (c) 1999-2009 Ethan Galstad
Last Modified: 08-30-2013
License: GPL

Website: http://www.nagios.org
Reading configuration data...
   Read main config file okay...
Processing object config file '/usr/local/nagios/etc/objects/commands.cfg'...
Processing object config file '/usr/local/nagios/etc/objects/contacts.cfg'...
Processing object config file '/usr/local/nagios/etc/objects/timeperiods.cfg'...
Processing object config file '/usr/local/nagios/etc/objects/templates.cfg'...
Processing object config directory '/usr/local/nagios/etc/servers'...
Processing object config file '/usr/local/nagios/etc/servers/localhost.cfg'...
   Read object config files okay...

Running pre-flight check on configuration data...

Checking services...
 Checked 6 services.
Checking hosts...
 Checked 1 hosts.
Checking host groups...
 Checked 0 host groups.
Checking service groups...
 Checked 0 service groups.
Checking contacts...
 Checked 1 contacts.
Checking contact groups...
 Checked 1 contact groups.
Checking service escalations...
 Checked 0 service escalations.
Checking service dependencies...
 Checked 0 service dependencies.
Checking host escalations...
 Checked 0 host escalations.
Checking host dependencies...
 Checked 0 host dependencies.
Checking commands...
 Checked 25 commands.
Checking time periods...
 Checked 5 time periods.
Checking for circular paths between hosts...
Checking for circular host and service dependencies...
Checking global event handlers...
Checking obsessive compulsive processor commands...
Checking misc settings...

Total Warnings: 0
Total Errors:   0

Things look okay - No serious problems were detected during the pre-flight check

出现此处,表明,配置文件没有错误,可以启动nagios和apache

#service nagios start
#/usr/local/apache/sbin/apchectl

访问nagios
http://192.168.6.6/nagios/ 

此时提示页面无法访问,原因在于由于Apache是源码包安装,默认路径和rpm包不一样,需要在Apache的httpd.conf配置文件中添加指定访问路径


8. 配置apache并加载nagios登录页面 
找到apache 的配置文件/usr/local/apache/conf/httpd.conf

找到:

User daemon
Group daemon 修改为

User nagios
Group nagios

为了安全起见,一般情况下要让nagios 的web 监控页面必须经过授权才能访问,这需要增加验证配置,即在httpd.conf 文件最后添加如下信息:

下面信息在编译nagios(make install-webconf )时就已经生成,配置信息在:/etc/httpd/confd.d/nagios.conf 文件中

#######################################################################
#setting for nagios
ScriptAlias /nagios/cgi-bin "/usr/local/nagios/sbin"

     AuthType Basic
     Options ExecCGI
     AllowOverride None
     Order allow,deny
     Allow from all
     AuthName "Nagios Access"
     AuthUserFile /usr/local/nagios/etc/htpasswd.user
     Require valid-user

Alias /nagios "/usr/local/nagios/share"

     AuthType Basic
     Options None
     AllowOverride None
     Order allow,deny
     Allow from all
     AuthName "nagios Access"
     AuthUserFile /usr/local/nagios/etc/htpasswd.user
     Require valid-user

###########################################################################

9.重启nagios、apache并访问nagios

#service nagios restart
#/usr/local/apache/bin/apachectl restart
http://192.168.6.6/nagions

提示输入用户名密码,访问成功

但是登陆进去后,nagios页面右侧全部乱码
解决方法:

主要是apache没有开启cgi脚本的缘故

进入apache的主配置文件httpd.conf
#vim /usr/local/apache/conf/httpd.conf
 
#LoadModule cgid_module modules/mod_cgid.so
#LoadModule actions_module modules/mod_actions.so

将上面2行的#去掉,重启apache就OK

再次访问 ,乱码消失OK!

(二)配置Nagios

1.nagios配置目录信息

# cd /usr/local/nagios/etc/
# ls
cgi.cfg  htpasswd.user  nagios.cfg  objects  resource.cfg
[root@localhost etc]# ll
total 68
-rw-rw-r-- 1 nagios nagios 11669 Nov 29 14:18 cgi.cfg (CGI配置文件)
-rw-r--r-- 1 root   root      50 Nov 29 14:20 htpasswd.user (Apache的验证密码文件)
-rw-rw-r-- 1 nagios nagios 44710 Nov 29 14:18 nagios.cfg (主配置文件)
drwxrwxr-x 2 nagios nagios  4096 Nov 29 14:18 objects (对象定义文件目录)
-rw-rw---- 1 nagios nagios  1340 Nov 29 14:18 resource.cfg (资源配置文件)

2.修改nagios.cfg主配置文件

#vim nagios.cfg

注释掉:cfg_file=/usr/local/nagios/etc/objects/localhost.cfg  ―――― #cfg_file=/usr/local/nagios/etc/objects/localhost.cfg
将 #cfg_dir=/usr/local/nagios/etc/servers  的 #(注释)去掉 -----  cfg_dir=/usr/local/nagios/etc/servers

在/usr/local/nagios/etc/目录中新建 servers子目录,在里面可以直接添加主机配置文件
#mkdir servers

3.配置object目录中的配置文件

#cd objects/
#ll
total 48
-rw-rw-r-- 1 nagios nagios  7716 Nov 29 14:18 commands.cfg (命令定义文件)
-rw-rw-r-- 1 nagios nagios  2166 Nov 29 14:18 contacts.cfg (联系人信息定义文件)
-rw-rw-r-- 1 nagios nagios  5403 Nov 29 14:18 localhost.cfg
-rw-rw-r-- 1 nagios nagios  3124 Nov 29 14:18 printer.cfg
-rw-rw-r-- 1 nagios nagios  3293 Nov 29 14:18 switch.cfg
-rw-rw-r-- 1 nagios nagios 10812 Nov 29 14:18 templates.cfg
-rw-rw-r-- 1 nagios nagios  3208 Nov 29 14:18 timeperiods.cfg (时间周期定义文件)
-rw-rw-r-- 1 nagios nagios  4019 Nov 29 14:18 windows.cfg

配置联系人信息(邮件接收者邮箱地址)

联系人定义:
#vim contacts.cfg
将 email 字段后边的  nagios@localhost  改成自己的邮箱,将报警信息发送的此邮箱,比如 [email protected] 
如果是设置提醒多个邮箱可以在后跟其它邮箱地址,以逗号隔开,比如: [email protected],[email protected]

保存,退出。

(三)nrpe安装配置

1.安装nrpe

#tar zxvf nrpe-2.12.tar.gz
#cd nrpe-1.12
# ./configure && make all
#make install-plugin
#make install-daemon
#make install-daemon-config
#make install-xinetd

2.配置nrpe

#vim /etc/xinetd.d/nrpe
在only_from=127.0.0.1 后添加 192.168.6.6 以空格隔开

3.添加端口

#vim /etc/services       在最后添加   nrpe  5666/tcp   #nrpe


修改配置文件/usr/local/nagios/etc/objects/commands.cfg加入对nrpe的支持

#vim /usr/local/nagios/etc/objects/commands.cfg     在末尾添加如下内容

##############################################################################
#nrpe set
define command{
command_name check_nrpe
command_line /usr/local/nagios/libexec/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
}
##############################################################################


4.配置/usr/local/nagios/etc/nrpe.cfg 文件

#cd /usr/local/nagios/etc/nrpe.cfg
#vim nrpe.cfg
将command[check_hda1]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/hda1 中的hda1改为: sda  如下:
command[check_sda]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/sda

将command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200  改为:
  command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 400 -c 450

添加 command[check_swap]=/usr/local/nagios/libexec/check_swap -w 90% -c 80%

保存,退出。

重启xinetd服务

OK,Nagios服务端安装成功!!!

 

(四)Nagios客户端安装配置


#yum install -y openssl openssl-devel

#useradd -s /sbin/nlogin nagios


2.安装nagios-plugin

# cd /opt/software/
#tar zxvf nagios-plugins-1.4.16.tar.gz
#cd nagios-plugin-1.4.16
#./configure --prefix=/usr/local/nagios --with-nagios-user=nagios --with-nagios-group=nagios --enable-libtap
--enable-redhat-pthread-workaround --with-apt-get-command --with-ping6-command --with-ping-command
--with-mysql --with-gnutls --enable-extra-opts --with-openssl --with-trusted-path
#make
#make install

3.安装配置nrpe

#yum install xinetd -y

#tar zxvf nrpe-2.12.tar.gz
#cd nrpe-2.12
#./configure && make all
#make install-plugin
#make install-daemon
#make install-daemon-config
#make install-xinetd

#vim /etc/xinetd.d/nrpe 
在only_from    = 127.0.0.1 后添加 nagios服务器端IP地址 192.168.6.6 如下:
  only_from    = 127.0.0.1 192.168.6.6

#vim /etc/services
在文件末尾添加 nrpe   5666/tcp      #nrpe

修改/usr/local/nagios/etc/nrpe.cfg

#cp -p  /usr/local/nagios/etc/nrpe.cfg    /usr/local/nagios/etc/nrpe.cfg.default   -修改前先备份一下该配置文件

#vim /usr/local/nagios/etc/nrpe.cfg
将nrpe.cfg文件中的 hda1 字符全部修改为 sda 
或者使用sed命令批量修改,如下:
sed  -i 's/hda1/sda/g'  /usr/local/nagios/etc/nrpe.cfg

添加swep分区监控:  command[check_swap]=/usr/local/nagios/libexec/check_swap -w 90% -c 80%

保存,退出。

重启xinetd服务 :
#service xinetd restart
 

(五) 在服务端/usr/local/nagios/etc/servers/目录中添加编辑被监控主机配置文件

1.本地主机配置文件模版

#cd /usr/local/nagios/etc/servers
#touch localhost.cfg   注:localhost.cfg 为本地主机配置文件
#vim localhost.cfg   添加如下模版内容
##########################################################################################
#define  host

define host{
host_name 192.168.6.6-nagios server     #主机名称,可随便定义
alias nagios Server                     #服务器别名,监控端为Server 被监控端为 Client
address 192.168.6.6                     #服务器端IP地址 或者被监控端IP地址
check_command check-host-alive          #检查的命令
check_interval 1
#retry_interval 1
max_check_attempts 1
check_period 24x7                       #检查的时间范围
process_perf_data 0
retain_nonstatus_information 0
contact_groups admins                   #联系人组
notification_interval 10                #检查时间间隔,单位为分钟
notification_period 24x7
notification_options d,u,r              #通知选项,d-宕机(down)  w-报警(warning)  u-未知(unkown) c-严重(critical) r-从异常情况恢复
}

#define services

#define check-host-alive
define service {
host_name 192.168.6.6-nagios server
service_description check-host-alive
check_period 24x7
max_check_attempts 1
normal_check_interval 1
#retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check-host-alive
}

define service {
host_name 192.168.6.6-nagios server
service_description check-users
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_users
}

define service {
host_name 192.168.6.6-nagios server
service_description check-load
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_load
}

define service {
host_name 192.168.6.6-nagios server
service_description check-total-procs
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_total_procs
}

define service {
host_name 192.168.6.6-nagios server
service_description check_sda
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_sda
}

define service {
host_name 192.168.6.6-nagios server
service_description check_swap
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_swap
}
########################################################################################

2.其它主机配置文件模版

#cd /usr/local/nagios/etc/servers/
#touch 192.168.2.33.cfg
#vim 192.168.2.33.cfg   添加如下内容

##################################################################################
#define  host

define host{
host_name 192.168.2.33-Test
alias nagios Client
address 192.168.2.33
check_command check-host-alive
check_interval 1
#retry_interval 1
max_check_attempts 1
check_period 24x7
process_perf_data 0
retain_nonstatus_information 0
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options d,u,r
}

#define check-host-alive
define service {
host_name 192.168.2.33-Test
service_description check-host-alive
check_period 24x7
max_check_attempts 1
normal_check_interval 1
#retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check-host-alive
}

define service {
host_name 192.168.2.33-Test
service_description check-users
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_users
}

define service {
host_name 192.168.2.33-Test
service_description check-load
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_load
}

define service {
host_name 192.168.2.33-Test
service_description check-total-procs
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_total_procs
}

define service {
host_name 192.168.2.33-Test
service_description check_sda
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_sda
}

define service {
host_name 192.168.2.33-Test
service_description check_swap
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_swap
}
##############################################################################################

3.监控服务器监控运行的端口

(1)Nagios客户端配置

#vim /usr/local/nagios/etc/nrpe.cfg  文件末尾添加定义端口信息,如下:

command[check_TemplateUpload:8080]=/usr/local/nagios/libexec/check_tcp -H localhost -p 8080  -w 120 -c 180

保存,退出

重启xinetd服务

#service xinetd restart


(2)Nagios服务端配置

#vim /usr/local/nagios/etc/services/192.168.2.33.cfg 文件末尾添加如下内容

define service {
host_name 192.168.2.33-Test
service_description check_TemplateUpload:8080 
check_period 24x7
max_check_attempts 4
normal_check_interval 1
retry_check_interval 1
contact_groups admins
notification_interval 10
notification_period 24x7
notification_options w,u,c,r
check_command check_nrpe!check_TemplateUpload:8080
}

保存,退出

重启nagios服务

#service nagios restart

访问Nagios服务

http://192.168.6.6/nagios

大功告成!!!!!

你可能感兴趣的:(服务器,local,客户端,监控)