zabbix haproxy 监控

摘自:前端

http://www.tuicool.com/articles/JrYNNrmnode

写的很是好,步步紧逼,环环相扣。直到成功!git

文章首发站点:OpensGalaxy

这是一个HAProxy的zabbix监控模板,HAProxy提供高可用性、负载均衡以及基于TCP和HTTP应用的代理,支持虚拟主机,它是免费、快速而且可靠的一种解决方案。HAProxy自己提供一个web页面能够显示haproxy的各类信息,方便管理员查看系统状态等。这次我采用socat(socat是netcat的扩展实现)这款开源软件经过haproxy的socks配合zabbix实现haproxy系统的状态监控。

1、安装socat

[root@haproxy01 ~]# wget http://www.dest-unreach.org/socat/download/socat-1.7.3.0.tar.bz2
[root@haproxy01 ~]# tar xzf socat-1.7.3.0.tar.gz [root@haproxy01 ~]# cd socat-1.7.3.0 [root@haproxy01 ~]# ./configure [root@haproxy01 ~]# make [root@haproxy01 ~]# make install [root@haproxy01 ~]# which socat ###安装完自动会在/usr/local/bin 建立执行程序。 /usr/local/bin/socat

2、开启haproxy sock

编辑haproxy配置文件在global下添加以下:

global
    stats socket    /usr/local/haproxy/haproxy.sock # 重启haproxy

Haproxy 信息

[root@haproxy01 ~]# echo "show info" |socat /usr/local/haproxy/haproxy.sock stdio
Name: HAProxy Version: 1.4.24 Release_date: 2013/06/17 Nbproc: 1 Process_num: 1 Pid: 3499 Uptime: 2d 1h00m47s Uptime_sec: 176447 Memmax_MB: 0 Ulimit-n: 131102 Maxsock: 131102 Maxconn: 65535 Maxpipes: 0 CurrConns: 18 PipesUsed: 0 PipesFree: 0 Tasks: 34 Run_queue: 1 node: HAProxy01 description: haproxy server 01

Haproxy 状态

[root@CMHAProxy01 ~]# echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio
# pxname,svname,qcur,qmax,scur,smax,slim,stot,bin,bout,dreq,dresp,ereq,econ,eresp,wretr,wredis,status,weight,act,bck,chkfail,chkdown,lastchg,downtime,qlimit,pid,iid,sid,throttle,lbtot,tracked,type,rate,rate_lim,rate_max,check_status,check_code,check_duration,hrsp_1xx,hrsp_2xx,hrsp_3xx,hrsp_4xx,hrsp_5xx,hrsp_other,hanafail,req_rate,req_rate_max,req_tot,cli_abrt,srv_abrt,admin_stats,FRONTEND,,,0,1,2000,2918,1779980,133971311,0,0,0,,,,,OPEN,,,,,,,,,1,1,0,,,,0,0,0,1,,,,0,2918,0,0,0,0,,0,1,2918,,, admin_stats,BACKEND,0,0,0,0,2000,0,1779980,133971311,0,0,,0,0,0,0,UP,0,0,0,,0,176433,0,,1,1,0,,0,,1,0,,0,,,,0,0,0,0,0,0,,,,,0,0, test,FRONTEND,,,0,2,2000,5402,1267880,26563174,0,0,0,,,,,OPEN,,,,,,,,,1,2,0,,,,0,0,0,3,,,,0,5402,0,0,0,0,,0,3,5402,,, test,test1,0,0,0,1,,2701,632860,13110551,,0,,0,0,0,0,UP,3,1,0,1,0,176433,0,,1,2,1,,2701,,2,0,,2,L7OK,200,3,0,2701,0,0,0,0,0,,,,0,1, test,test2,0,0,0,1,,2701,635020,13452623,,0,,0,0,0,0,UP,3,1,0,0,0,176433,0,,1,2,2,,2700,,2,0,,2,L7OK,200,3,0,2701,0,0,0,0,0,,,,0,0, test,BACKEND,0,0,0,2,2000,5402,1267880,26563174,0,0,,0,0,0,0,UP,6,2,0,,0,176433,0,,1,2,0,,5401,,1,0,,3,,,,0,5402,0,0,0,0,,,,,0,1,

看到这里你们都明白了,咱们能够经过“show stat”返回的信息,过滤出咱们想要获得的监控字段数据。这里咱们主要取qcur(queue数量)、scur(session数量)、bin(入站流量)、bout(出站流量)、status(后端服务器状态)、downtime(宕机总时长)、check_status(健康检测状态),自定义key收集这些数据制做相应的item,若是你想监控更多信息能够取更多的字段数据。

你们看到这些字段数据使用”,”分隔,很是方便咱们用awk来过滤数据,具体自定义key见代码。首先,咱们须要创建一个HAProxy模版,而后创建发现规则,来发现这些后端服务器,这样每次haproxy添加服务的时候,zabbix就会自动发现这些后端服务器、自动创建item、自动创建Graph、自动创建Trigger,一切所有自动完成。

3、Step by Step

一、创建一个haproxy模版

二、创建发现规则

key shell以下:(这里有一点须要注意,就是haproxy.sock这个文件的权限问题,默认是644,单纯执行这个shell没问题,但若是须要zabbix用户去获取返回值的时候就会提示权限不足,因此我直接将haproxy.sock设置成了666,也能够设置visudo,我为了方便直接修改了这个文件权限)github

[root@haproxy01 ~]# cat /usr/bin/has.discovery.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin printf '{\n' printf '\t"data":[\n' export LINE=$(echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep -v "^#"|grep -v "^admin_stats"|awk -F "," '{print $2}'|grep -v "^FRONTEND"|grep -v "^BACKEND"|grep -v "^$"|wc -l) ##(这里我过滤调admin_stats web管理台、FRONTEND前端、BACKEND后端 获得的是全部后端服务器的列表) export N=0 #printf '\t\t{\n' for i in $(echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep -v "^#"|grep -v "^admin_stats"|awk -F "," '{print $2}'|grep -v "^FRONTEND"|grep -v "^BACKEND"); do printf "\t\t\t{\"{#SNAME}\":\"$i\"}"; N=$(($N+1)) if [ $N -eq $LINE ]; ## 最后一行是不能加","的,因此须要if判断一下是否到了最后一行。 then printf '\n' else printf ',\n' fi done; printf ']}\n'

执行结果以下:(zabbix的发现规则就是要返回json格式的数据)

[root@haproxy01 haproxy]# sh /usr/bin/has.discovery.sh 
{
    "data":[ {"{#SNAME}":"test1"}, {"{#SNAME}":"test2"}, {"{#SNAME}":"testapi1"}, {"{#SNAME}":"test51"}, {"{#SNAME}":"test52"}, {"{#SNAME}":"testclient1"}, {"{#SNAME}":"testclient2"}, {"{#SNAME}":"testmobile1"}, {"{#SNAME}":"testmobile2"} ]}

三、创建自定义key

+ Queue数量[qcur]

key shell以下:web

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.queue.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $3}' ## 重点就是经过awk取哪一个字段的数据

+ Session数量[scur]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.session.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $5}'

+ 虚拟服务器入站流量[bin]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.bin.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $9}'

+ 虚拟服务器出站流量[bout]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.bout.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $10}'

+ 后端服务器状态[status]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.status.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $18}'

+ 宕机总时长[downtime]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.downtime.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $25}'

+ 健康检测状态[check_status]

[root@haproxy01 haproxy]# cat /usr/bin/ckey.ha.lastchk.sh 
#!/bin/bash export PATH=/usr/lib64/qt-3.3/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin echo "show stat" |socat /usr/local/haproxy/haproxy.sock stdio|grep "\<$1\>"|grep -v "^FRONTEND"|grep -v "^BACKEND"|awk -F "," '{print $38}'

+ 定义key

[root@haproxy01 haproxy]# cat /etc/zabbix/zabbix_agentd.d/HaProxy_status.conf UserParameter=has.discovery,/usr/bin/has.discovery.sh UserParameter=ckey.ha.bin[*],/usr/bin/ckey.ha.bin.sh $1 UserParameter=ckey.ha.bout[*],/usr/bin/ckey.ha.bout.sh $1 UserParameter=ckey.ha.downtime[*],/usr/bin/ckey.ha.downtime.sh $1 UserParameter=ckey.ha.lastchk[*],/usr/bin/ckey.ha.lastchk.sh $1 UserParameter=ckey.ha.queue[*],/usr/bin/ckey.ha.queue.sh $1 UserParameter=ckey.ha.session[*],/usr/bin/ckey.ha.session.sh $1 UserParameter=ckey.ha.status[*],/usr/bin/ckey.ha.status.sh $1

重启zabbix-agentredis

+ 验证一下:

[root@zabbixproxy01 ~]# zabbix_get -s 192.168.1.100 -k has.discovery
{
    "data":[ {"{#SNAME}":"test1"}, {"{#SNAME}":"test2"}, {"{#SNAME}":"testapi1"}, {"{#SNAME}":"test51"}, {"{#SNAME}":"test52"}, {"{#SNAME}":"testclient1"}, {"{#SNAME}":"testclient2"}, {"{#SNAME}":"testmobile1"}, {"{#SNAME}":"testmobile2"} ]} [root@zabbixproxy01 ~]# zabbix_get -s 192.168.1.100 -k ckey.ha.status[test1] UP

其他的key验证效果大同小异,这里就不浪费篇幅一一列举了。shell

四、建立Graph prototypes

+ 网络流量图

+ Session

+ 健康检测状态

+ 宕机总时间

生成的Graphs效果图

五、建立Trigger prototypes

+ inbound网络流量最近5分钟内超过2Mbps

+ outbound网络流量最近5分钟内超过2Mbps

+ 后端服务器健康检测异常

+ 后端服务器宕机

这里设置一下依赖,由于服务器是否宕机依赖于健康检测状态

到此为止,全套的Haproxy监控模版就设置完成了,你能够根据你实际的需求作调整,总之这只是一个实现思路。

key shell和导出的模版下载地址以下:

自定义key shelljson

导出的haproxy模版vim

你们也能够访问个人github https://github.com/EdisonWujr/zabbix-templates

相关文章
相关标签/搜索