LVS服务器网络经常死掉

两台LVS服务器,按Active-Standby方式配置,LVS服务器的网络经常断掉,如果主或备不要启动heartbeat,单独一台运行,完全没有任可问题。

出现的错误提示如下:不知啥原因,我把keepalive由原来1改为5,还是一样的。

eartbeat[9610]: 2006/04/24_21:35:46 info: Initial resource acquisition complete (auto_failback)
heartbeat[9610]: 2006/04/24_21:35:46 info: remote resource transition completed.
heartbeat[9618]: 2006/04/24_21:45:02 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:02 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:45:03 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:03 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:45:04 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:04 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:45:05 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:05 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:45:06 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:06 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:45:07 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:45:07 ERROR: write failure on ucast eth1.: No such device
heartbeat[9610]: 2006/04/24_21:45:08 WARN: 6 lost packet(s) for [lvsmasterserver01] [858:865]
heartbeat[9610]: 2006/04/24_21:45:09 info: No pkts missing from lvsmasterserver01!
heartbeat[9610]: 2006/04/24_21:47:11 ERROR: Both machines own our resources!
heartbeat[9610]: 2006/04/24_21:47:11 ERROR: Both machines own our resources!
heartbeat[9618]: 2006/04/24_21:48:01 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:01 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:01 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:01 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:02 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:02 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:03 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:03 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:04 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:04 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:05 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:05 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:06 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:06 ERROR: write failure on ucast eth1.: No such device
heartbeat[9618]: 2006/04/24_21:48:07 ERROR: glib: Unable to send [-1] ucast packet: No such device
heartbeat[9618]: 2006/04/24_21:48:07 ERROR: write failure on ucast eth1.: No such device
heartbeat[9610]: 2006/04/24_21:48:08 WARN: 6 lost packet(s) for [lvsmasterserver01] [1039:1046]
heartbeat[9610]: 2006/04/24_21:48:21 ERROR: Irretrievably lost packet: node lvsmasterserver01 seq 1043
heartbeat[9610]: 2006/04/24_21:48:21 ERROR: Irretrievably lost packet: node lvsmasterserver01 seq 1044
heartbeat[9610]: 2006/04/24_21:48:21 ERROR: Irretrievably lost packet: node lvsmasterserver01 seq 1045
heartbeat[9610]: 2006/04/24_21:48:22 WARN: 5 lost packet(s) for [lvsmasterserver01] [1054:1060]
/192.168.10.68 ldirectord lvsmac
IPaddr[13070]: 2006/04/24_21:54:30 INFO: IPaddr Running OK
ResourceManager[13036]: 2006/04/24_21:54:32 info: Running /etc/ha.d/resource.d/ldirectord start
ResourceManager[13036]: 2006/04/24_21:54:33 info: Running /etc/ha.d/resource.d/lvsmac start
mach_down[12810]: 2006/04/24_21:54:33 info: /usr/lib/heartbeat/mach_down: nice_failback: foreign resources acquired
mach_down[12810]: 2006/04/24_21:54:33 info: mach_down takeover complete for node lvsmasterserver01.
heartbeat[9610]: 2006/04/24_21:54:33 info: mach_down takeover complete.
heartbeat[9610]: 2006/04/24_21:54:56 info: Link lvsmasterserver01:eth1 dead.
heartbeat[9610]: 2006/04/24_21:54:59 info: Heartbeat restart on node lvsmasterserver01
heartbeat[9610]: 2006/04/24_21:54:59 info: Link lvsmasterserver01:eth1 up.
heartbeat[9610]: 2006/04/24_21:54:59 info: Status update for node lvsmasterserver01: status init
heartbeat[9610]: 2006/04/24_21:54:59 info: Status update for node lvsmasterserver01: status up
harc[13326]: 2006/04/24_21:54:59 info: Running /etc/ha.d/rc.d/status status
harc[13336]: 2006/04/24_21:54:59 info: Running /etc/ha.d/rc.d/status status
heartbeat[9610]: 2006/04/24_21:55:00 info: Status update for node lvsmasterserver01: status active
harc[13346]: 2006/04/24_21:55:00 info: Running /etc/ha.d/rc.d/status status
heartbeat[9610]: 2006/04/24_21:55:00 info: remote resource transition completed.
heartbeat[9610]: 2006/04/24_21:55:00 info: lvsslaveserver01 wants to go standby [foreign]
heartbeat[9610]: 2006/04/24_21:55:01 info: standby: lvsmasterserver01 can take our foreign resources
heartbeat[13356]: 2006/04/24_21:55:01 info: give up foreign HA resources (standby).
ResourceManager[13367]: 2006/04/24_21:55:01 info: Releasing resource group: lvsmasterserver01 IPaddr::192.168.10.67/32/eth0:0/192.168.10.67 ldirectord lvsmac
ResourceManager[13367]: 2006/04/24_21:55:01 info: Running /etc/ha.d/resource.d/lvsmac stop
ResourceManager[13367]: 2006/04/24_21:55:01 info: Running /etc/ha.d/resource.d/ldirectord stop
ResourceManager[13367]: 2006/04/24_21:55:03 info: Running /etc/ha.d/resource.d/IPaddr 192.168.10.67/32/eth0:0/192.168.10.67 stop
IPaddr[13537]: 2006/04/24_21:55:04 INFO: /sbin/route -n del -host 192.168.10.67
IPaddr[13537]: 2006/04/24_21:55:04 INFO: /sbin/ifconfig eth0:0 192.168.10.67 down
IPaddr[13537]: 2006/04/24_21:55:04 INFO: IP Address 192.168.10.67 released
IPaddr[13455]: 2006/04/24_21:55:04 INFO: IPaddr Success
ResourceManager[13587]: 2006/04/24_21:55:04 info: Releasing resource group: lvsmasterserver01 IPaddr::192.168.10.68/32/eth0:1/192.168.10.68 ldirectord lvsmac
ResourceManager[13587]: 2006/04/24_21:55:04 info: Running /etc/ha.d/resource.d/lvsmac stop
ResourceManager[13587]: 2006/04/24_21:55:04 info: Running /etc/ha.d/resource.d/ldirectord stop
ResourceManager[13587]: 2006/04/24_21:55:05 info: Running /etc/ha.d/resource.d/IPaddr 192.168.10.68

Forums:

我想你大概需要将你的heartbeat配置贴出来,这样大家可以看看到底哪里有些问题。

已经解决了吗?

randomness