Comment 8 for bug 659510

Revision history for this message
flashydave (dave-opensourcesolutions) wrote :

Since upgrading to Maverick and installing cluster-glue 1.0.6-1ubuntu1.1 I am getting repeats of

crmd: WARN: lrm_signon: can not initiate connection
crmd: [4963]: WARN: do_lrm_control: Failed to sign on to the LRM 29 (30 max) time

on a system that previously was working on lucid (except for upstart which is why I upgraded)
netstat -anp shows
unix 2 [ ] DGRAM 22204 4546/lrmd
ps auxf shows:

root 4539 0.0 0.0 0 0 ? Z 00:04 0:00 \_ [lrmd] <defunct>
root 4546 0.0 0.0 36424 1344 ? S 00:04 0:00 \_ /usr/lib/heartbeat/lrmd

Anyone seen this or know how best to debug this?
A rapid response would be helpful as I have a production HA cluster down!