[pve-devel] training : watchdog not working on 1 server
    Alexandre DERUMIER 
    aderumier at odiso.com
       
    Wed Feb  3 17:20:58 CET 2016
    
    
  
Hi,
We are currently testing watchdogs during our training session,
and 1 of the 3 nodes cluster don't load the watchdog correctly
I have tried with softdog  or iTCO_wdt, 
the watchdog timer is never enabled and have a 15s countdown.
the 3 nodes cluster are exactly the same model (old dell poweredge 2950),
clean proxmox 4.1 install with all last updates
# ipmitool mc watchdog get
Watchdog Timer Use:     Reserved (0x00)
Watchdog Timer Is:      Stopped
Watchdog Timer Actions: No action (0x00)
Pre-timeout interval:   1 seconds
Timer Expiration Flags: 0x00
Initial Countdown:      15 sec
Present Countdown:      15 sec
# dmesg|grep softdog
[   19.098138] softdog: Software Watchdog Timer: 0.08 initialized. soft_noboot=0 soft_margin=60 sec soft_panic=0 (nowayout=0)
# dmesg|grep -i watchdog
[    0.096195] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
[    9.340545] systemd[1]: Cannot add dependency job for unit watchdog-mux.socket, ignoring: Unit watchdog-mux.socket failed to load: No such file or directory.
[   19.098138] softdog: Software Watchdog Timer: 0.08 initialized. soft_noboot=0 soft_margin=60 sec soft_panic=0 (nowayout=0)
>>Unit watchdog-mux.socket failed to load: No such file or directory. 
I don't see this warning on other nodes
Any idea how I can debug that ?
Alexandre
    
    
More information about the pve-devel
mailing list