[pve-devel] Blacklisting HP hardware watchdog timer module ?

Alexandre DERUMIER aderumier at odiso.com
Wed Dec 2 11:56:16 CET 2015


Also, if somebody have a redhat subscription,
it seem that they have an explain about the problem

https://access.redhat.com/solutions/707563


(don't have access, sorry )

----- Mail original -----
De: "aderumier" <aderumier at odiso.com>
À: "pve-devel" <pve-devel at pve.proxmox.com>
Cc: "t lamprecht" <t.lamprecht at proxmox.com>
Envoyé: Mercredi 2 Décembre 2015 11:54:37
Objet: Re: [pve-devel] Blacklisting HP hardware watchdog timer module ?

I don't have hp server to test, 

but on dell server, by default the idrac watchdog is not used (module is not loaded), 
if the motherboard intel watchdog is loaded ( iTCO_wdt), the idrac/bmc watchdog module is not loaded. 
(to have idrac watchdog working, I need to use nmi_watchdog=0 in grub.cfg to disable motherboard watchdog) 


I think the problem with HP, is that both are loaded (motherboard and ilo), but ilo watchdog is not updated. 

according to https://www.kernel.org/doc/Documentation/watchdog/hpwdt.txt, 
the module need to be loaded with priority=1: 


1. If the kernel has not been booted with nmi_watchdog turned off then 
edit /boot/grub/menu.lst and place the nmi_watchdog=0 at the end of the 
currently booting kernel line. 
2. reboot the sever 
3. Once the system comes up perform a rmmod hpwdt 
4. insmod /lib/modules/`uname -r`/kernel/drivers/char/watchdog/hpwdt.ko priority=1 




I don't known what is the advantage to use bmc|ilo|idrac watchdog vs motherboard watchdog ? 


----- Mail original ----- 
De: "Emmanuel Kasper" <e.kasper at proxmox.com> 
À: "pve-devel" <pve-devel at pve.proxmox.com> 
Cc: "t lamprecht" <t.lamprecht at proxmox.com> 
Envoyé: Mercredi 2 Décembre 2015 11:29:41 
Objet: [pve-devel] Blacklisting HP hardware watchdog timer module ? 

Hi 
It seems that the HP Watchdog timer does not work properly: it triggers 
a kernel panic instead of rebooting the server. 

The issue came up here in this thread: 

http://forum.proxmox.com/threads/24015-VE-4-0-Kernel-Panic-on-HP-Proliant-servers 

At least 3 users seem to have solved the problem by blacklisting the 
corresponding hpwdt kernel module. 

As remarked by Alexandre, the Ubuntu folks actually decided to black 
*all* the hardware watchdog timers some time ago 
( https://lists.ubuntu.com/archives/kernel-team/2015-March/054512.html ) 

Should we add hpwdt to our list of blacklisted module ? 

Emmanuel 






_______________________________________________ 
pve-devel mailing list 
pve-devel at pve.proxmox.com 
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel 



More information about the pve-devel mailing list