[pve-devel] need help to debug random host freeze on multiple hosts

Cesar Peschiera brain at click.com.py
Mon Dec 29 08:55:13 CET 2014


I know that this isn't a solution, but i will tell you only as a comment for
future decisions:

Long time ago, when i worked with Novell Netware, i had a problem of cache
in the AMD processor, so i had that disable it, and after, this server was
very slow, but was stable. Since that time i never recommended servers with
AMD processor.

Moreover, maybe will be good disable some flags to AMD processor and test 
it. How do it?, sincerely i don't know, but if you know it, please comment 
it here, as also your tests (if you can)

----- Original Message ----- 
From: "Alexandre DERUMIER" <aderumier at odiso.com>
To: "Cesar Peschiera" <brain at click.com.py>
Cc: "datanom.net" <mir at datanom.net>; "pve-devel" <pve-devel at pve.proxmox.com>
Sent: Monday, December 29, 2014 3:31 AM
Subject: Re: [pve-devel] need help to debug random host freeze on multiple
hosts


>>Maybe i ask you a silly question, did you see the syslog and kern.log
>>file?

Yes sure , I have nothing in logs.
(That's why I thinked of kdump to try to have more info).

I'll really don't known if it's a software real kernel panic, or a hardware
bug.

I just see on vmware forum some amd microcode bug, and see that dell provide
a new bios update this month.
I'll try to update to see if it's help.



----- Original Message ----- 
From: "Alexandre DERUMIER" <aderumier at odiso.com>
To: "datanom.net" <mir at datanom.net>
Cc: "pve-devel" <pve-devel at pve.proxmox.com>
Sent: Monday, December 29, 2014 1:49 AM
Subject: Re: [pve-devel] need help to debug random host freeze on multiple
hosts


>>>Bad RAM stick?
>>>Bad PSU?
>>>Overheating of the CPU?
>
> No errors reporting in dell Idrac.
>
> (I have the problem on 6 differents nodes.....)
>
> I was also thinking of electrical problem, but voltages don't report any
> error.
>
> Maybe the only difference is that I have more load currently on all my
> nodes because of Xmas period
> (We host a lot of ecommerce websites)
> I'm around 60-70% load on this quad opteron platforms.
>
>
> I'll try to implement kdump today.
>
>
>
> ----- Mail original ----- 
> De: "datanom.net" <mir at datanom.net>
> À: "pve-devel" <pve-devel at pve.proxmox.com>
> Envoyé: Dimanche 28 Décembre 2014 19:02:04
> Objet: Re: [pve-devel] need help to debug random host freeze on multiple
> hosts
>
> On Sun, 28 Dec 2014 17:37:50 +0100 (CET)
> Alexandre DERUMIER <aderumier at odiso.com> wrote:
>
>>
>> I really don't known how to debug that, because the system freeze, and I
>> don't have any kernel panic output in display or serial.
>>
>>
>> Can somebody help me to add something to have debug output ?
>>
> Bad RAM stick?
> Bad PSU?
>
> -- 
> Hilsen/Regards
> Michael Rasmussen
>
> Get my public GnuPG keys:
> michael <at> rasmussen <dot> cc
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xD3C9A00E
> mir <at> datanom <dot> net
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xE501F51C
> mir <at> miras <dot> org
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xE3E80917
> -------------------------------------------------------------- 
> /usr/games/fortune -es says:
> Bridge ahead. Pay troll.
>
> _______________________________________________
> pve-devel mailing list
> pve-devel at pve.proxmox.com
> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
> _______________________________________________
> pve-devel mailing list
> pve-devel at pve.proxmox.com
> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
>




More information about the pve-devel mailing list