[PVE-User] pveproxy dying, node unusable

Emmanuel Kasper e.kasper at proxmox.com
Mon Dec 11 17:14:37 CET 2017


On 12/11/2017 04:50 PM, Lindsay Mathieson wrote:
> Also, I was unable to connect to the VMs on those nodes, not even via RDP.
> 
> On 12/12/2017 1:46 AM, Lindsay Mathieson wrote:
>>
>> I dist-upgraded two nodes yesterday. Now both those nodes have multiple
>> unkillable pveproxy processes. dmesg has many entries of:
>>
>>     [50996.416909] INFO: task pveproxy:6798 blocked for more than 120 seconds.
>>     [50996.416914]       Tainted: P           O 4.4.95-1-pve #1
>>     [50996.416918] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
>>     [50996.416922] pveproxy        D ffff8809194e3df8 0  6798      1 0x00000004
>>     [50996.416925]  ffff8809194e3df8 ffff880ff6f5ed80 ffff880ff84fe200 ffff880fded5e200
>>     [50996.416927]  ffff8809194e4000 ffff880fc7fb43ac ffff880fded5e200 00000000ffffffff
>>     [50996.416929]  ffff880fc7fb43b0 ffff8809194e3e10 ffffffff818643b5 ffff880fc7fb43a8
>>
>>
>> qm list hangs
>>
>> The VMs on those nodes do not respond in the web GUI.
>>
>> The node I did not upgrade is fine.


Hi Lindsay,

As a quick check, is the cluster file system mounted on /etc/pve, and can
you read files there normally (i.e. does cat /etc/pve/datacenter.cfg work)?

Are the node storages returning their status properly
(i.e. pvesm status does not hang)?
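
The two checks above can block indefinitely on an affected node, so it may
help to run them with a time limit. The following is only a rough sketch:
check_hang is a hypothetical helper, not part of Proxmox VE. It starts the
command in the background and polls for completion instead of waiting on
it, because a process stuck in uninterruptible sleep (state D, as in the
dmesg output) cannot be killed and would also stall a wrapper that waits
for it.

```shell
#!/bin/sh
# check_hang SECONDS COMMAND [ARGS...]
# Hypothetical helper: reports whether COMMAND returns within SECONDS.
# It never waits on or kills the command, so a process stuck in D state
# cannot block this script.
check_hang() {
    limit=$1
    shift
    status_file=$(mktemp) || return 2

    # The exit code is written only after the command has returned.
    ( "$@"; echo "$?" >"$status_file" ) >/dev/null 2>&1 &

    waited=0
    while :; do
        if [ -s "$status_file" ]; then
            rc=$(cat "$status_file")
            rm -f "$status_file"
            echo "finished (exit $rc): $*"
            return 0
        fi
        [ "$waited" -ge "$limit" ] && break
        sleep 1
        waited=$((waited + 1))
    done

    # The command may still be running; its status file is left behind.
    echo "HANG: still running after ${limit}s: $*"
    return 1
}
```

On a node, this could be called as `check_hang 10 cat /etc/pve/datacenter.cfg`
and `check_hang 10 pvesm status`; the 10-second limit is an arbitrary choice.
`mountpoint -q /etc/pve` (from util-linux) answers the "is it mounted"
part of the question separately.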



