[PVE-User] critical HA problem on a PVE6 cluster

Herve Ballans herve.ballans at ias.u-psud.fr
Thu May 14 17:01:52 CEST 2020


Hi Mark,

Thanks. Yes we are investigating with network engineers.

We upgraded the entire cluster in PVE 6.2 and the cluster is fully 
operational now.

But we think indeed that something in the network has changed and caused 
the problem (switch upgrades ?)

Therefore, for example, does activating or disabling the IGMP protocol 
could have an impact on corosync or not (in PVE 6) ?

Regards,
Hervé

On 11/05/2020 19:33, Mark Adams via pve-user wrote:
> Subject:
> Re: [PVE-User] critical HA problem on a PVE6 cluster
> From:
> Mark Adams <mark at openvs.co.uk>
> Date:
> 11/05/2020 à 19:33
>
> To:
> PVE User List <pve-user at pve.proxmox.com>
>
>
> As Eneko already said, this really sounds like a network problem - if your
> hosts lose connectivity to each other they will reboot themselves, and it
> sounds like this is what happened to you.
>
> You are sure there has been no changes to your network around the time this
> happened? Have you checked your switch config is still right (maybe it
> reset?)
>
> Maybe the switches have bugged out and need a reboot? check the logs on
> them for errors.



More information about the pve-user mailing list