[PVE-User] I lost the cluster communication in a 10 nodes cluster

Denis Morejon denis.morejon at etecsa.cu
Mon Oct 15 18:46:33 CEST 2018


Is multicast communication the main cause of cluster proxmox file system 
problems ?

Why some times date and time have to be with cluster errors ?

Since my point of view cluster communication errors are the most 
critical errors since affect all VMs keeping It from start again

because of not quorrum.

Are there any tips (or steps) to fix it or to avoid it ?


El 15/10/18 a las 03:57, Thomas Lamprecht escribió:
> On 10/12/18 6:57 PM, Denis Morejon wrote:
>> The 10 nodes lost the communication with each other. And they were working fine for a month. They all have version 5.1.
>>
> any environment changes? E.g., switch change or software update
> (which then could block multicast)?
>
> Can you also see if the omping test go still through:
> https://pve.proxmox.com/pve-docs/chapter-pvecm.html#_cluster_network
>
>> All nodes have the same date/time and show a status like this:
>>
>> root at proxmox11:~# pvecm status
>>
>> Quorum information
>> ------------------
>> Date:             Fri Oct 12 11:55:59 2018
>> Quorum provider:  corosync_votequorum
>> Nodes:            1
>> Node ID:          0x00000007
>> Ring ID:          7/60372
>> Quorate:          No
>>
>> Votequorum information
>> ----------------------
>> Expected votes:   10
>> Highest expected: 10
>> Total votes:      1
>> Quorum:           6 Activity blocked
>> Flags:
>>
>> Membership information
>> ----------------------
>>      Nodeid      Votes Name
>> 0x00000007          1 192.168.80.11 (local)
>>
>>
>
>



More information about the pve-user mailing list