[pve-devel] corosync bug: cluster break after 1 node clean shutdown
Fabian Grünbichler
f.gruenbichler at proxmox.com
Tue Sep 29 15:28:19 CEST 2020
huge thanks for all the work on this btw!
I think I've found a likely culprit (a missing lock around a
non-thread-safe corosync library call) based on the last logs (which
were now finally complete!).
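to make the "missing lock" idea concrete, here is a minimal sketch in Python (the real code in pve-cluster is C, and this is not the actual patch). it only shows the general pattern: every call into a non-thread-safe library goes through one shared mutex. `UnsafeHandle`, `send()`, `locked_send()` and `run_workers()` are hypothetical names made up for illustration, not corosync API.

```python
import threading


class UnsafeHandle:
    """Hypothetical stand-in for a library handle that is NOT thread-safe."""

    def __init__(self):
        self.sent = 0

    def send(self):
        # Unprotected read-modify-write: two threads interleaving here
        # could lose an update if nothing serialises the calls.
        current = self.sent
        self.sent = current + 1


def locked_send(handle, lock):
    """Serialise every call into the non-thread-safe handle."""
    with lock:
        handle.send()


def run_workers(n_threads=4, calls_per_thread=1000):
    """Call locked_send() from several threads and return the final count."""
    handle = UnsafeHandle()
    lock = threading.Lock()  # one lock shared by all callers of this handle

    def worker():
        for _ in range(calls_per_thread):
            locked_send(handle, lock)

    threads = [threading.Thread(target=worker) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return handle.sent
```

the important part is that all callers of the same handle share the same lock; a single code path that skips it is enough to bring the race back.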
I rebuilt the packages with a proof-of-concept fix:
23b03a48d3aa9c14e86fe8cf9bbb7b00bd8fe9483084b9e0fd75fd67f29f10bec00e317e2a66758713050f36c165d72f107ee3449f9efeb842d3a57c25f8bca7 pve-cluster_6.1-8_amd64.deb
9e1addd676513b176f5afb67cc6d85630e7da9bbbf63562421b4fd2a3916b3b2af922df555059b99f8b0b9e64171101a1c9973846e25f9144ded9d487450baef pve-cluster-dbgsym_6.1-8_amd64.deb
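if you want to check the downloads against the checksums above, something like the following should work. it assumes the digests are SHA-512 (guessed from their length of 128 hex characters, the algorithm is not named here). the helper names `sha512_hex()`, `read_chunks()` and `verify()` are made up for this sketch.

```python
import hashlib


def sha512_hex(chunks):
    """Return the SHA-512 hex digest of an iterable of byte chunks."""
    digest = hashlib.sha512()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def read_chunks(path, size=1 << 20):
    """Yield the file at `path` in blocks so large packages fit in memory."""
    with open(path, "rb") as fh:
        while True:
            block = fh.read(size)
            if not block:
                break
            yield block


def verify(path, expected_hex):
    """True if the file's SHA-512 matches the expected hex digest."""
    return sha512_hex(read_chunks(path)) == expected_hex.strip().lower()
```

usage would be `verify("pve-cluster_6.1-8_amd64.deb", "<digest from the list above>")`, and the same for the -dbgsym package.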
I removed some logging statements which are no longer needed, so output
is a bit less verbose again. if you are not able to trigger the issue
with this package, feel free to remove the -debug and let it run for a
little longer without the massive logs.
if feedback from your end is positive, I'll whip up a proper patch
tomorrow or on Thursday.