[pve-devel] corosync problems - need help

Alexandre DERUMIER aderumier at odiso.com
Sun Sep 14 11:06:51 CEST 2014


Ok,I finally solved,

kill -9 dlm_controld
kill -9 corosync -f

and service cman start


Now all is working fine again.

Thanks for the help !




----- Mail original ----- 

De: "Alexandre DERUMIER" <aderumier at odiso.com> 
À: "Dietmar Maurer" <dietmar at proxmox.com> 
Cc: pve-devel at pve.proxmox.com 
Envoyé: Dimanche 14 Septembre 2014 10:52:50 
Objet: Re: [pve-devel] corosync problems - need help 

I have restarted 2 nodes, 

they see them together but no the other nodes. 


I think corosync is totally hanging on other nodes, they don't have see 2 nodes nodes. 


Now I'll try to find a way to restart corosync without restarting the full node. 

(main problem is dlm_control, not sure I can kill it) 


----- Mail original ----- 

De: "Alexandre DERUMIER" <aderumier at odiso.com> 
À: "Dietmar Maurer" <dietmar at proxmox.com> 
Cc: pve-devel at pve.proxmox.com 
Envoyé: Dimanche 14 Septembre 2014 10:34:36 
Objet: Re: [pve-devel] corosync problems - need help 

Another strange thing, 

I have stopped 1 node, 

and other nodes see it online ?????? 

#clustat nodes 


Member Name ID Status 
------ ---- ---- ------ 
kvm6 1 Online 
kvm4 2 Online 
kvm3 3 Online 
kvm2 4 Online 
kvm5 5 Online 
kvm1 6 Online ---> the node is shutdown 
kvm8 7 Online 
kvm7 8 Online 
kvm9 9 Online 
kvm10 10 Online 
kvm11 11 Online, Local 
kvm12 12 Online 


----- Mail original ----- 

De: "Alexandre DERUMIER" <aderumier at odiso.com> 
À: "Dietmar Maurer" <dietmar at proxmox.com> 
Cc: pve-devel at pve.proxmox.com 
Envoyé: Dimanche 14 Septembre 2014 10:10:01 
Objet: Re: [pve-devel] corosync problems - need help 

>>I meant: Is it read-only on all nodes? 

Yes :( 

>>If you edit /etc/cluster/cluster.conf you need to increase version number to prevent overwrite. Then 
>>restart cman. 

Ok, I'll try that 

Seem that we can reload cluster.conf with "cman_tool version -r" 



>> >> (and /etc/init.d/cman stop is hanging) 
>> 
>>if nothing helps, try 'kill -9 ...' 

yes,it's hanging on 
#Stopping cluster: 
# Stopping dlm_controld... 

I'll test with increasing window_size. 

I'll keep you in touch. 

(thanks for the help) 


----- Mail original ----- 

De: "Dietmar Maurer" <dietmar at proxmox.com> 
À: "Alexandre DERUMIER" <aderumier at odiso.com> 
Cc: pve-devel at pve.proxmox.com 
Envoyé: Dimanche 14 Septembre 2014 09:27:22 
Objet: RE: [pve-devel] corosync problems - need help 

> >> I would like to try to change corosync window_size, but how can I do it online 
> ? 
> > 
> >On all nodes? 

I meant: Is it read-only on all nodes? 
> 
> Yes, if possible. As I can't edit cluster.conf (read only), don't known how to inject 
> it online. 

If you edit /etc/cluster/cluster.conf you need to increase version number to prevent overwrite. Then 
restart cman. 

If there are still working nodes where /etc/pve is writable edit it there. 

> >> (and /etc/init.d/cman stop is hanging) 

if nothing helps, try 'kill -9 ...' 
_______________________________________________ 
pve-devel mailing list 
pve-devel at pve.proxmox.com 
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel 
_______________________________________________ 
pve-devel mailing list 
pve-devel at pve.proxmox.com 
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel 
_______________________________________________ 
pve-devel mailing list 
pve-devel at pve.proxmox.com 
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel 



More information about the pve-devel mailing list