[pve-devel] corosync problems - need help
Alexandre DERUMIER
aderumier at odiso.com
Mon Sep 15 07:26:52 CEST 2014
Also, about the pmxcfs sefgaults,
I have see this messages
Sep 14 09:06:33 kvm1 pmxcfs[65403]: [dcdb] notice: cpg_join retry 62840
Sep 14 10:57:25 kvm11 pmxcfs[13112]: [dcdb] notice: cpg_join retry 65090
with retry around 65000 (16bits)
and
int retries = 0;
result = cpg_join(dfsm->cpg_handle, &dfsm->cpg_group_name);
if (result == CPG_ERR_TRY_AGAIN) {
nanosleep(&tvreq, NULL);
++retries;
if ((retries % 10) == 0)
cfs_dom_message(dfsm->log_domain, "cpg_join retry %d", retries);
goto loop;
}
could it be related to retries integer type?
----- Mail original -----
De: "Alexandre DERUMIER" <aderumier at odiso.com>
À: "Dietmar Maurer" <dietmar at proxmox.com>
Cc: pve-devel at pve.proxmox.com
Envoyé: Lundi 15 Septembre 2014 07:06:40
Objet: Re: [pve-devel] corosync problems - need help
>>This just indicates that corosync does not work as expected.
My understand is that the faulty node join the mutlicast group, other see it.
but when others nodes try to talk with him, they have no response ?
I'm going to do some wireshark network traces today
I'll also try to update all other nodes to kernel 3.10. (not sure it's related)
----- Mail original -----
De: "Dietmar Maurer" <dietmar at proxmox.com>
À: "Alexandre DERUMIER" <aderumier at odiso.com>
Cc: pve-devel at pve.proxmox.com
Envoyé: Lundi 15 Septembre 2014 05:43:56
Objet: RE: [pve-devel] corosync problems - need help
> seem to be in:
> data/src/dfsm.c
>
> result = cpg_mcast_joined(dfsm->cpg_handle, CPG_TYPE_AGREED, iov, len);
> if (retry && result == CPG_ERR_TRY_AGAIN) {
This just indicates that corosync does not work as expected.
_______________________________________________
pve-devel mailing list
pve-devel at pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
More information about the pve-devel
mailing list