[pve-devel] corosync/pmxcfs crash (same node that previous problem)

Alexandre DERUMIER aderumier at odiso.com
Tue Sep 18 14:35:32 CEST 2012


>> cman crashed - are there any cman related error logs? 
nothing in daemon.log

dlm_controld.log
Sep 17 10:20:21 dlm_controld cluster is down, exiting
Sep 17 10:20:21 dlm_controld daemon cpg_dispatch error 2


I do have some corosync logs from just before the crash:

Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42b c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42b c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:57 corosync [TOTEM ] Retransmit List: c0f42c c0f42d c0f42e c0f430 c0f432 
Sep 17 10:19:57 corosync [TOTEM ] FAILED TO RECEIVE
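
The Retransmit List lines above are highly repetitive, so a small script can help summarize them. The following is only a minimal sketch: the function name `summarize_retransmits` and the regular expression are illustrative assumptions based on the line layout shown in this mail, and it assumes the list entries are hexadecimal sequence numbers. It counts the Retransmit List lines per timestamp and collects the distinct entries.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpt in this mail:
#   "Sep 17 10:19:56 corosync [TOTEM ] Retransmit List: c0f42b c0f42d ..."
RETRANSMIT = re.compile(
    r"^(\w{3}\s+\d+\s+[\d:]+) corosync \[TOTEM \] Retransmit List: (.*)$"
)

def summarize_retransmits(lines):
    """Return (lines per timestamp, sorted distinct list entries)."""
    per_second = Counter()
    seqs = set()
    for line in lines:
        m = RETRANSMIT.match(line.strip())
        if not m:
            continue  # skip anything that is not a Retransmit List line
        per_second[m.group(1)] += 1
        seqs.update(m.group(2).split())
    # Sort numerically, assuming the entries are hexadecimal.
    return per_second, sorted(seqs, key=lambda s: int(s, 16))

if __name__ == "__main__":
    import sys
    counts, seqs = summarize_retransmits(sys.stdin)
    for stamp, n in counts.items():
        print(stamp, n)
    print("distinct entries:", " ".join(seqs))
```

It reads the log on standard input, so it can be fed a saved copy of the corosync log.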


----- Original Message -----

From: "Dietmar Maurer" <dietmar at proxmox.com>
To: "Alexandre DERUMIER" <aderumier at odiso.com>, pve-devel at pve.proxmox.com
Sent: Tuesday, 18 September 2012 13:59:07
Subject: RE: [pve-devel] corosync/pmxcfs crash (same node that previous problem)

cman crashed - are there any cman related error logs? 

> -----Original Message----- 
> From: pve-devel-bounces at pve.proxmox.com [mailto:pve-devel- 
> bounces at pve.proxmox.com] On Behalf Of Alexandre DERUMIER 
> Sent: Tuesday, 18 September 2012 10:01
> To: pve-devel at pve.proxmox.com 
> Subject: [pve-devel] corosync/pmxcfs crash (same node that previous 
> problem) 
> 
> Hi, I have again had a corosync (or related) problem on the same node as at
> the beginning of the month.
>
> I can't find many logs:
> 
> Sep 17 10:20:18 kvm2 pvestatd[307538]: status update time (6.812 seconds)
> Sep 17 10:20:21 kvm2 pmxcfs[3050]: [quorum] crit: quorum_dispatch failed: 2
> Sep 17 10:20:21 kvm2 pmxcfs[3050]: [libqb] warning: epoll_ctl(del): Bad file descriptor (9)
> Sep 17 10:20:21 kvm2 pmxcfs[3050]: [confdb] crit: confdb_dispatch failed: 2
> Sep 17 10:20:23 kvm2 pmxcfs[3050]: [libqb] warning: epoll_ctl(del): Bad file descriptor (9)
> Sep 17 10:20:23 kvm2 pmxcfs[3050]: [dcdb] crit: cpg_dispatch failed: 2
> Sep 17 10:20:25 kvm2 pmxcfs[3050]: [status] crit: cpg_send_message failed: 2
> Sep 17 10:20:25 kvm2 pmxcfs[3050]: [status] crit: cpg_send_message failed: 2
> Sep 17 10:20:27 kvm2 pmxcfs[3050]: [libqb] warning: epoll_ctl(del): Bad file descriptor (9)
> Sep 17 10:20:27 kvm2 pmxcfs[3050]: [dcdb] crit: cpg_dispatch failed: 2
> Sep 17 10:20:29 kvm2 pmxcfs[3050]: [dcdb] crit: cpg_leave failed: 2
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [libqb] warning: epoll_ctl(del): Bad file descriptor (9)
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [quorum] crit: quorum_initialize failed: 6
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [quorum] crit: can't initialize service
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [confdb] crit: confdb_initialize failed: 6
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [quorum] crit: can't initialize service
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [dcdb] notice: start cluster connection
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [dcdb] crit: cpg_initialize failed: 6
> Sep 17 10:20:31 kvm2 pmxcfs[3050]: [quorum] crit: can't initialize service
> 
> 
> Any idea?
> 
> _______________________________________________ 
> pve-devel mailing list 
> pve-devel at pve.proxmox.com 
> http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel 
