[PVE-User] drbd issues after kernel upgrade

Robert Fantini rob at fantinibakery.com
Thu Jan 26 17:29:06 CET 2012


Hello
we're using Proxmox 2.0.
since updating pve-kernel on Tuesday  we've had drbd issues. the new 
kernel is Version: 2.6.32-55+ovzfix-1 .

Here are syslog entries:

on Primary:
Jan 26 09:58:11 fbc19 kernel: block drbd0: Digest mismatch, buffer 
modified by upper layers during write: 801263472s +4096
Jan 26 09:58:11 fbc19 kernel: block drbd0: sock was shut down by peer
Jan 26 09:58:11 fbc19 kernel: block drbd0: peer( Secondary -> Unknown ) 
conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
Jan 26 09:58:11 fbc19 kernel: block drbd0: new current UUID 
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D
Jan 26 09:58:11 fbc19 kernel: block drbd0: asender terminated
Jan 26 09:58:11 fbc19 kernel: block drbd0: Terminating asender thread
Jan 26 09:58:11 fbc19 kernel: block drbd0: Connection closed
Jan 26 09:58:11 fbc19 kernel: block drbd0: conn( NetworkFailure -> 
Unconnected )
Jan 26 09:58:11 fbc19 kernel: block drbd0: receiver terminated
Jan 26 09:58:11 fbc19 kernel: block drbd0: Restarting receiver thread
Jan 26 09:58:11 fbc19 kernel: block drbd0: receiver (re)started
Jan 26 09:58:11 fbc19 kernel: block drbd0: conn( Unconnected -> 
WFConnection )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Handshake successful: Agreed 
network protocol version 96
Jan 26 09:58:12 fbc19 kernel: block drbd0: Peer authenticated using 20 
bytes of 'sha1' HMAC
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( WFConnection -> 
WFReportParams )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Starting asender thread (from 
drbd0_receiver [3203])Jan 26 09:58:12 fbc19 kernel: block drbd0: 
data-integrity-alg: sha1
Jan 26 09:58:12 fbc19 kernel: block drbd0: drbd_sync_handshake:
Jan 26 09:58:12 fbc19 kernel: block drbd0: self 
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D 
bits:14 flags:0
Jan 26 09:58:12 fbc19 kernel: block drbd0: peer 
0D8AD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D 
bits:0 flags:0
Jan 26 09:58:12 fbc19 kernel: block drbd0: uuid_compare()=1 by rule 70
Jan 26 09:58:12 fbc19 kernel: block drbd0: peer( Unknown -> Secondary ) 
conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent )
Jan 26 09:58:12 fbc19 kernel: block drbd0: helper command: /sbin/drbdadm 
before-resync-source minor-0
Jan 26 09:58:12 fbc19 kernel: block drbd0: helper command: /sbin/drbdadm 
before-resync-source minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( WFBitMapS -> SyncSource 
) pdsk( Consistent -> Inconsistent )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Began resync as SyncSource 
(will sync 56 KB [14 bits set]).
Jan 26 09:58:12 fbc19 kernel: block drbd0: updated sync UUID 
5536B9653B98E1D7:0D8BD9190B76300B:0D8AD9190B76300B:6F0A48DBE8C0066D
Jan 26 09:58:12 fbc19 kernel: block drbd0: Resync done (total 1 sec; 
paused 0 sec; 56 K/sec)
Jan 26 09:58:12 fbc19 kernel: block drbd0: updated UUIDs 
5536B9653B98E1D7:0000000000000000:0D8BD9190B76300B:0D8AD9190B76300B
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( SyncSource -> Connected 
) pdsk( Inconsistent -> UpToDate )
Jan 26 09:58:12 fbc19 kernel: block drbd0: bitmap WRITE of 3721 pages 
took 70 jiffies
Jan 26 09:58:12 fbc19 kernel: block drbd0: 0 KB (0 bits) marked 
out-of-sync by on disk bit-map.


on secondary:
Jan 26 09:58:11 fbc4 kernel: block drbd0: peer( Primary -> Unknown ) 
conn( Connected -> ProtocolError ) pdsk( UpToDate -> DUnknown )
Jan 26 09:58:11 fbc4 kernel: block drbd0: asender terminated
Jan 26 09:58:11 fbc4 kernel: block drbd0: Terminating asender thread
Jan 26 09:58:11 fbc4 kernel: block drbd0: Connection closed
Jan 26 09:58:11 fbc4 kernel: block drbd0: conn( ProtocolError -> 
Unconnected )
Jan 26 09:58:11 fbc4 kernel: block drbd0: receiver terminated
Jan 26 09:58:11 fbc4 kernel: block drbd0: Restarting receiver thread
Jan 26 09:58:11 fbc4 kernel: block drbd0: receiver (re)started
Jan 26 09:58:11 fbc4 kernel: block drbd0: conn( Unconnected -> 
WFConnection )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Handshake successful: Agreed 
network protocol version 96
Jan 26 09:58:12 fbc4 kernel: block drbd0: Peer authenticated using 20 
bytes of 'sha1' HMAC
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFConnection -> 
WFReportParams )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Starting asender thread (from 
drbd0_receiver [3092])
Jan 26 09:58:12 fbc4 kernel: block drbd0: data-integrity-alg: sha1
Jan 26 09:58:12 fbc4 kernel: block drbd0: drbd_sync_handshake:
Jan 26 09:58:12 fbc4 kernel: block drbd0: self 
0D8AD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D 
bits:0 flags:0
Jan 26 09:58:12 fbc4 kernel: block drbd0: peer 
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D 
bits:14 flags:0
Jan 26 09:58:12 fbc4 kernel: block drbd0: uuid_compare()=-1 by rule 50
Jan 26 09:58:12 fbc4 kernel: block drbd0: peer( Unknown -> Primary ) 
conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( 
DUnknown -> UpToDate )
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFBitMapT -> WFSyncUUID )
Jan 26 09:58:12 fbc4 kernel: block drbd0: updated sync uuid 
0D8BD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm 
before-resync-target minor-0
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm 
before-resync-target minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFSyncUUID -> SyncTarget 
) disk( Outdated -> Inconsistent )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Began resync as SyncTarget 
(will sync 56 KB [14 bits set]).
Jan 26 09:58:12 fbc4 kernel: block drbd0: Resync done (total 1 sec; 
paused 0 sec; 56 K/sec)
Jan 26 09:58:12 fbc4 kernel: block drbd0: updated UUIDs 
5536B9653B98E1D6:0000000000000000:0D8BD9190B76300A:0D8AD9190B76300B
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( SyncTarget -> Connected 
) disk( Inconsistent -> UpToDate )
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm 
after-resync-target minor-0
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm 
after-resync-target minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc4 kernel: block drbd0: bitmap WRITE of 3721 pages 
took 37 jiffies
Jan 26 09:58:12 fbc4 kernel: block drbd0: 0 KB (0 bits) marked 
out-of-sync by on disk bit-map.



drbd version is : version: 8.3.10 (api:88/proto:86-96)

Is anyone else having this issue?  I know it could be a coincidence, but 
I checked the last 3 syslogs and we did not have this issue until after 
booting pve-kernel-2.6.32-6-pve 2.6.32-55+ovzfix-1 on both nodes.





More information about the pve-user mailing list