[PVE-User] drbd issues after kernel upgrade
Robert Fantini
rob at fantinibakery.com
Thu Jan 26 17:29:06 CET 2012
Hello,
we're using Proxmox 2.0.
Since updating pve-kernel on Tuesday we've had DRBD issues. The new
kernel is version 2.6.32-55+ovzfix-1.
Here are syslog entries:
On the primary (fbc19):
Jan 26 09:58:11 fbc19 kernel: block drbd0: Digest mismatch, buffer
modified by upper layers during write: 801263472s +4096
Jan 26 09:58:11 fbc19 kernel: block drbd0: sock was shut down by peer
Jan 26 09:58:11 fbc19 kernel: block drbd0: peer( Secondary -> Unknown )
conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
Jan 26 09:58:11 fbc19 kernel: block drbd0: new current UUID
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D
Jan 26 09:58:11 fbc19 kernel: block drbd0: asender terminated
Jan 26 09:58:11 fbc19 kernel: block drbd0: Terminating asender thread
Jan 26 09:58:11 fbc19 kernel: block drbd0: Connection closed
Jan 26 09:58:11 fbc19 kernel: block drbd0: conn( NetworkFailure ->
Unconnected )
Jan 26 09:58:11 fbc19 kernel: block drbd0: receiver terminated
Jan 26 09:58:11 fbc19 kernel: block drbd0: Restarting receiver thread
Jan 26 09:58:11 fbc19 kernel: block drbd0: receiver (re)started
Jan 26 09:58:11 fbc19 kernel: block drbd0: conn( Unconnected ->
WFConnection )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Handshake successful: Agreed
network protocol version 96
Jan 26 09:58:12 fbc19 kernel: block drbd0: Peer authenticated using 20
bytes of 'sha1' HMAC
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( WFConnection ->
WFReportParams )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Starting asender thread (from
drbd0_receiver [3203])
Jan 26 09:58:12 fbc19 kernel: block drbd0: data-integrity-alg: sha1
Jan 26 09:58:12 fbc19 kernel: block drbd0: drbd_sync_handshake:
Jan 26 09:58:12 fbc19 kernel: block drbd0: self
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D
bits:14 flags:0
Jan 26 09:58:12 fbc19 kernel: block drbd0: peer
0D8AD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D
bits:0 flags:0
Jan 26 09:58:12 fbc19 kernel: block drbd0: uuid_compare()=1 by rule 70
Jan 26 09:58:12 fbc19 kernel: block drbd0: peer( Unknown -> Secondary )
conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent )
Jan 26 09:58:12 fbc19 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0
Jan 26 09:58:12 fbc19 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( WFBitMapS -> SyncSource
) pdsk( Consistent -> Inconsistent )
Jan 26 09:58:12 fbc19 kernel: block drbd0: Began resync as SyncSource
(will sync 56 KB [14 bits set]).
Jan 26 09:58:12 fbc19 kernel: block drbd0: updated sync UUID
5536B9653B98E1D7:0D8BD9190B76300B:0D8AD9190B76300B:6F0A48DBE8C0066D
Jan 26 09:58:12 fbc19 kernel: block drbd0: Resync done (total 1 sec;
paused 0 sec; 56 K/sec)
Jan 26 09:58:12 fbc19 kernel: block drbd0: updated UUIDs
5536B9653B98E1D7:0000000000000000:0D8BD9190B76300B:0D8AD9190B76300B
Jan 26 09:58:12 fbc19 kernel: block drbd0: conn( SyncSource -> Connected
) pdsk( Inconsistent -> UpToDate )
Jan 26 09:58:12 fbc19 kernel: block drbd0: bitmap WRITE of 3721 pages
took 70 jiffies
Jan 26 09:58:12 fbc19 kernel: block drbd0: 0 KB (0 bits) marked
out-of-sync by on disk bit-map.
On the secondary (fbc4):
Jan 26 09:58:11 fbc4 kernel: block drbd0: peer( Primary -> Unknown )
conn( Connected -> ProtocolError ) pdsk( UpToDate -> DUnknown )
Jan 26 09:58:11 fbc4 kernel: block drbd0: asender terminated
Jan 26 09:58:11 fbc4 kernel: block drbd0: Terminating asender thread
Jan 26 09:58:11 fbc4 kernel: block drbd0: Connection closed
Jan 26 09:58:11 fbc4 kernel: block drbd0: conn( ProtocolError ->
Unconnected )
Jan 26 09:58:11 fbc4 kernel: block drbd0: receiver terminated
Jan 26 09:58:11 fbc4 kernel: block drbd0: Restarting receiver thread
Jan 26 09:58:11 fbc4 kernel: block drbd0: receiver (re)started
Jan 26 09:58:11 fbc4 kernel: block drbd0: conn( Unconnected ->
WFConnection )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Handshake successful: Agreed
network protocol version 96
Jan 26 09:58:12 fbc4 kernel: block drbd0: Peer authenticated using 20
bytes of 'sha1' HMAC
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFConnection ->
WFReportParams )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Starting asender thread (from
drbd0_receiver [3092])
Jan 26 09:58:12 fbc4 kernel: block drbd0: data-integrity-alg: sha1
Jan 26 09:58:12 fbc4 kernel: block drbd0: drbd_sync_handshake:
Jan 26 09:58:12 fbc4 kernel: block drbd0: self
0D8AD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D
bits:0 flags:0
Jan 26 09:58:12 fbc4 kernel: block drbd0: peer
5536B9653B98E1D7:0D8AD9190B76300B:6F0A48DBE8C0066D:6F0948DBE8C0066D
bits:14 flags:0
Jan 26 09:58:12 fbc4 kernel: block drbd0: uuid_compare()=-1 by rule 50
Jan 26 09:58:12 fbc4 kernel: block drbd0: peer( Unknown -> Primary )
conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk(
DUnknown -> UpToDate )
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFBitMapT -> WFSyncUUID )
Jan 26 09:58:12 fbc4 kernel: block drbd0: updated sync uuid
0D8BD9190B76300A:0000000000000000:6F0A48DBE8C0066C:6F0948DBE8C0066D
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-target minor-0
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-target minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( WFSyncUUID -> SyncTarget
) disk( Outdated -> Inconsistent )
Jan 26 09:58:12 fbc4 kernel: block drbd0: Began resync as SyncTarget
(will sync 56 KB [14 bits set]).
Jan 26 09:58:12 fbc4 kernel: block drbd0: Resync done (total 1 sec;
paused 0 sec; 56 K/sec)
Jan 26 09:58:12 fbc4 kernel: block drbd0: updated UUIDs
5536B9653B98E1D6:0000000000000000:0D8BD9190B76300A:0D8AD9190B76300B
Jan 26 09:58:12 fbc4 kernel: block drbd0: conn( SyncTarget -> Connected
) disk( Inconsistent -> UpToDate )
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm
after-resync-target minor-0
Jan 26 09:58:12 fbc4 kernel: block drbd0: helper command: /sbin/drbdadm
after-resync-target minor-0 exit code 0 (0x0)
Jan 26 09:58:12 fbc4 kernel: block drbd0: bitmap WRITE of 3721 pages
took 37 jiffies
Jan 26 09:58:12 fbc4 kernel: block drbd0: 0 KB (0 bits) marked
out-of-sync by on disk bit-map.
DRBD version: 8.3.10 (api:88/proto:86-96)
Is anyone else having this issue? I know it could be a coincidence, but
I checked the last 3 syslogs and we did not have this issue until after
booting pve-kernel-2.6.32-6-pve 2.6.32-55+ovzfix-1 on both nodes.
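
For anyone who wants to run the same check on their own nodes, here is a
minimal sketch of how older syslogs could be scanned for the "Digest
mismatch" message. It assumes the line layout shown in the excerpts above
(date, time, host, "kernel: block drbdN: message"). The function name and
the regular expression are only an illustration and are not part of DRBD
or Proxmox; other syslog formats would need a different pattern.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpts above:
# "Jan 26 09:58:11 fbc19 kernel: block drbd0: Digest mismatch, ..."
LINE_RE = re.compile(
    r"^(?P<day>\w{3}\s+\d{1,2}) \d{2}:\d{2}:\d{2} (?P<host>\S+) kernel: "
    r"block (?P<dev>drbd\d+): (?P<msg>.*)$"
)


def count_digest_mismatches(lines):
    """Count 'Digest mismatch' entries per (day, host, device).

    `lines` is any iterable of syslog lines, for example an open file.
    Lines that do not match the assumed layout are ignored.
    """
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.rstrip("\n"))
        if m and m.group("msg").startswith("Digest mismatch"):
            counts[(m.group("day"), m.group("host"), m.group("dev"))] += 1
    return counts


# Example use (the path is an assumption, adjust to your system):
# with open("/var/log/syslog", errors="replace") as fh:
#     for key, n in sorted(count_digest_mismatches(fh).items()):
#         print(*key, n)
```

A day with a non-zero count before the kernel upgrade would suggest the
upgrade is not the cause; no hits before it would only be consistent
with the upgrade, not proof of it.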