[pve-devel] VM locked after failed Snapshot
Alexandre DERUMIER
aderumier at odiso.com
Mon Sep 8 17:18:05 CEST 2014
Hi,
my 2cents, but could it be related to the vmstate bug ?
https://git.proxmox.com/?p=pve-qemu-kvm.git;a=commit;h=62d638ff1e9fb96ca078be2225426aaac8f909f6
(Is a a vm snasphot with vmstate ?)
----- Mail original -----
De: "Stefan Priebe - Profihost AG" <s.priebe at profihost.ag>
À: pve-devel at pve.proxmox.com
Envoyé: Lundi 8 Septembre 2014 12:06:48
Objet: [pve-devel] VM locked after failed Snapshot
Hi,
today i had the following problem.
1.) i wanted to create a snapshot of a vm
2.) it failed for unknown reason and i had the following output (PVE
Webgui):
image has watchers - not removing
Removing image: 0% complete...failed.
rbd: error: image still has watchers
TASK ERROR: received interrupt
3.) The VM was than in a locked state (VM is locked (snapshot))
I see multiple problems here.
1.) lock state should be removed by PVE in case of a failure.
Currently snapshot_create calls snapshot_prepare to set the lock. And at
the end snapshot_commit deletes the log.
But currently in case of $err
if ($err) {
warn "snapshot create failed: starting cleanup\n";
eval { snapshot_delete($vmid, $snapname, 0, $drivehash); };
warn $@ if $@;
die $err;
}
The lock isn't removed.
What is the correct way to remove a lock in this case?
2.) in case of an unexpected failure or signal ceph/rbd does not remove
it's watcher from the image. So the snapshot_delete failed in this case.
Output:
image has watchers - not removing
Removing image: 0% complete...failed.
rbd: error: image still has watchers
rbd has an automatic timeout after 30s should PVE handle this by waiting
30s and try it again?
Greets,
Stefan
_______________________________________________
pve-devel mailing list
pve-devel at pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
More information about the pve-devel
mailing list