[pve-devel] VM locked after failed Snapshot
    Alexandre DERUMIER 
    aderumier at odiso.com
       
    Mon Sep  8 17:18:05 CEST 2014
    
    
  
Hi,
My 2 cents: could it be related to the vmstate bug?
https://git.proxmox.com/?p=pve-qemu-kvm.git;a=commit;h=62d638ff1e9fb96ca078be2225426aaac8f909f6
(Is it a VM snapshot with vmstate?)
----- Original Message -----
From: "Stefan Priebe - Profihost AG" <s.priebe at profihost.ag>
To: pve-devel at pve.proxmox.com
Sent: Monday, 8 September 2014 12:06:48
Subject: [pve-devel] VM locked after failed Snapshot
Hi,
today I had the following problem.
1.) I wanted to create a snapshot of a VM.
2.) It failed for an unknown reason and I got the following output (PVE
web GUI):
image has watchers - not removing 
Removing image: 0% complete...failed. 
rbd: error: image still has watchers 
TASK ERROR: received interrupt 
3.) The VM was then in a locked state (VM is locked (snapshot)).
I see multiple problems here.
1.) The lock state should be removed by PVE in case of a failure.
Currently snapshot_create calls snapshot_prepare to set the lock, and at
the end snapshot_commit deletes the lock.
But in the error path ($err is set):
if ($err) {
    warn "snapshot create failed: starting cleanup\n";
    eval { snapshot_delete($vmid, $snapname, 0, $drivehash); };
    warn $@ if $@;
    die $err;
}
The lock isn't removed. 
What is the correct way to remove a lock in this case? 
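One possible shape for an error path that also releases the lock is sketched below. This is only an illustration under stated assumptions: `%config`, `set_lock`, `clear_lock` and `snapshot_with_unlock` are hypothetical stand-ins, not PVE functions, and the VM config is modelled as an in-memory hash instead of the real config file.

```perl
use strict;
use warnings;

# Hypothetical in-memory stand-in for the VM config files (not PVE's API).
my %config = (100 => {});

# Stand-in for the lock that snapshot_prepare sets.
sub set_lock {
    my ($vmid, $lock) = @_;
    die "VM is locked ($config{$vmid}{lock})\n" if $config{$vmid}{lock};
    $config{$vmid}{lock} = $lock;
}

# Remove the lock only if it is the one we expect to own.
sub clear_lock {
    my ($vmid, $lock) = @_;
    my $cur = $config{$vmid}{lock};
    delete $config{$vmid}{lock} if defined($cur) && $cur eq $lock;
}

# $work performs the snapshot; $cleanup undoes partial work on failure.
sub snapshot_with_unlock {
    my ($vmid, $work, $cleanup) = @_;
    set_lock($vmid, 'snapshot');
    eval { $work->(); };
    my $err = $@;
    if ($err) {
        warn "snapshot create failed: starting cleanup\n";
        eval { $cleanup->(); };
        warn $@ if $@;
        clear_lock($vmid, 'snapshot');   # the step missing in the quoted error path
        die $err;
    }
    clear_lock($vmid, 'snapshot');       # normal path, as snapshot_commit would do
}
```

Whether the lock should really be dropped when the cleanup itself fails is a design decision: keeping it protects a half-removed snapshot from further operations, at the cost of needing a manual `qm unlock <vmid>` afterwards.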
2.) In case of an unexpected failure or signal, ceph/rbd does not remove
its watcher from the image, so the snapshot_delete failed in this case.
Output: 
image has watchers - not removing 
Removing image: 0% complete...failed. 
rbd: error: image still has watchers 
rbd has an automatic timeout after 30s. Should PVE handle this by waiting
30s and trying again?
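The wait-and-retry idea could look roughly like the sketch below. `retry_on_watchers` and its `tries`/`delay` options are hypothetical names, and matching on the text "has watchers" is an assumption based on the output quoted above, not a documented rbd interface.

```perl
use strict;
use warnings;

# Run $code; if it dies with a "watchers" error, sleep and try again.
# Any other error, or the last failed attempt, is re-thrown unchanged.
sub retry_on_watchers {
    my ($code, %opts) = @_;
    my $tries = $opts{tries} // 2;
    my $delay = $opts{delay} // 30;   # seconds; the timeout mentioned above
    for my $attempt (1 .. $tries) {
        my $res = eval { $code->() };
        return $res if !$@;
        my $err = $@;
        die $err if $attempt == $tries || $err !~ /has watchers/;
        warn "image still has watchers, retrying in ${delay}s\n";
        sleep($delay);
    }
}
```

In the cleanup path, the call to snapshot_delete could be wrapped in such a helper, e.g. `retry_on_watchers(sub { snapshot_delete($vmid, $snapname, 0, $drivehash) })`, so that one stale watcher does not leave the snapshot half removed.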
Greets, 
Stefan 
    
    