[PVE-User] Live migration fails with "Mismatched RAM page size ram-node0 (local) 2097152 != 1526773257204281392"

Stefan Reiter s.reiter at proxmox.com
Thu Feb 4 17:59:26 CET 2021


On 02/02/2021 15:50, Sergey Korobkov wrote:
> Hello,
> 
> Two exactly the same machines ( except ram size 380G and 1.5T ).
> 
> Upgraded on Debian 10.7 from:
> pve-manager/6.1-5/9bf06119
> Linux 5.3.13-1-pve #1 SMP PVE 5.3.13-1 (Thu, 05 Dec 2019 07:18:14 +0100)
> QEMU emulator version 4.1.1 (pve-qemu-kvm_4.1.1)
> 
> to:
> pve-manager/6.3-3/eee5f901
> Linux 5.4.78-2-pve #1 SMP PVE 5.4.78-2 (Thu, 03 Dec 2020 14:26:17 +0100)
> QEMU emulator version 5.1.0 (pve-qemu-kvm_5.1.0)
> 
> We had enabled hugepages for virtual machines( "hugepages: 2" specified 
> in virtual machine description).
> 
> Live migration fails with errors like this:
> 
> Feb 02 16:26:13 QEMU[12090]: kvm7: load of migration failed: Invalid 
> argument
> Feb 02 16:26:13 QEMU[12090]: kvm7: error while loading state for
> instance 0x0 of device 'ram'
> Feb 02 16:26:13 QEMU[12090]: kvm7: Mismatched RAM page size ram-node0
> (local) 2097152 != 1526773257204281392
> 
> We think it's some overflow issue.
> 

Hi!

After looking carefully I believe to have found the root cause of this 
issue in an upstream bug that we run into since pve-qemu-kvm 5.1.0-4, 
where we started migrating dirty bitmaps.

I have sent a potential fix to the upstream qemu-devel mailing list:
https://lists.nongnu.org/archive/html/qemu-devel/2021-02/msg01711.html

If the resident experts on there agree that this is indeed the solution, 
we will most likely ship it once we release our QEMU 5.2 build :)

Thanks for the report!





More information about the pve-user mailing list