Dear libvirt community,
Using recent Ubuntu Stein Cloud
Packages, we are observing random libvirtd live-migration
crashes on the target host.
Libvirt is having a SEGFAULT with the
qemu driver. Transferring block devices usually works without
issues.
However, the following memory transfer
is causing the target libvirtd randomly to close down its
socket, resulting in a roll-backed migration process.
I can reproduce this with large VMs, which have a large memory
pool.
The last error message we see in libvirt logs is:
error : qemuMonitorJSONCommandWithFd:315 : internal error: Missing
monitor reply object
With this, libvirt segfaults and restarts.
Before we encountered this issue, we used an older nova-compute
package (19.0.3).
Not sure if this made a difference with usage of libvirtd-api.
After upgrade, we also see a lot of recurring errors during
migration:
warning : qemuDomainObjBeginJobInternal:7044 : Cannot start job
(query, none, none) for domain instance-00008f56; current job is
(none, none, migration in) owned by (0 <null>, 0
<null>, 0 remoteDispatchDomainMigratePrepare3Params
(flags=0x809b)) for (0s, 0s, 14834s)
error : qemuDomainObjBeginJobInternal:7066 : Timed out during
operation: cannot acquire state change lock (held by
monitor=remoteDispatchDomainMigratePrepare3Params)
They don't abort the running migration process, but spam every
minute to the systemd journal.
Source and destination run the same packages:
Ubuntu 18.04.4 LTS (GNU/Linux 4.15.0-99-generic x86_64)
OpenStack Stein (Ubuntu Cloud Archive)
Libvirt+QEMU_x86
keystone-common 2:15.0.1-0ubuntu1~cloud0
libvirt-daemon 5.0.0-1ubuntu2.6~cloud0
qemu-system-x86 1:3.1+dfsg-2ubuntu3.7~cloud0
neutron-linuxbridge-agent 2:14.2.0-0ubuntu1~cloud0
neutron-plugin-ml2 2:14.2.0-0ubuntu1~cloud0
nova-compute 2:19.2.0-0ubuntu1~cloud0
nova-compute-libvirt 2:19.2.0-0ubuntu1~cloud0
I have attached source/destination
debug logs from libvirtd and nova-compute here:
https://denzelx.ddns.net/index.php/s/KPJ7vv4aTcb69XD
Any help would be nice!
Best Regards
--
M.Sc Alex Walender
de.NBI Cloud Bielefeld Administrator
Center for Biotechnology (CeBiTec)
University of Bielefeld
33594 Bielefeld
Germany
room: M3-118
phone: +49 (521) 106 2907