On Tue, Dec 07, 2021 at 10:19:42AM +0100, Jiri Denemark wrote:
Userfaultfd is by default allowed only for privileged processes. Since
libvirt runs QEMU unprivileged, we need to enable unprivileged access to
userfaultfd before starting post-copy migration.
Rather than providing a static sysctl configuration file, we set the
sysctl knob at runtime once post-copy migration is requested. This way
unprivileged_userfaultfd is enabled only when it is actually used.
I'm really not a fan of silently changing sysctl knobs on the
fly like this, as it means the change is essentially invisible
to the host admin.
IIUC, the kernel change was made out of fear of the risk this
feature exposes the kernel to when combined with other flaws.
Now I don't know how valid that fear is, but given that starting
point, I think if we're going to change it, then the change ought
to be visible to admins in a fairly obvious way.
IOW, something ought to be dropping a file into /etc/sysctl.d/
that enables it. The downside then is that it applies to all installs,
even if they don't migrate. The flipside is that a default of 1 has
been the historical value since postcopy first arrived, so all QEMU
installs always had this behaviour.
If we drop in a file 50-qemu-postcopy.conf, someone else can drop
in a file 55-turn-it-off-again.conf to override our default.
Still, this all feels so awful every way I look at it :-(
+/**
+ * qemuMigrationParamsEnableUserfaultfd
+ *
+ * Try to enable unprivileged userfaultfd unless it's missing or already
+ * enabled. Only a warning is logged when we cannot enable it, QEMU will
+ * report an error when enabling post-copy migration capability.
+ */
+static void
+qemuMigrationParamsEnableUserfaultfd(void)
+{
+    const char *sysctl = "/proc/sys/vm/unprivileged_userfaultfd";
+    g_autofree char *buf = NULL;
+
+    if (!virFileExists(sysctl))
+        return;
+
+    if (virFileReadAll(sysctl, 10, &buf) < 0) {
+        VIR_WARN("Cannot read unprivileged userfaultfd state");
+        return;
+    }
+
+    if (STREQ(buf, "1\n")) {
+        VIR_DEBUG("Unprivileged userfaultfd already enabled");
+        return;
+    }
+
+    VIR_DEBUG("Enabling unprivileged userfaultfd for post-copy migration");
+
+    if (virFileWriteStr(sysctl, "1", 0) < 0) {
+        VIR_WARN("Failed to enable unprivileged userfaultfd: %s",
+                 g_strerror(errno));
Why only a warning? Surely we know it is going to fail at this
point, and QEMU will probably give an obscure EPERM error, while
we're in a position to tell the user exactly what's wrong with
the sysctl.
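
To illustrate what I mean by failing hard, here is a rough sketch in
plain C rather than with libvirt's helpers. The function name
enable_unprivileged_userfaultfd, its parameters and the message wording
are all hypothetical; real code would presumably keep using
virFileWriteStr and report through libvirt's usual error macros. The
knob path is a parameter only so the logic can be exercised against an
ordinary file.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical helper, not libvirt code.
 * Returns 0 if unprivileged userfaultfd is usable (knob missing, already
 * set to 1, or successfully set to 1). Returns -1 and fills errbuf with a
 * message naming the sysctl when it cannot be enabled. */
static int
enable_unprivileged_userfaultfd(const char *knob, char *errbuf, size_t errlen)
{
    char state[16] = "";
    int failed = 0;
    int saved_errno = 0;
    FILE *fp;

    assert(knob != NULL && errbuf != NULL && errlen > 0);
    errbuf[0] = '\0';

    /* Same as the patch above: nothing to do when the knob is missing. */
    if (access(knob, F_OK) < 0)
        return 0;

    if (!(fp = fopen(knob, "r"))) {
        snprintf(errbuf, errlen, "cannot read %s: %s",
                 knob, strerror(errno));
        return -1;
    }
    if (!fgets(state, sizeof(state), fp))
        state[0] = '\0';    /* unreadable or empty: treat as not enabled */
    fclose(fp);

    if (strcmp(state, "1\n") == 0 || strcmp(state, "1") == 0)
        return 0;

    /* Try to flip the knob and remember the first failure. */
    if (!(fp = fopen(knob, "w"))) {
        failed = 1;
        saved_errno = errno;
    } else {
        if (fputs("1", fp) == EOF) {
            failed = 1;
            saved_errno = errno;
        }
        if (fclose(fp) == EOF && !failed) {
            failed = 1;
            saved_errno = errno;
        }
    }

    if (failed) {
        /* Report the real cause here instead of leaving it to QEMU. */
        snprintf(errbuf, errlen,
                 "post-copy migration needs unprivileged userfaultfd, "
                 "but setting %s to 1 failed: %s",
                 knob, strerror(saved_errno));
        return -1;
    }

    return 0;
}
```

A caller would pass "/proc/sys/vm/unprivileged_userfaultfd" and refuse
to start post-copy when -1 comes back, so the user sees the sysctl named
in the error rather than a bare EPERM from QEMU.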
Regards,
Daniel
--
|: https://berrange.com      -o-    https://www.flickr.com/photos/dberrange :|
|: https://libvirt.org         -o-            https://fstop138.berrange.com :|
|: https://entangle-photo.org    -o-    https://www.instagram.com/dberrange :|