On Fri, 2020-11-13 at 16:23 -0300, Daniel Henrique Barboza wrote:
On 11/13/20 10:51 AM, Andrea Bolognani wrote:
> I only skimmed the remaining patches and didn't dig into the code as
> much, or as recently, as you have, but from a high-level perspective
> I don't see why you wouldn't be able to simply move the existing
> rounding logic from the command line generator to PostParse? It's not
> like the former has access to additional information that the latter
> can't get to, right?
I was looking into the code and I think I might have had the wrong idea here.
Apparently we're not aligning memory during migration or snapshot restore.
This specific line in qemu_command.c got my attention:
-- qemuBuildCommandLine() --
if (!migrateURI && !snapshot && qemuDomainAlignMemorySizes(def) < 0)
    return NULL;
------
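For reference, here is a minimal standalone sketch of the kind of rounding
that call performs on ppc64 (the ROUND_UP macro mirrors libvirt's
VIR_ROUND_UP(), and the 256MiB alignment value is my assumption for ppc64,
so take the names and numbers as illustrative rather than as libvirt's
actual code):

#include <stdio.h>

/* round-up macro in the style of libvirt's VIR_ROUND_UP() */
#define ROUND_UP(value, align) (((value) + (align) - 1) / (align) * (align))

int main(void)
{
    unsigned long long alignKiB = 256ULL * 1024;  /* assumed ppc64 alignment: 256MiB */
    unsigned long long memKiB = 4196ULL * 1024;   /* e.g. 4196MiB requested in the XML */

    printf("%lluMiB rounded up to %lluMiB\n",
           memKiB / 1024, ROUND_UP(memKiB, alignKiB) / 1024);
    return 0;
}

This prints "4196MiB rounded up to 4352MiB", i.e. the guest ends up with
more initial memory than the XML asked for whenever the requested size is
not already a multiple of the alignment.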
I investigated the history behind this logic and found the following commit:
commit c7d7ba85a6242d789ba3f4dae313e950fbb638c5
Author: Peter Krempa <pkrempa(a)redhat.com>
Date: Thu Sep 17 08:14:05 2015 +0200
qemu: command: Align memory sizes only on fresh starts
When we are starting a qemu process for an incomming migration or
snapshot reloading we should not modify the memory sizes in the domain
since we could potentially change the guest ABI that was tediously
checked before. Additionally the function now updates the initial memory
size according to the NUMA node size, which should not happen if we are
restoring state.
Resolves:
https://bugzilla.redhat.com/show_bug.cgi?id=1252685
---------
This means that the changes made in this series will not break migration,
since the alignment of 'initialmem' is not being triggered in those cases,
which is good.
However, you also brought up in an earlier reply that these changes might
break "the guest ABI across guest boots (if libvirt is upgraded in between
them)". This can't be helped, I think: an older ppc64 guest with 4GiB of
'currentMemory' in the XML, which ended up with 4.25GiB (an extra 256MiB)
due to the way alignment was being done, will have exactly 4GiB of
'initialmem' after these changes. My point is that, on a fresh guest start,
we're giving the guest exactly the memory the domain XML asked for. This
can probably be considered an ABI break, but why would anyone complain that
libvirt is now giving precisely what was asked for? Not granting an extra
256MiB to these domains seems worth it, given that we're not impacting live
domains or migration.
In general, changing guest ABI between boots is not something that we
want to happen.
I have trouble keeping all the details of memory alignment inside my
head and I can't quite spend the time necessary to swap them back in
right now, so apologies in advance if I'm being vague, and of course
correct me if I'm wrong... Having Peter in the thread will also
surely help with that :)
The aim of this series should *not* be to change the calculations
that happen when aligning memory, but only to reflect them back to
the domain XML where they can be queried: so for example if the input
<memory unit='GiB'>4</memory>
<devices>
<memory model='nvdimm'>
<target>
<size unit='MiB'>500</size>
results in the command line
-m 4352m
-object memory-backend-file,size=512m
(the exact numbers are not relevant), then what we want is for the
XML to be updated at define time so that it reads
<memory unit='MiB'>4864</memory>
<devices>
<memory model='nvdimm'>
<target>
<size unit='MiB'>512</size>
(again, the numbers are almost certainly wrong) and we want *that*
XML to generate the same QEMU command line as before.
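If it helps, here is a small self-contained sketch of the property I'm
after, assuming the same kind of round-up alignment as in the earlier
sketch and purely illustrative numbers: because rounding up is idempotent,
writing the aligned value back into the XML at define time and then
aligning it again at start-up yields the same -m value, and therefore the
same command line.

#include <assert.h>
#include <stdio.h>

#define ROUND_UP(value, align) (((value) + (align) - 1) / (align) * (align))

int main(void)
{
    unsigned long long alignKiB = 256ULL * 1024;      /* assumed 256MiB alignment */
    unsigned long long requestedKiB = 4196ULL * 1024; /* what the user wrote in the XML */

    /* today: align only while building the command line */
    unsigned long long oldCmdline = ROUND_UP(requestedKiB, alignKiB);

    /* proposed: align at define/post-parse time, store the result back in
     * the XML, then align again (harmlessly) when the guest is started */
    unsigned long long storedInXML = ROUND_UP(requestedKiB, alignKiB);
    unsigned long long newCmdline = ROUND_UP(storedInXML, alignKiB);

    assert(oldCmdline == newCmdline);
    printf("-m %llum in both cases\n", newCmdline / 1024);
    return 0;
}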
If this can't be achieved, or there are other side effects to it that
I haven't considered, then we're better off leaving the current
behavior alone (documenting the heck out of it if necessary) instead
of changing it in ways that would alter guest ABI between boots.
--
Andrea Bolognani / Red Hat / Virtualization