On 2/6/19 1:51 PM, Erik Skultety wrote:
On Wed, Feb 06, 2019 at 01:44:53PM -0200, Daniel Henrique Barboza
wrote:
> QEMU virtual machines with PCI passthrough of certain devices,
> such as the NVIDIA Tesla V100 GPU, requires allocation of an
> amount of memory pages that can break KVM limits. When that
Wow, ^this means only 1 thing, this Tesla card requires more than 1GB extra
memory to be locked, is that right?
Yes. Just checked in the dmesg of the guest out of curiosity. The very
fist big
DMA call (the one that causes the IPC_LOCK issue) is a bit larger than that
already.
There are a few nuances to be considered. As I mentioned earlier in Alex's
reply, this is a custom QEMU that has NVLink2 passthrough support. The
V100 card has 16GB of RAM that is allocated in its own NUMA node, and the
pseries machine emulates this behavior. So I am not sure if these DMAs are
locking regular host RAM or the GPU memory to populate the NUMA node
the guest uses.
If the latter, then a case can be made about allowing QEMU running with
IPC_LOCK - even if we limit it to the pseries machine assigning a Tesla
V100 device (although Yuval reported a similar issue with a Mellanox mlx5
card replied in the previous thread ...).
Thanks,
DHB
Erik