On 09/18/2017 09:24 PM, Wuzongyong (Euler Dept) wrote:

Hi,

 

In function virPCIDeviceIsBehindSwitchLackingACS, I noticed that(line 8):

 

1    if (virPCIDeviceGetParent(dev, &parent) < 0)

2        return -1;

3    if (!parent) {

4        /* if we have no parent, and this is the root bus, ACS doesn't come

5         * into play since devices on the root bus can't P2P without going

6         * through the root IOMMU.

7         */

8        if (dev->address.bus == 0) {

9            return 0;

10        } else {

11            virReportError(VIR_ERR_INTERNAL_ERROR,

12                           _("Failed to find parent device for %s"),

13                           dev->name);

14            return -1;

15        }

16    }

 

Why we just return 0 only if device’s bus is 0?

In my server, I can see a root bus which bus number is greater than 0, see the

results(just a part) after I run lspci -t:

 

+-[0000:80]-+-02.0-[81-83]--+-00.0

|           |               \-00.1

|           +-05.0

|           +-05.1

|           +-05.2

|           \-05.4

+-[0000:7f]-+-08.0

|           +-08.2

|           +-08.3

|           + . . .

|           \-1f.2

\-[0000:00]-+-00.0

             +-01.0-[01]----00.0

             +-02.0-[02]--+-00.0

             |            +-00.1

             |            +-00.2

             |            \-00.3

             +-02.2-[03]--

             +-03.0-[04-0b]----00.0-[05-0b]--+-08.0-[06-08]----00.0

             |                               \-10.0-[09-0b]----00.0

             +-05.0

             +-05.1

             +-05.2

             +-05.4

             +-11.0

             +-11.4

             +-16.0

             +-16.1

             +-1a.0

 

If I assign the device 0000:81:00.0 to a VM, I get “Failed to find parent device”.

I think I should get no error with return value 0 just like bus number is 0, because

bus 80 is the root bus as well in my case.

 

In the <<Intel C610 Series Chipset and Intel X99 Chipset Platform Controller Hub(PCH)>>

Datasheet, I found that(Chapter 9.1):

For some server platforms, it may be desirable to have multiple PCHs in the system

Which means some PCH’s may reside on a bus greater than 0.

 

So, is this a bug?


My memory is that if you're using VFIO for device assignment, all that checking should be performed by VFIO, and libvirt shouldn't be checking for ACS at all. (Alex, can you confirm or refute this?)

virPCIDeviceIsBehindSwitchLackingACS() is only called from virPCIDeviceIsAssignable(), and that function is only called if the device's stubDriver is set to something other than "vfio-pci" (see step 1 in virHostdevPreparePCIDevices()). Digging deeper, it looks like the device's stubDriver is set by virHostdevGetPCIHostDeviceList(), which appears to set it to vfio-pci if the backend is specified as vfio (i.e. <driver name='vfio'/> in the libvirt XML. This *should* be the default setting!)

Since I assume you're not using RHEL6, meaning that you will be using VFIO by default, not legacy KVM assignment.

TL;DR I think the bug here is that the ...CheckACS function is being called *at all*. That code path should be completely obsolete.