DPDK usage discussions
* mlx5_common: No Verbs device matches PCI device 0000:01:00.1
From: fwefew 4t4tg @ 2022-04-05 16:17 UTC (permalink / raw)
  To: users

I built the current version of DPDK directly from dpdk.org after I
installed the current OFED Mellanox driver set:

* MLNX_OFED_LINUX-5.5-1.0.3.2-ubuntu20.04-x86_64.iso

with ./install --dpdk

I am using a Mellanox Technologies MT27710 Family [ConnectX-4 Lx] which is
Ethernet only; there is no IB mode for this NIC. This is a MT_2420110034
board. However, when I run dpdk-testpmd I see "No Verbs device matches PCI
device 0000:01:00.1, are kernel drivers loaded?"

EAL: Detected CPU lcores: 16
EAL: Detected NUMA nodes: 1
EAL: Detected static linkage of DPDK
EAL: Selected IOVA mode 'PA'
EAL: No free 2048 kB hugepages reported on node 0
EAL: VFIO support initialized
EAL: Probe PCI driver: mlx5_pci (15b3:1015) device: 0000:01:00.1 (socket 0)
mlx5_common: No Verbs device matches PCI device 0000:01:00.1, are kernel drivers loaded?
mlx5_common: Verbs device not found: 01:00.1
mlx5_common: Failed to initialize device context.
EAL: Requested device 0000:01:00.1 cannot be used
EAL: Bus (pci) probe failed.

As far as I can see all the kernel modules are loaded:

lsmod | egrep "(ib|mlx)" | sort
ib_cm                  53248  2 rdma_cm,ib_ipoib
ib_core               368640  8 rdma_cm,ib_ipoib,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm
ib_ipoib              135168  0
ib_umad                24576  0
ib_uverbs             139264  2 rdma_ucm,mlx5_ib
libahci                36864  1 ahci
libcrc32c              16384  2 btrfs,raid456
mlx5_core            1634304  1 mlx5_ib
mlx5_ib               397312  0
mlx_compat             69632  11 rdma_cm,ib_ipoib,mlxdevm,iw_cm,ib_umad,ib_core,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm,mlx5_core
mlxdevm               172032  1 mlx5_core
mlxfw                  32768  1 mlx5_core
pci_hyperv_intf        16384  1 mlx5_core
psample                20480  1 mlx5_core
tls                    94208  2 bonding,mlx5_core

root@dc-c3-small-x86-01:~/Dev/reinvent/scripts# mst status -v
MST modules:
------------
    MST PCI module is not loaded
    MST PCI configuration module loaded
PCI devices:
------------
DEVICE_TYPE             MST                           PCI       RDMA            NET                       NUMA
ConnectX4LX(rev:0)      /dev/mst/mt4117_pciconf0.1    01:00.1   mlx5_bond_0     net-bond0                 -1

ConnectX4LX(rev:0)      /dev/mst/mt4117_pciconf0      01:00.0   mlx5_bond_0     net-bond0                 -1

* Re: mlx5_common: No Verbs device matches PCI device 0000:01:00.1
From: Erez Ferber @ 2022-04-05 16:59 UTC (permalink / raw)
  To: fwefew 4t4tg; +Cc: users

Hi,

Based on your output, the ConnectX-4 Lx device is configured in LAG mode,
managed via the kernel bonding scripts. In this mode both physical
functions share a single RDMA device (mlx5_bond_0), so you should probe
only the first PCI BDF, 01:00.0, and not the second one.
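
As a rough illustration (not part of the original reply), the check
described above can be sketched in Python: scan `mst status -v` style
output for PCI functions that report the same RDMA device. The sample rows
are copied from this thread with the wrapped lines rejoined; the function
name and the regular expression are assumptions made for this sketch, not
part of any Mellanox tool.

```python
import re
from collections import defaultdict

# Matches "<bus>:<dev>.<fn>  <rdma device>" pairs such as "01:00.1   mlx5_bond_0".
# Assumption: the RDMA column directly follows the PCI column, as in this thread.
PAIR = re.compile(r"\b([0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7])\s+(mlx5_\w+)")


def shared_rdma_devices(mst_output):
    """Return RDMA devices that are reported by more than one PCI function."""
    by_rdma = defaultdict(list)
    for bdf, rdma in PAIR.findall(mst_output):
        by_rdma[rdma].append(bdf)
    return {rdma: sorted(bdfs) for rdma, bdfs in by_rdma.items() if len(bdfs) > 1}


SAMPLE = """\
ConnectX4LX(rev:0)  /dev/mst/mt4117_pciconf0.1  01:00.1  mlx5_bond_0  net-bond0  -1
ConnectX4LX(rev:0)  /dev/mst/mt4117_pciconf0    01:00.0  mlx5_bond_0  net-bond0  -1
"""

if __name__ == "__main__":
    for rdma, bdfs in shared_rdma_devices(SAMPLE).items():
        # In LAG mode only the first function should be handed to DPDK.
        print(f"{rdma} is shared by {bdfs}; probe only {bdfs[0]}")
```

An empty result would suggest that each PCI function has its own RDMA
device, which is the unbonded case.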

By the way, the --dpdk installation flag should not be necessary; it is an
old flag kept for Mellanox OFED builds older than 5.x.

Regards,
Erez

* Re: mlx5_common: No Verbs device matches PCI device 0000:01:00.1
From: fwefew 4t4tg @ 2022-04-05 20:54 UTC (permalink / raw)
  To: Erez Ferber, users

Mr. Ferber, much appreciated. I knew this metal box came to me with its two
Mellanox NICs bonded. I used their utility to unbond them, but alas it did
not do so completely: /etc/network/interfaces was still wrong. I fixed the
config and rebooted.

Voila. Success. ibv_devinfo now shows two devices, and the DPDK app runs.

Your point about bonding nudged me to double-check.

*Now, on a second machine where I believe I have done everything I did on
the first machine*, running the DPDK application continues to look for the
wrong driver:

EAL: Detected CPU lcores: 16
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: libmlx4.so.1: cannot open shared object file: No such file or directory
EAL: FATAL: Cannot init plugins
EAL: Cannot init plugins

Somehow I fixed this on the first machine, but I cannot duplicate success
on the second machine.

Both machines report two devices; both machines show similar ifconfigs

    device             node GUID
    ------          ----------------
    mlx5_0          0c42a103007ea9b8
    mlx5_1          0c42a103007ea9b9

    device             node GUID
    ------          ----------------
    mlx5_0          0c42a103007ea3ec
    mlx5_1          0c42a103007ea3ed

root@server:~/Dev/reinvent/scripts# ibv_devinfo
hca_id: mlx5_0
    transport: InfiniBand (0)
    fw_ver: 14.32.1010
    node_guid: 0c42:a103:007e:a3ec
    sys_image_guid: 0c42:a103:007e:a3ec
    vendor_id: 0x02c9
    vendor_part_id: 4117
    hw_ver: 0x0
    board_id: MT_2420110034
    phys_port_cnt: 1
        port: 1
            state: PORT_ACTIVE (4)
            max_mtu: 4096 (5)
            active_mtu: 1024 (3)
            sm_lid: 0
            port_lid: 0
            port_lmc: 0x00
            link_layer: Ethernet

hca_id: mlx5_1
    transport: InfiniBand (0)
    fw_ver: 14.32.1010
    node_guid: 0c42:a103:007e:a3ed
    sys_image_guid: 0c42:a103:007e:a3ec
    vendor_id: 0x02c9
    vendor_part_id: 4117
    hw_ver: 0x0
    board_id: MT_2420110034
    phys_port_cnt: 1
        port: 1
            state: PORT_ACTIVE (4)
            max_mtu: 4096 (5)
            active_mtu: 1024 (3)
            sm_lid: 0
            port_lid: 0
            port_lmc: 0x00
            link_layer: Ethernet

ibv_devinfo
hca_id: mlx5_0
    transport: InfiniBand (0)
    fw_ver: 14.32.1010
    node_guid: 0c42:a103:007e:a9b8
    sys_image_guid: 0c42:a103:007e:a9b8
    vendor_id: 0x02c9
    vendor_part_id: 4117
    hw_ver: 0x0
    board_id: MT_2420110034
    phys_port_cnt: 1
        port: 1
            state: PORT_ACTIVE (4)
            max_mtu: 4096 (5)
            active_mtu: 1024 (3)
            sm_lid: 0
            port_lid: 0
            port_lmc: 0x00
            link_layer: Ethernet

hca_id: mlx5_1
    transport: InfiniBand (0)
    fw_ver: 14.32.1010
    node_guid: 0c42:a103:007e:a9b9
    sys_image_guid: 0c42:a103:007e:a9b8
    vendor_id: 0x02c9
    vendor_part_id: 4117
    hw_ver: 0x0
    board_id: MT_2420110034
    phys_port_cnt: 1
        port: 1
            state: PORT_ACTIVE (4)
            max_mtu: 4096 (5)
            active_mtu: 1024 (3)
            sm_lid: 0
            port_lid: 0
            port_lmc: 0x00
            link_layer: Ethernet

* Re: mlx5_common: No Verbs device matches PCI device 0000:01:00.1
From: Erez Ferber @ 2022-04-07  8:22 UTC (permalink / raw)
  To: fwefew 4t4tg; +Cc: users

I assume your build tree expects a ConnectX-3 device to be installed,
although the kernel driver has not supported it for quite a while.
I would suggest recompiling with mlx4 PMD support excluded.
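
A minimal sketch (added for illustration, not from the original reply) of
how the failing library could be pulled out of the EAL log and turned into
a build hint. It assumes DPDK's meson `disable_drivers` option and the
driver path `net/mlx4`; the mapping table and function names are
hypothetical, and the option list for your DPDK version should be checked
before relying on it.

```python
import re

# EAL prints one line per shared object that the dynamic loader cannot find.
MISSING_SO = re.compile(
    r"^EAL:\s+(lib[\w+-]+)\.so(?:\.\d+)*: cannot open shared object file",
    re.MULTILINE,
)

# Assumption for this sketch: which DPDK driver needs which provider library.
DRIVER_FOR_LIB = {"libmlx4": "net/mlx4"}


def missing_libraries(eal_log):
    """Return the base names of the libraries EAL reported as missing."""
    return MISSING_SO.findall(eal_log)


def disable_drivers_arg(eal_log):
    """Build a -Ddisable_drivers=... meson argument for the affected drivers."""
    drivers = [
        DRIVER_FOR_LIB[lib]
        for lib in missing_libraries(eal_log)
        if lib in DRIVER_FOR_LIB
    ]
    return "-Ddisable_drivers=" + ",".join(drivers) if drivers else ""


LOG = """\
EAL: Detected CPU lcores: 16
EAL: Detected shared linkage of DPDK
EAL: libmlx4.so.1: cannot open shared object file: No such file or directory
EAL: FATAL: Cannot init plugins
"""

if __name__ == "__main__":
    print(missing_libraries(LOG))
    print(disable_drivers_arg(LOG))
```

The resulting argument would be passed to `meson setup` when configuring
the DPDK build directory.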

Thanks,
Erez

* Re: mlx5_common: No Verbs device matches PCI device 0000:01:00.1
From: fwefew 4t4tg @ 2022-04-07 22:58 UTC (permalink / raw)
  To: Erez Ferber; +Cc: users

This was my error: I linked the code incorrectly. For those new to DPDK,
the correct linking methodology, using cmake, is as follows:

- Build DPDK from dpdk.org. For an example, see
https://github.com/rodgarrison/reinvent and then
https://github.com/rodgarrison/reinvent/blob/main/scripts/install#L58.
While that write-up is for DPDK on AWS ENA NICs, building DPDK is the same
no matter which NIC you are using.

- Configure your link libraries with the help of pkg-config. I do it this
way:
https://github.com/rodgarrison/reinvent/blob/ubuntu/CMakeLists.txt#L27
That sets up the DPDK link command, defaulting to dynamic libraries. Static
linking is also possible; see the else branch. Then use this setup in the
actual application, for example:
https://github.com/rodgarrison/reinvent/blob/ubuntu/integration_tests/reinvent_dpdk_udp/CMakeLists.txt#L43
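
For readers who want to see what the pkg-config step boils down to, here is
a small Python sketch. It is not the CMake code from the linked repository;
it only shows the query a build system would issue. `libdpdk` is the
pkg-config module name installed by DPDK's meson build; the helper names
are made up for this example, and PKG_CONFIG_PATH may need to point at your
DPDK install prefix.

```python
import shlex
import subprocess


def pkg_config_argv(static=False, package="libdpdk"):
    """Build the pkg-config command line that reports DPDK's linker flags."""
    argv = ["pkg-config"]
    if static:
        # --static asks pkg-config to include private (static-link) libraries.
        argv.append("--static")
    argv += ["--libs", package]
    return argv


def split_flags(pkg_config_stdout):
    """Split pkg-config output into individual linker arguments."""
    return shlex.split(pkg_config_stdout)


def dpdk_link_flags(static=False):
    """Run pkg-config and return the linker flags; needs DPDK to be installed."""
    result = subprocess.run(
        pkg_config_argv(static), check=True, capture_output=True, text=True
    )
    return split_flags(result.stdout)


if __name__ == "__main__":
    # Only prints the commands; call dpdk_link_flags() on a machine with DPDK.
    print(" ".join(pkg_config_argv()))
    print(" ".join(pkg_config_argv(static=True)))
```

Letting pkg-config produce the library list keeps the link line consistent
with how DPDK was actually built, instead of naming individual driver
libraries by hand.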

