Hi,

I have an issue with DPDK 24.11.1 and 2 port 100G Intel NIC (E810-C) on 22 core CPU dual socket server.

 

There is a dedicated CPU core to get the packets from DPDK using rte_eth_rx_burst() and enqueue the mbufs into a worker ring Q.  This thread does nothing else.  The NIC is dropping packets at 8.5 Gbps per port.

 

Studying the perf report, I was interested in the common_ring_mc_dequeue().  Perf tool shows common_ring_mc_dequeue() 92.86% Self and 92.86% Children.

 

I see further with perf tool rte_ring_enqueue_bulk() and rte_ring_enqueue_bulk_elem().  These are at 0.00% Self and 0.05% Children.

Perf tool shows rte_ring_sp_enqueue_bulk_elem (inlined) which is what I wanted to see (Single producer) representing the enqueue of the mbufs pointers to the worker ring Q.

 

Is it possible to change the common_ring_mc_dequeue() to common_ring_sc_dequeue()?  Can it be set to one consumer on single Queue 0.

 

I believe this is limiting DPDK from reaching 90 Gbps or higher in my setup, which is my goal.

 

I made sure the E810-C firmware was up to date, NIC FW Version: 4.80 0x80020543 1.3805.0

 

Perf report shows:

   - 99.65% input_thread                                                                                                                                                                   

      - 99.35% rte_eth_rx_burst (inlined)                                                                                                                                            

         - ice_recv_scattered_pkts                                                                                                                                                           

              92.83% common_ring_mc_dequeue                                    

 

Any thoughts or suggestions?

 

Thanks,

Ed