From: Maxime Coquelin <maxime.coquelin@redhat.com>
To: "Mattias Rönnblom" <hofors@lysator.liu.se>
Cc: "Mattias Rönnblom" <mattias.ronnblom@ericsson.com>,
dev@dpdk.org, "Morten Brørup" <mb@smartsharesystems.com>,
"Stephen Hemminger" <stephen@networkplumber.org>,
"Abdullah Sevincer" <abdullah.sevincer@intel.com>,
"Pavan Nikhilesh" <pbhagavatula@marvell.com>,
"David Hunt" <david.hunt@intel.com>,
"Vladimir Medvedkin" <vladimir.medvedkin@intel.com>,
"Bruce Richardson" <bruce.richardson@intel.com>
Subject: Re: [PATCH v4 00/13] Optionally have rte_memcpy delegate to compiler memcpy
Date: Wed, 26 Jun 2024 10:37:31 +0200 [thread overview]
Message-ID: <3eebd7f7-9ba2-424c-80d1-6efa8945641d@redhat.com> (raw)
In-Reply-To: <ZnsaCj1lupVs2JZe@isengard>
On 6/25/24 21:27, Mattias Rönnblom wrote:
> On Tue, Jun 25, 2024 at 05:29:35PM +0200, Maxime Coquelin wrote:
>> Hi Mattias,
>>
>> On 6/20/24 19:57, Mattias Rönnblom wrote:
>>> This patch set make DPDK library, driver, and application code use the
>>> compiler/libc memcpy() by default when functions in <rte_memcpy.h> are
>>> invoked.
>>>
>>> The various custom DPDK rte_memcpy() implementations may be retained
>>> by means of a build-time option.
>>>
>>> This patch set only make a difference on x86, PPC and ARM. Loongarch
>>> and RISCV already used compiler/libc memcpy().
>>
>> It indeed makes a difference on x86!
>>
>> Just tested latest main with and without your series on
>> Intel(R) Xeon(R) Gold 6438N.
>>
>> The test is a simple IO loop between a Vhost PMD and a Virtio-user PMD:
>> # dpdk-testpmd -l 4-6 --file-prefix=virtio1 --no-pci --vdev 'net_virtio_user0,mac=00:01:02:03:04:05,path=./vhost-net,server=1,mrg_rxbuf=1,in_order=1'
>> --single-file-segments -- -i
>> testpmd> start
>>
>> # dpdk-testpmd -l 8-10 --file-prefix=vhost1 --no-pci --vdev
>> 'net_vhost0,iface=vhost-net,client=1' --single-file-segments -- -i
>> testpmd> start tx_first 32
>>
>> Latest main: 14.5Mpps
>> Latest main + this series: 10Mpps
>>
>
> I ran the above benchmark on my Raptor Lake desktop (locked to 3,2
> GHz). GCC 12.3.0.
>
> Core use_cc_memcpy Mpps
> E false 9.5
> E true 9.7
> P false 16.4
> P true 13.5
>
> On the P-cores, there's a significant performance regression, although
> not as bad as the one you see on your Sapphire Rapids Xeon. On the
> E-cores, there's actually a slight performance gain.
>
> The virtio PMD does not directly invoke rte_memcpy() or anything else
> from <rte_memcpy.h>, but rather use memcpy(), so I'm not sure I
> understand what's going on here. Does the virtio driver delegate some
> performance-critical task to some module that in turns uses
> rte_memcpy()?
This is because Vhost is the bottleneck here, not Virtio driver.
Indeed, the virtqueues memory belongs to the Virtio driver and the
descriptors buffers are Virtio's mbufs, so not much memcpy's are done
there.
Vhost however, is a heavy memcpy user, as all the descriptors buffers
are copied to/from its mbufs.
>> So for me, it should be disabled by default.
>>
>> Regards,
>> Maxime
>>
>>> This patch set includes a number of fixes in drivers and libraries
>>> which errornously relied on <rte_memcpy.h> including header files
>>> (i.e., <rte_vect.h>) required by its implementation.
>>>
>>> Mattias Rönnblom (13):
>>> net/i40e: add missing vector API header include
>>> net/iavf: add missing vector API header include
>>> net/ice: add missing vector API header include
>>> net/ixgbe: add missing vector API header include
>>> net/ngbe: add missing vector API header include
>>> net/txgbe: add missing vector API header include
>>> net/virtio: add missing vector API header include
>>> net/fm10k: add missing vector API header include
>>> event/dlb2: include headers for vector and memory copy APIs
>>> net/octeon_ep: add missing vector API header include
>>> distributor: add missing vector API header include
>>> fib: add missing vector API header include
>>> eal: provide option to use compiler memcpy instead of RTE
>>>
>>> config/meson.build | 1 +
>>> doc/guides/rel_notes/release_24_07.rst | 21 +++++++
>>> drivers/event/dlb2/dlb2.c | 2 +
>>> drivers/net/fm10k/fm10k_rxtx_vec.c | 3 +-
>>> drivers/net/i40e/i40e_rxtx_vec_sse.c | 3 +-
>>> drivers/net/iavf/iavf_rxtx_vec_sse.c | 3 +-
>>> drivers/net/ice/ice_rxtx_vec_sse.c | 2 +-
>>> drivers/net/ixgbe/ixgbe_rxtx_vec_sse.c | 3 +-
>>> drivers/net/ngbe/ngbe_rxtx_vec_sse.c | 3 +-
>>> drivers/net/octeon_ep/otx_ep_ethdev.c | 2 +
>>> drivers/net/txgbe/txgbe_rxtx_vec_sse.c | 3 +-
>>> drivers/net/virtio/virtio_rxtx_simple_sse.c | 3 +-
>>> lib/distributor/rte_distributor.c | 1 +
>>> lib/eal/arm/include/rte_memcpy.h | 10 ++++
>>> lib/eal/include/generic/rte_memcpy.h | 61 ++++++++++++++++++---
>>> lib/eal/loongarch/include/rte_memcpy.h | 53 ++----------------
>>> lib/eal/ppc/include/rte_memcpy.h | 10 ++++
>>> lib/eal/riscv/include/rte_memcpy.h | 53 ++----------------
>>> lib/eal/x86/include/meson.build | 1 +
>>> lib/eal/x86/include/rte_memcpy.h | 11 +++-
>>> lib/fib/trie.c | 1 +
>>> meson_options.txt | 2 +
>>> 22 files changed, 131 insertions(+), 121 deletions(-)
>>>
>>
>
next prev parent reply other threads:[~2024-06-26 8:37 UTC|newest]
Thread overview: 128+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-05-27 11:11 [RFC] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-05-28 7:43 ` [RFC v2] " Mattias Rönnblom
2024-05-28 8:19 ` Mattias Rönnblom
2024-05-28 8:27 ` Bruce Richardson
2024-05-28 8:59 ` Mattias Rönnblom
2024-05-28 9:07 ` Morten Brørup
2024-05-28 16:17 ` Mattias Rönnblom
2024-05-28 14:59 ` Stephen Hemminger
2024-05-28 15:09 ` Bruce Richardson
2024-05-31 5:19 ` Mattias Rönnblom
2024-05-31 16:50 ` Stephen Hemminger
2024-06-02 11:33 ` Mattias Rönnblom
2024-05-28 16:03 ` Mattias Rönnblom
2024-05-29 21:55 ` Stephen Hemminger
2024-05-28 8:20 ` Bruce Richardson
2024-06-02 12:39 ` [RFC v3 0/5] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-06-02 12:39 ` [RFC v3 1/5] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-06-05 6:49 ` [PATCH 0/5] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-06-05 6:49 ` [PATCH 1/5] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-06-05 6:49 ` [PATCH 2/5] net/octeon_ep: properly include vector API header file Mattias Rönnblom
2024-06-05 6:49 ` [PATCH 3/5] distributor: " Mattias Rönnblom
2024-06-10 14:27 ` Tyler Retzlaff
2024-06-05 6:49 ` [PATCH 4/5] fib: " Mattias Rönnblom
2024-06-10 14:28 ` Tyler Retzlaff
2024-06-05 6:49 ` [PATCH 5/5] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-06-20 7:24 ` [PATCH v2 0/6] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-06-20 7:24 ` [PATCH v2 1/6] net/fm10k: add missing intrinsic include Mattias Rönnblom
2024-06-20 9:02 ` Bruce Richardson
2024-06-20 9:28 ` Bruce Richardson
2024-06-20 11:40 ` Mattias Rönnblom
2024-06-20 11:59 ` Bruce Richardson
2024-06-20 11:50 ` [PATCH v3 0/6] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 1/6] net/fm10k: add missing vector API header include Mattias Rönnblom
2024-06-20 12:34 ` Bruce Richardson
2024-06-20 17:57 ` [PATCH v4 00/13] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 01/13] net/i40e: add missing vector API header include Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 0/6] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 1/6] net/octeon_ep: add missing vector API header include Mattias Rönnblom
2024-09-20 10:27 ` [PATCH v6 0/7] Optionally have rte_memcpy delegate to compiler memcpy Mattias Rönnblom
2024-09-20 10:27 ` [PATCH v6 1/7] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-10-09 20:59 ` Morten Brørup
2024-10-09 22:01 ` Stephen Hemminger
2024-09-20 10:27 ` [PATCH v6 2/7] net/octeon_ep: add missing vector API header include Mattias Rönnblom
2024-10-09 21:00 ` Morten Brørup
2024-09-20 10:27 ` [PATCH v6 3/7] distributor: " Mattias Rönnblom
2024-10-09 21:00 ` Morten Brørup
2024-09-20 10:27 ` [PATCH v6 4/7] fib: " Mattias Rönnblom
2024-10-09 21:00 ` Morten Brørup
2024-09-20 10:27 ` [PATCH v6 5/7] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-10-04 7:52 ` David Marchand
2024-10-04 9:21 ` Mattias Rönnblom
2024-10-04 9:54 ` David Marchand
2024-10-04 12:07 ` Thomas Monjalon
2024-10-04 9:27 ` Mattias Rönnblom
2024-09-20 10:27 ` [PATCH v6 6/7] ci: test compiler memcpy Mattias Rönnblom
2024-10-04 7:56 ` David Marchand
2024-10-09 21:04 ` Morten Brørup
2024-09-20 10:27 ` [PATCH v6 7/7] vhost: optimize memcpy routines when cc memcpy is used Mattias Rönnblom
2024-10-03 11:46 ` Maxime Coquelin
2024-10-09 21:25 ` Morten Brørup
2024-10-10 10:29 ` Mattias Rönnblom
2024-10-09 21:57 ` Stephen Hemminger
2024-10-10 10:35 ` Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 2/6] distributor: add missing vector API header include Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 3/6] fib: " Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 4/6] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 5/6] ci: test compiler memcpy Mattias Rönnblom
2024-07-24 7:53 ` [PATCH v5 6/6] vhost: optimize memcpy routines when cc memcpy is used Mattias Rönnblom
2024-07-29 11:00 ` Morten Brørup
2024-07-29 19:27 ` Mattias Rönnblom
2024-07-29 19:56 ` Morten Brørup
2024-06-20 17:57 ` [PATCH v4 02/13] net/iavf: add missing vector API header include Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 03/13] net/ice: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 04/13] net/ixgbe: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 05/13] net/ngbe: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 06/13] net/txgbe: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 07/13] net/virtio: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 08/13] net/fm10k: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 09/13] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 10/13] net/octeon_ep: add missing vector API header include Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 11/13] distributor: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 12/13] fib: " Mattias Rönnblom
2024-06-20 17:57 ` [PATCH v4 13/13] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-06-21 15:19 ` Stephen Hemminger
2024-06-24 10:05 ` Thomas Monjalon
2024-06-24 17:56 ` Mattias Rönnblom
2024-06-25 13:06 ` Mattias Rönnblom
2024-06-25 13:34 ` Thomas Monjalon
2024-06-20 18:53 ` [PATCH v4 00/13] Optionally have rte_memcpy delegate to compiler memcpy Morten Brørup
2024-06-21 6:56 ` Mattias Rönnblom
2024-06-21 7:04 ` David Marchand
2024-06-21 7:35 ` Mattias Rönnblom
2024-06-21 7:41 ` David Marchand
2024-06-25 15:29 ` Maxime Coquelin
2024-06-25 15:44 ` Stephen Hemminger
2024-06-25 19:27 ` Mattias Rönnblom
2024-06-26 8:37 ` Maxime Coquelin [this message]
2024-06-26 14:58 ` Stephen Hemminger
2024-06-26 15:24 ` Maxime Coquelin
2024-06-26 18:47 ` Mattias Rönnblom
2024-06-26 20:16 ` Morten Brørup
2024-06-27 11:06 ` Mattias Rönnblom
2024-06-27 15:10 ` Stephen Hemminger
2024-06-27 15:23 ` Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 2/6] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 3/6] net/octeon_ep: add missing vector API header include Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 4/6] distributor: " Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 5/6] fib: " Mattias Rönnblom
2024-06-20 11:50 ` [PATCH v3 6/6] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-06-20 7:24 ` [PATCH v2 2/6] event/dlb2: include headers for vector and memory copy APIs Mattias Rönnblom
2024-06-20 9:03 ` Bruce Richardson
2024-06-20 7:24 ` [PATCH v2 3/6] net/octeon_ep: properly include vector API header file Mattias Rönnblom
2024-06-20 14:43 ` Stephen Hemminger
2024-06-20 7:24 ` [PATCH v2 4/6] distributor: " Mattias Rönnblom
2024-06-20 9:13 ` Bruce Richardson
2024-06-20 7:24 ` [PATCH v2 5/6] fib: " Mattias Rönnblom
2024-06-20 9:14 ` Bruce Richardson
2024-06-20 14:43 ` Stephen Hemminger
2024-06-20 7:24 ` [PATCH v2 6/6] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-06-02 12:39 ` [RFC v3 2/5] net/octeon_ep: properly include vector API header file Mattias Rönnblom
2024-06-02 12:39 ` [RFC v3 3/5] distributor: " Mattias Rönnblom
2024-06-02 12:39 ` [RFC v3 4/5] fib: " Mattias Rönnblom
2024-06-02 12:39 ` [RFC v3 5/5] eal: provide option to use compiler memcpy instead of RTE Mattias Rönnblom
2024-06-02 20:58 ` Morten Brørup
2024-06-03 17:04 ` Mattias Rönnblom
2024-06-03 17:08 ` Stephen Hemminger
2024-05-29 21:56 ` [RFC] " Stephen Hemminger
2024-06-02 11:30 ` Mattias Rönnblom
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3eebd7f7-9ba2-424c-80d1-6efa8945641d@redhat.com \
--to=maxime.coquelin@redhat.com \
--cc=abdullah.sevincer@intel.com \
--cc=bruce.richardson@intel.com \
--cc=david.hunt@intel.com \
--cc=dev@dpdk.org \
--cc=hofors@lysator.liu.se \
--cc=mattias.ronnblom@ericsson.com \
--cc=mb@smartsharesystems.com \
--cc=pbhagavatula@marvell.com \
--cc=stephen@networkplumber.org \
--cc=vladimir.medvedkin@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).