From: "Mattias Rönnblom" <hofors@lysator.liu.se>
To: "Morten Brørup" <mb@smartsharesystems.com>,
"Mattias Rönnblom" <mattias.ronnblom@ericsson.com>,
dev@dpdk.org, maxime.coquelin@redhat.com
Cc: Stephen Hemminger <stephen@networkplumber.org>,
David Marchand <david.marchand@redhat.com>,
Pavan Nikhilesh <pbhagavatula@marvell.com>,
Bruce Richardson <bruce.richardson@intel.com>
Subject: Re: [PATCH v6 7/7] vhost: optimize memcpy routines when cc memcpy is used
Date: Thu, 10 Oct 2024 12:29:59 +0200
Message-ID: <99ebb9d7-7da2-4d57-9e4d-81b5d90c6ddd@lysator.liu.se>
In-Reply-To: <98CBD80474FA8B44BF855DF32C47DC35E9F7B2@smartserver.smartshare.dk>
On 2024-10-09 23:25, Morten Brørup wrote:
>> +#if defined(RTE_USE_CC_MEMCPY) && defined(RTE_ARCH_X86_64)
>> +static __rte_always_inline void
>> +pktcpy(void *restrict in_dst, const void *restrict in_src, size_t len)
>> +{
>
> A comment describing why batch_copy_elem.dst and src point to 16 byte aligned data would be nice.
>
Good point. As I think I mentioned earlier, I'm not sure they are.
From what I recall, having the data 16-byte aligned (or pretending it is)
does give a noticeable performance increase on x86_64.
Is this something I should look into for 24.11, or is this patch set not
going to make it anyway?
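
One way to settle the alignment question would be to test the pointers at run time and hand the compiler the 16-byte promise only when it actually holds. The following is a minimal sketch and is not part of the patch: ptr_is_aligned() and pktcpy_checked() are hypothetical names, plain memcpy() stands in for the fallback, and __builtin_assume_aligned() assumes GCC or Clang. The extra branch may well cost some of the gain, so this is intended for investigation, not as a proposed replacement.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: true if ptr is a multiple of align (a power of two). */
static inline bool
ptr_is_aligned(const void *ptr, size_t align)
{
	return ((uintptr_t)ptr & (align - 1)) == 0;
}

/*
 * Hypothetical variant of pktcpy(): take the 32-byte-chunk path only
 * for short copies where both pointers really are 16-byte aligned,
 * and fall back to a plain memcpy() otherwise.
 */
static inline void
pktcpy_checked(void *in_dst, const void *in_src, size_t len)
{
	if (len <= 256 && ptr_is_aligned(in_dst, 16) &&
	    ptr_is_aligned(in_src, 16)) {
		unsigned char *dst = __builtin_assume_aligned(in_dst, 16);
		const unsigned char *src = __builtin_assume_aligned(in_src, 16);
		size_t left;

		/* Copy in fixed 32-byte chunks, then the remaining tail. */
		for (left = len; left >= 32; left -= 32) {
			memcpy(dst, src, 32);
			dst += 32;
			src += 32;
		}

		memcpy(dst, src, left);
	} else
		memcpy(in_dst, in_src, len);
}
```

In a debug build, replacing the run-time check with assert(ptr_is_aligned(in_dst, 16) && ptr_is_aligned(in_src, 16)) at the top of the original pktcpy() would show quickly whether batch_copy_elem.dst and src are ever misaligned in practice.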
>> + void *dst = __builtin_assume_aligned(in_dst, 16);
>> + const void *src = __builtin_assume_aligned(in_src, 16);
>> +
>> + if (len <= 256) {
>> + size_t left;
>> +
>> + for (left = len; left >= 32; left -= 32) {
>> + memcpy(dst, src, 32);
>> + dst = RTE_PTR_ADD(dst, 32);
>> + src = RTE_PTR_ADD(src, 32);
>> + }
>> +
>> + memcpy(dst, src, left);
>> + } else
>> + memcpy(dst, src, len);
>> +}
>> +#else
>> +static __rte_always_inline void
>> +pktcpy(void *dst, const void *src, size_t len)
>> +{
>> + rte_memcpy(dst, src, len);
>> +}
>> +#endif
>> +
>> static inline void
>> do_data_copy_enqueue(struct virtio_net *dev, struct vhost_virtqueue
>> *vq)
>> __rte_shared_locks_required(&vq->iotlb_lock)
>> @@ -240,7 +273,7 @@ do_data_copy_enqueue(struct virtio_net *dev, struct
>> vhost_virtqueue *vq)
>> int i;
>>
>> for (i = 0; i < count; i++) {
>> - rte_memcpy(elem[i].dst, elem[i].src, elem[i].len);
>> + pktcpy(elem[i].dst, elem[i].src, elem[i].len);
>> vhost_log_cache_write_iova(dev, vq, elem[i].log_addr,
>> elem[i].len);
>> PRINT_PACKET(dev, (uintptr_t)elem[i].dst, elem[i].len, 0);
>> @@ -257,7 +290,7 @@ do_data_copy_dequeue(struct vhost_virtqueue *vq)
>> int i;
>>
>> for (i = 0; i < count; i++)
>> - rte_memcpy(elem[i].dst, elem[i].src, elem[i].len);
>> + pktcpy(elem[i].dst, elem[i].src, elem[i].len);
>>
>> vq->batch_copy_nb_elems = 0;
>> }
>> --
>> 2.43.0
>
> Anyway,
> Acked-by: Morten Brørup <mb@smartsharesystems.com>
>