From: Scott Mitchell <scott.k.mitch1@gmail.com>
To: "Morten Brørup" <mb@smartsharesystems.com>
Cc: dev@dpdk.org, stephen@networkplumber.org
Subject: Re: [PATCH v11] net: optimize raw checksum computation
Date: Fri, 9 Jan 2026 10:27:07 -0500 [thread overview]
Message-ID: <CAFn2buDULHpj-5m1TXEcp0xUfMpTA9dKqEuEC98UhDp4qW21og@mail.gmail.com> (raw)
In-Reply-To: <98CBD80474FA8B44BF855DF32C47DC35F65638@smartserver.smartshare.dk>
On Fri, Jan 9, 2026 at 4:26 AM Morten Brørup <mb@smartsharesystems.com> wrote:
>
> > Changes in v8:
> > - __rte_raw_cksum: use native pointer arithmetic instead of RTE_PTR_ADD
> > to avoid incorrect results with -O3 for UDP checksums. Also improves
> > performance due to less assembly generated with Clang.
>
> Personally, I also have observed GCC's optimizer behave as if it loses some contextual information when using RTE_PTR_ADD, and thus emitting less optimal code.
> I didn't look further into it, and thus have no data or examples to back up the claim. Which is why I haven't started a discussion about discouraging the use of RTE_PTR_ADD.
> In other words: I support this change.
Sounds good! I observed ~600 (dpdk ptr macros) vs ~500 (native c ptr
operations) TSC cycles/block in cksum_perf_autotest.
>
> > /* if length is odd, keeping it byte order independent */
> > - if (unlikely(len % 2)) {
> > + if (len & 1) {
> > uint16_t left = 0;
> > -
> > memcpy(&left, end, 1);
> > sum += left;
> > }
>
> Changing "len % 2" to "len & 1" made sense for consistency in previous versions handling 32/16/8/4/2-byte chunks before this 1-byte chunk; now it makes no difference, so consider not changing this part at all.
> Under all circumstances, don't remove the unlikely() for handling odd length in __rte_raw_cksum(). The vast majority of packets (and partial packets, e.g. headers) being checksummed are even length.
>
Sounds good. I will restore the original.
The use case that motivated these changes was software interfaces (veth)
with encapsulation requiring software checksum on inner IPv4 payloads,
where lengths may be odd/even. However, I agree that header checksums
with even lengths are the more common case and unlikely() is appropriate.
next prev parent reply other threads:[~2026-01-09 15:27 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-01-08 23:05 scott.k.mitch1
2026-01-09 0:44 ` Scott Mitchell
2026-01-09 9:26 ` Morten Brørup
2026-01-09 15:27 ` Scott Mitchell [this message]
2026-01-09 15:58 ` Morten Brørup
2026-01-09 17:23 ` Scott Mitchell
2026-01-09 22:12 ` Morten Brørup
2026-01-10 4:19 ` Scott Mitchell
2026-01-09 18:28 ` Morten Brørup
2026-01-10 3:41 ` Scott Mitchell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAFn2buDULHpj-5m1TXEcp0xUfMpTA9dKqEuEC98UhDp4qW21og@mail.gmail.com \
--to=scott.k.mitch1@gmail.com \
--cc=dev@dpdk.org \
--cc=mb@smartsharesystems.com \
--cc=stephen@networkplumber.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).