From: Daniel Gregory <daniel.gregory@bytedance.com>
To: Stanislaw Kardach <stanislaw.kardach@gmail.com>
Cc: dev@dpdk.org, Punit Agrawal <punit.agrawal@bytedance.com>,
Liang Ma <liangma@liangbit.com>,
Pengcheng Wang <wangpengcheng.pp@bytedance.com>,
Chunsong Feng <fengchunsong@bytedance.com>,
Daniel Gregory <daniel.gregory@bytedance.com>,
Stephen Hemminger <stephen@networkplumber.org>
Subject: [PATCH v2 0/9] riscv: implement accelerated crc using zbc
Date: Fri, 12 Jul 2024 16:46:36 +0100 [thread overview]
Message-ID: <20240712154645.80622-1-daniel.gregory@bytedance.com> (raw)
In-Reply-To: <20240618174133.33457-1-daniel.gregory@bytedance.com>
The RISC-V Zbc extension adds instructions for carry-less multiplication
we can use to implement CRC in hardware. This patch set contains two new
implementations:
- one in lib/hash/rte_crc_riscv64.h that uses a Barrett reduction to
implement the four rte_hash_crc_* functions
- one in lib/net/net_crc_zbc.c that uses repeated single-folds to reduce
the buffer until it is small enough for a Barrett reduction to
implement rte_crc16_ccitt_zbc_handler and rte_crc32_eth_zbc_handler
My approach is largely based on the Intel's "Fast CRC Computation Using
PCLMULQDQ Instruction" white paper
https://www.researchgate.net/publication/263424619_Fast_CRC_computation
and a post about "Optimizing CRC32 for small payload sizes on x86"
https://mary.rs/lab/crc32/
Whether these new implementations are enabled is controlled by new
build-time and run-time detection of the RISC-V extensions present in
the compiler and on the target system.
I have carried out some performance comparisons between the generic
table implementations and the new hardware implementations. Listed below
is the number of cycles it takes to compute the CRC hash for buffers of
various sizes (as reported by rte_get_timer_cycles()). These results
were collected on a Kendryte K230 and averaged over 20 samples:
|Buffer | CRC32-ETH (lib/net) | CRC32C (lib/hash) |
|Size (MB) | Table | Hardware | Table | Hardware |
|----------|----------|----------|----------|----------|
| 1 | 155168 | 11610 | 73026 | 18385 |
| 2 | 311203 | 22998 | 145586 | 35886 |
| 3 | 466744 | 34370 | 218536 | 53939 |
| 4 | 621843 | 45536 | 291574 | 71944 |
| 5 | 777908 | 56989 | 364152 | 89706 |
| 6 | 932736 | 68023 | 437016 | 107726 |
| 7 | 1088756 | 79236 | 510197 | 125426 |
| 8 | 1243794 | 90467 | 583231 | 143614 |
These results suggest a speed-up of lib/net by thirteen times, and of
lib/hash by four times.
I have also run the hash_functions_autotest benchmark in dpdk_test,
which measures the performance of the lib/hash implementation on small
buffers, getting the following times:
| Key Length | Time (ticks/op) |
| (bytes) | Table | Hardware |
|------------|----------|----------|
| 1 | 0.47 | 0.85 |
| 2 | 0.57 | 0.87 |
| 4 | 0.99 | 0.88 |
| 8 | 1.35 | 0.88 |
| 9 | 1.20 | 1.09 |
| 13 | 1.76 | 1.35 |
| 16 | 1.87 | 1.02 |
| 32 | 2.96 | 0.98 |
| 37 | 3.35 | 1.45 |
| 40 | 3.49 | 1.12 |
| 48 | 4.02 | 1.25 |
| 64 | 5.08 | 1.54 |
v2:
- replace compile flag with build-time (riscv extension macros) and
run-time detection (linux hwprobe syscall) (Stephen Hemminger)
- add qemu target that supports zbc (Stanislaw Kardach)
- fix spelling error in commit message
- fix a bug in the net/ implementation that would cause segfaults on
small unaligned buffers
- refactor net/ implemementation to move variable declarations to top
of functions
- enable the optimisation in a couple other places optimised crc is
preferred to jhash
- l3fwd-power
- cuckoo-hash
Daniel Gregory (9):
config/riscv: detect presence of Zbc extension
hash: implement crc using riscv carryless multiply
net: implement crc using riscv carryless multiply
config/riscv: add qemu crossbuild target
examples/l3fwd: use accelerated crc on riscv
ipfrag: use accelerated crc on riscv
examples/l3fwd-power: use accelerated crc on riscv
hash/cuckoo: use accelerated crc on riscv
member: use accelerated crc on riscv
MAINTAINERS | 2 +
app/test/test_crc.c | 9 +
app/test/test_hash.c | 7 +
config/riscv/meson.build | 44 +++-
config/riscv/riscv64_qemu_linux_gcc | 17 ++
.../linux_gsg/cross_build_dpdk_for_riscv.rst | 5 +
examples/l3fwd-power/main.c | 2 +-
examples/l3fwd/l3fwd_em.c | 2 +-
lib/eal/riscv/include/rte_cpuflags.h | 2 +
lib/eal/riscv/rte_cpuflags.c | 112 +++++++---
lib/hash/meson.build | 1 +
lib/hash/rte_crc_riscv64.h | 89 ++++++++
lib/hash/rte_cuckoo_hash.c | 3 +
lib/hash/rte_hash_crc.c | 13 +-
lib/hash/rte_hash_crc.h | 6 +-
lib/ip_frag/ip_frag_internal.c | 6 +-
lib/member/rte_member.h | 2 +-
lib/net/meson.build | 4 +
lib/net/net_crc.h | 11 +
lib/net/net_crc_zbc.c | 191 ++++++++++++++++++
lib/net/rte_net_crc.c | 40 ++++
lib/net/rte_net_crc.h | 2 +
22 files changed, 529 insertions(+), 41 deletions(-)
create mode 100644 config/riscv/riscv64_qemu_linux_gcc
create mode 100644 lib/hash/rte_crc_riscv64.h
create mode 100644 lib/net/net_crc_zbc.c
--
2.39.2
next prev parent reply other threads:[~2024-07-12 15:46 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-18 17:41 [PATCH 0/5] " Daniel Gregory
2024-06-18 17:41 ` [PATCH 1/5] config/riscv: add flag for using Zbc extension Daniel Gregory
2024-06-18 20:03 ` Stephen Hemminger
2024-06-19 7:08 ` Morten Brørup
2024-06-19 14:49 ` Stephen Hemminger
2024-06-19 16:41 ` Daniel Gregory
2024-06-18 17:41 ` [PATCH 2/5] hash: implement crc using riscv carryless multiply Daniel Gregory
2024-06-18 17:41 ` [PATCH 3/5] net: " Daniel Gregory
2024-06-18 17:41 ` [PATCH 4/5] examples/l3fwd: use accelerated crc on riscv Daniel Gregory
2024-06-18 17:41 ` [PATCH 5/5] ipfrag: " Daniel Gregory
2024-07-12 15:46 ` Daniel Gregory [this message]
2024-07-12 15:46 ` [PATCH v2 1/9] config/riscv: detect presence of Zbc extension Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 2/9] hash: implement crc using riscv carryless multiply Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 3/9] net: " Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 4/9] config/riscv: add qemu crossbuild target Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 5/9] examples/l3fwd: use accelerated crc on riscv Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 6/9] ipfrag: " Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 7/9] examples/l3fwd-power: " Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 8/9] hash/cuckoo: " Daniel Gregory
2024-07-12 15:46 ` [PATCH v2 9/9] member: " Daniel Gregory
2024-07-12 17:19 ` [PATCH v2 0/9] riscv: implement accelerated crc using zbc David Marchand
2024-08-27 15:32 ` [PATCH v3 " Daniel Gregory
2024-08-27 15:32 ` [PATCH v3 1/9] config/riscv: detect presence of Zbc extension Daniel Gregory
2024-08-27 15:32 ` [PATCH v3 2/9] hash: implement CRC using riscv carryless multiply Daniel Gregory
2024-08-27 15:32 ` [PATCH v3 3/9] net: " Daniel Gregory
2024-08-27 15:32 ` [PATCH v3 4/9] config/riscv: add qemu crossbuild target Daniel Gregory
2024-08-27 15:36 ` [PATCH v3 5/9] examples/l3fwd: use accelerated CRC on riscv Daniel Gregory
2024-08-27 15:36 ` [PATCH v3 6/9] ipfrag: " Daniel Gregory
2024-08-27 15:36 ` [PATCH v3 7/9] examples/l3fwd-power: " Daniel Gregory
2024-08-27 15:36 ` [PATCH v3 8/9] hash/cuckoo: " Daniel Gregory
2024-08-27 15:36 ` [PATCH v3 9/9] member: " Daniel Gregory
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20240712154645.80622-1-daniel.gregory@bytedance.com \
--to=daniel.gregory@bytedance.com \
--cc=dev@dpdk.org \
--cc=fengchunsong@bytedance.com \
--cc=liangma@liangbit.com \
--cc=punit.agrawal@bytedance.com \
--cc=stanislaw.kardach@gmail.com \
--cc=stephen@networkplumber.org \
--cc=wangpengcheng.pp@bytedance.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).