patches for DPDK stable branches
 help / color / mirror / Atom feed
From: luca.boccassi@gmail.com
To: Jieqiang Wang <jieqiang.wang@arm.com>
Cc: Feifei Wang <feifei.wang2@arm.com>,
	Ruifeng Wang <ruifeng.wang@arm.com>,
	Bruce Richardson <bruce.richardson@intel.com>,
	dpdk stable <stable@dpdk.org>
Subject: patch 'hash: align SSE lookup to scalar implementation' has been queued to stable release 20.11.10
Date: Thu, 19 Oct 2023 00:59:00 +0100	[thread overview]
Message-ID: <20231018235930.3144-11-luca.boccassi@gmail.com> (raw)
In-Reply-To: <20231018235930.3144-1-luca.boccassi@gmail.com>

Hi,

FYI, your patch has been queued to stable release 20.11.10

Note it hasn't been pushed to http://dpdk.org/browse/dpdk-stable yet.
It will be pushed if I get no objections before 10/21/23. So please
shout if anyone has objections.

Also note that after the patch there's a diff of the upstream commit vs the
patch applied to the branch. This will indicate if there was any rebasing
needed to apply to the stable branch. If there were code changes for rebasing
(ie: not only metadata diffs), please double check that the rebase was
correctly done.

Queued patches are on a temporary branch at:
https://github.com/bluca/dpdk-stable

This queued commit can be viewed at:
https://github.com/bluca/dpdk-stable/commit/733bc367addec56156086b338e17d9c0f21145ea

Thanks.

Luca Boccassi

---
From 733bc367addec56156086b338e17d9c0f21145ea Mon Sep 17 00:00:00 2001
From: Jieqiang Wang <jieqiang.wang@arm.com>
Date: Sat, 7 Oct 2023 15:36:34 +0800
Subject: [PATCH] hash: align SSE lookup to scalar implementation

[ upstream commit e93bbaa72cca7ec912d756afdf10e393f9d71791 ]

__mm_cmpeq_epi16 returns 0xFFFF if the corresponding 16-bit elements are
equal. In original SSE2 implementation for function compare_signatures,
it utilizes _mm_movemask_epi8 to create mask from the MSB of each 8-bit
element, while we should only care about the MSB of lower 8-bit in each
16-bit element.

For example, if the comparison result is all equal, SSE2 path returns
0xFFFF while NEON and default scalar path return 0x5555.
Although this bug is not causing any negative effects since the caller
function solely examines the trailing zeros of each match mask, we
recommend this fix to ensure consistency with NEON and default scalar
code behaviors.

Fixes: c7d93df552c2 ("hash: use partial-key hashing")

Signed-off-by: Feifei Wang <feifei.wang2@arm.com>
Signed-off-by: Jieqiang Wang <jieqiang.wang@arm.com>
Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>
Acked-by: Bruce Richardson <bruce.richardson@intel.com>
---
 lib/librte_hash/rte_cuckoo_hash.c | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/lib/librte_hash/rte_cuckoo_hash.c b/lib/librte_hash/rte_cuckoo_hash.c
index cff4242f79..95c3e6c2f4 100644
--- a/lib/librte_hash/rte_cuckoo_hash.c
+++ b/lib/librte_hash/rte_cuckoo_hash.c
@@ -1866,11 +1866,15 @@ compare_signatures(uint32_t *prim_hash_matches, uint32_t *sec_hash_matches,
 				_mm_load_si128(
 					(__m128i const *)prim_bkt->sig_current),
 				_mm_set1_epi16(sig)));
+		/* Extract the even-index bits only */
+		*prim_hash_matches &= 0x5555;
 		/* Compare all signatures in the bucket */
 		*sec_hash_matches = _mm_movemask_epi8(_mm_cmpeq_epi16(
 				_mm_load_si128(
 					(__m128i const *)sec_bkt->sig_current),
 				_mm_set1_epi16(sig)));
+		/* Extract the even-index bits only */
+		*sec_hash_matches &= 0x5555;
 		break;
 #elif defined(__ARM_NEON)
 	case RTE_HASH_COMPARE_NEON: {
-- 
2.39.2

---
  Diff of the applied patch vs upstream commit (please double-check if non-empty:
---
--- -	2023-10-19 00:28:56.792228058 +0100
+++ 0011-hash-align-SSE-lookup-to-scalar-implementation.patch	2023-10-19 00:28:56.205804204 +0100
@@ -1 +1 @@
-From e93bbaa72cca7ec912d756afdf10e393f9d71791 Mon Sep 17 00:00:00 2001
+From 733bc367addec56156086b338e17d9c0f21145ea Mon Sep 17 00:00:00 2001
@@ -5,0 +6,2 @@
+[ upstream commit e93bbaa72cca7ec912d756afdf10e393f9d71791 ]
+
@@ -20 +21,0 @@
-Cc: stable@dpdk.org
@@ -27 +28 @@
- lib/hash/rte_cuckoo_hash.c | 4 ++++
+ lib/librte_hash/rte_cuckoo_hash.c | 4 ++++
@@ -30,5 +31,5 @@
-diff --git a/lib/hash/rte_cuckoo_hash.c b/lib/hash/rte_cuckoo_hash.c
-index d92a903bb3..19b23f2a97 100644
---- a/lib/hash/rte_cuckoo_hash.c
-+++ b/lib/hash/rte_cuckoo_hash.c
-@@ -1868,11 +1868,15 @@ compare_signatures(uint32_t *prim_hash_matches, uint32_t *sec_hash_matches,
+diff --git a/lib/librte_hash/rte_cuckoo_hash.c b/lib/librte_hash/rte_cuckoo_hash.c
+index cff4242f79..95c3e6c2f4 100644
+--- a/lib/librte_hash/rte_cuckoo_hash.c
++++ b/lib/librte_hash/rte_cuckoo_hash.c
+@@ -1866,11 +1866,15 @@ compare_signatures(uint32_t *prim_hash_matches, uint32_t *sec_hash_matches,

  parent reply	other threads:[~2023-10-19  0:00 UTC|newest]

Thread overview: 111+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-10-18 23:58 patch 'mempool: fix default ops for an empty mempool' " luca.boccassi
2023-10-18 23:58 ` patch 'rawdev: fix device class in log message' " luca.boccassi
2023-10-18 23:58 ` patch 'net/bonding: fix header for C++' " luca.boccassi
2023-10-18 23:58 ` patch 'net/hns3: fix VF default MAC modified when set failed' " luca.boccassi
2023-10-18 23:58 ` patch 'net/hns3: fix error code for multicast resource' " luca.boccassi
2023-10-18 23:58 ` patch 'net/hns3: fix flushing multicast MAC address' " luca.boccassi
2023-10-18 23:58 ` patch 'app/test: fix reference to master in bonding test' " luca.boccassi
2023-10-18 23:58 ` patch 'net/hns3: fix order in NEON Rx' " luca.boccassi
2023-10-18 23:58 ` patch 'random: initialize state for unregistered non-EAL threads' " luca.boccassi
2023-10-18 23:58 ` patch 'bus/dpaa: fix build with asserts for GCC 13' " luca.boccassi
2023-10-18 23:59 ` luca.boccassi [this message]
2023-10-18 23:59 ` patch 'net/netvsc: increase VSP response timeout to 60 seconds' " luca.boccassi
2023-10-18 23:59 ` patch 'net/txgbe: add Tx queue maximum limit' " luca.boccassi
2023-10-18 23:59 ` patch 'net/txgbe: reconfigure MAC Rx when link update' " luca.boccassi
2023-10-18 23:59 ` patch 'net/txgbe: keep link down after device close' " luca.boccassi
2023-10-18 23:59 ` patch 'net/txgbe: check process type in close operation' " luca.boccassi
2023-10-18 23:59 ` patch 'net/tap: use MAC address parse API instead of local parser' " luca.boccassi
2023-10-18 23:59 ` patch 'app/testpmd: fix help string' " luca.boccassi
2023-10-18 23:59 ` patch 'cryptodev: add missing doc for security context' " luca.boccassi
2023-10-18 23:59 ` patch 'doc: replace code blocks with includes in security guide' " luca.boccassi
2023-10-18 23:59 ` patch 'test/crypto: fix IV in some vectors' " luca.boccassi
2023-10-18 23:59 ` patch 'test/crypto: skip some synchronous tests with CPU crypto' " luca.boccassi
2023-10-19 10:22   ` Power, Ciara
2023-10-19 10:28   ` Power, Ciara
2023-10-18 23:59 ` patch 'test/crypto: fix typo in asym tests' " luca.boccassi
2023-10-18 23:59 ` patch 'bus/pci: fix device ID log' " luca.boccassi
2023-10-18 23:59 ` patch 'test/event: fix crypto null device creation' " luca.boccassi
2023-10-18 23:59 ` patch 'event/sw: remove obsolete comment' " luca.boccassi
2023-10-18 23:59 ` patch 'event/sw: fix ordering corruption with op release' " luca.boccassi
2023-10-18 23:59 ` patch 'net/i40e: fix FDIR queue receives broadcast packets' " luca.boccassi
2023-10-18 23:59 ` patch 'net/iavf: fix VLAN offload strip flag' " luca.boccassi
2023-10-18 23:59 ` patch 'net/i40e: fix buffer leak on Rx reconfiguration' " luca.boccassi
2023-10-18 23:59 ` patch 'net/iavf: fix port stats clearing' " luca.boccassi
2023-10-18 23:59 ` patch 'net/ice: fix initial link status' " luca.boccassi
2023-10-18 23:59 ` patch 'net/iavf: remove log from Tx prepare function' " luca.boccassi
2023-10-18 23:59 ` patch 'net/iavf: fix TSO with big segments' " luca.boccassi
2023-10-18 23:59 ` patch 'net/ice: remove log from Tx prepare function' " luca.boccassi
2023-10-18 23:59 ` patch 'net/ice: fix TSO with big segments' " luca.boccassi
2023-10-18 23:59 ` patch 'net/mlx5: fix leak in sysfs port name translation' " luca.boccassi
2023-10-18 23:59 ` patch 'fib: fix adding default route overwriting entire table' " luca.boccassi
2023-10-18 23:59 ` patch 'fib6: fix adding default route as first route' " luca.boccassi
2023-10-18 23:59 ` patch 'net/bonding: fix link status callback stop' " luca.boccassi
2023-10-18 23:59 ` patch 'app/procinfo: remove unnecessary rte_malloc' " luca.boccassi
2023-11-08 19:24   ` patch 'malloc: remove return from void functions' " luca.boccassi
2023-11-08 19:25     ` patch 'eventdev: fix device pointer for vdev-based devices' " luca.boccassi
2023-11-08 19:25     ` patch 'eventdev: fix missing driver names in info struct' " luca.boccassi
2023-11-08 19:25     ` patch 'net/virtio: fix missing next flag in Tx packed ring' " luca.boccassi
2023-11-08 19:25     ` patch 'net/virtio: fix link state interrupt vector setting' " luca.boccassi
2023-11-08 19:25     ` patch 'vhost: fix missing vring call check on virtqueue access' " luca.boccassi
2023-11-08 19:25     ` patch 'vhost: fix missing " luca.boccassi
2023-11-08 19:25     ` patch 'ethdev: fix function name in comment' " luca.boccassi
2023-11-08 19:25     ` patch 'app/testpmd: fix early exit from signal' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix unchecked Rx free threshold' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix crash for NEON and SVE' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix double stats for IMP and global reset' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: remove reset log in secondary' " luca.boccassi
2023-11-08 19:25     ` patch 'test/bonding: remove unreachable statement' " luca.boccassi
2023-11-08 19:25     ` patch 'test/bonding: add missing check' " luca.boccassi
2023-11-08 19:25     ` patch 'net/bonding: fix possible overrun' " luca.boccassi
2023-11-08 19:25     ` patch 'net/txgbe: add proper memory barriers in Rx' " luca.boccassi
2023-11-08 19:25     ` patch 'ethdev: fix 32-bit build with GCC 13' " luca.boccassi
2023-11-08 19:25     ` patch 'net/enic: avoid extra unlock in MTU set' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix setting DCB capability' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix some return values' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix some error logs' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: keep set/get algo key functions local' " luca.boccassi
2023-11-08 19:25     ` patch 'net/hns3: fix uninitialized hash algo value' " luca.boccassi
2023-11-08 19:25     ` patch 'net/tap: fix L4 checksum offloading' " luca.boccassi
2023-11-08 19:25     ` patch 'net/tap: fix IPv4 " luca.boccassi
2023-11-08 19:25     ` patch 'app/procinfo: fix RSS info' " luca.boccassi
2023-11-08 19:25     ` patch 'app/procinfo: adjust format of " luca.boccassi
2023-11-08 19:25     ` patch 'net/nfp: fix link status interrupt' " luca.boccassi
2023-11-08 19:25     ` patch 'net/nfp: fix DMA error after abnormal exit' " luca.boccassi
2023-11-08 19:25     ` patch 'net/tap: fix RSS for fragmented packets' " luca.boccassi
2023-11-08 19:25     ` patch 'common/mlx5: fix controller index parsing' " luca.boccassi
2023-11-08 19:25     ` patch 'net/ice: fix L1 check interval' " luca.boccassi
2023-11-08 19:25     ` patch 'net/ice: fix DCF port statistics' " luca.boccassi
2023-11-08 19:25     ` patch 'crypto/nitrox: fix panic with high number of segments' " luca.boccassi
2023-11-08 19:25     ` patch 'net/iavf: fix Tx preparation' " luca.boccassi
2023-11-08 19:25     ` patch 'net/ice: " luca.boccassi
2023-11-15 11:45       ` patch 'event/dlb2: fix name check in self-test' " luca.boccassi
2023-11-15 11:45         ` patch 'test/bbdev: fix Python script subprocess' " luca.boccassi
2023-11-15 11:45         ` patch 'test/bbdev: assert failed test for queue configure' " luca.boccassi
2023-11-15 11:45         ` patch 'event/dlb2: fix missing queue ordering capability flag' " luca.boccassi
2023-11-15 11:45         ` patch 'meter: fix RFC4115 trTCM API Doxygen' " luca.boccassi
2023-11-15 11:45         ` patch 'app/testpmd: remove useless check in TSO command' " luca.boccassi
2023-11-15 11:45         ` patch 'test/bonding: fix uninitialized RSS configuration' " luca.boccassi
2023-11-15 11:45         ` patch 'net/hns3: fix mailbox sync' " luca.boccassi
2023-11-15 11:45         ` patch 'app/testpmd: fix tunnel TSO capability check' " luca.boccassi
2023-11-15 11:45         ` patch 'app/testpmd: add explicit check for tunnel TSO' " luca.boccassi
2023-11-15 11:45         ` patch 'app/testpmd: fix tunnel TSO configuration' " luca.boccassi
2023-11-15 11:45         ` patch 'net/mlx5: fix hairpin queue unbind' " luca.boccassi
2023-11-15 11:45         ` patch 'net/mlx5: fix hairpin queue states' " luca.boccassi
2023-11-15 11:45         ` patch 'net/mlx5: fix multi-segment Tx inline data length' " luca.boccassi
2023-11-15 11:45         ` patch 'net/mlx5: zero UDP checksum over IPv4 in encapsulation' " luca.boccassi
2023-11-15 11:45         ` patch 'net/mlx5: fix MPRQ stride size check' " luca.boccassi
2023-11-23 16:17           ` patch 'net/txgbe: fix out of bound access' " luca.boccassi
2023-11-23 16:17             ` patch 'doc: update features in hns3 guide' " luca.boccassi
2023-11-23 16:17             ` patch 'doc: update versions recommendations for i40e and ice' " luca.boccassi
2023-11-23 16:17             ` patch 'examples/ipsec-secgw: fix partial overflow' " luca.boccassi
2023-11-23 16:17             ` patch 'eal/windows: fix build with recent MinGW' " luca.boccassi
2023-11-23 16:17             ` patch 'pdump: fix error number on IPC response' " luca.boccassi
2023-11-23 16:17             ` patch 'examples/ethtool: fix pause configuration' " luca.boccassi
2023-11-23 16:17             ` patch 'test/hash: fix creation error log' " luca.boccassi
2023-11-23 16:17             ` patch 'app/pipeline: add sigint handler' " luca.boccassi
2023-11-29  2:32               ` patch 'doc: remove restriction on ixgbe vector support' " luca.boccassi
2023-11-29  2:32                 ` patch 'doc: fix some ordered lists' " luca.boccassi
2023-11-29  2:32                 ` patch 'doc: remove number of commands in vDPA guide' " luca.boccassi
2023-11-29  2:32                 ` patch 'mempool: fix get function documentation' " luca.boccassi
2023-11-29  2:32                 ` patch 'mempool: clarify enqueue/dequeue ops " luca.boccassi
2023-11-29  2:32                 ` patch 'ethdev: fix ESP packet type description' " luca.boccassi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20231018235930.3144-11-luca.boccassi@gmail.com \
    --to=luca.boccassi@gmail.com \
    --cc=bruce.richardson@intel.com \
    --cc=feifei.wang2@arm.com \
    --cc=jieqiang.wang@arm.com \
    --cc=ruifeng.wang@arm.com \
    --cc=stable@dpdk.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).