Caution: This message originated from an External Source. Use proper caution when opening attachments, clicking links, or responding. On Wed, 21 Aug 2024 20:08:55 +0530 Vipin Varghese <vipin.varghese@amd.com> wrote:diff --git a/app/test-pmd/macswap_sse.h b/app/test-pmd/macswap_sse.h index 223f87a539..29088843b7 100644 --- a/app/test-pmd/macswap_sse.h +++ b/app/test-pmd/macswap_sse.h @@ -16,13 +16,13 @@ do_macswap(struct rte_mbuf *pkts[], uint16_t nb, uint64_t ol_flags; int i; int r; - __m128i addr0, addr1, addr2, addr3; + register __m128i addr0, addr1, addr2, addr3;Some compilers treat register as a no-op. Are you sure? Did you check with godbolt.
Thank you Stephen, I have tested the code changes on Linux using GCC and Clang compiler.
In both cases in Linux environment, we have seen the the values loaded onto register `xmm`.
```
Both cases we have performance improvement.
Can you please help us understand if we have missed out
something?