From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from dpdk.org (dpdk.org [92.243.14.124])
	by inbox.dpdk.org (Postfix) with ESMTP id BB67EA0598;
	Fri, 10 Apr 2020 18:41:51 +0200 (CEST)
Received: from [92.243.14.124] (localhost [127.0.0.1])
	by dpdk.org (Postfix) with ESMTP id 6CF371D5D0;
	Fri, 10 Apr 2020 18:41:48 +0200 (CEST)
Received: from foss.arm.com (foss.arm.com [217.140.110.172])
	by dpdk.org (Postfix) with ESMTP id DE2F41D5CF
	for ; Fri, 10 Apr 2020 18:41:46 +0200 (CEST)
Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14])
	by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 5755930E;
	Fri, 10 Apr 2020 09:41:46 -0700 (PDT)
Received: from net-arm-thunderx2-01.shanghai.arm.com
	(net-arm-thunderx2-01.shanghai.arm.com [10.169.41.214])
	by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPA id 26E633F52E;
	Fri, 10 Apr 2020 09:41:41 -0700 (PDT)
From: Gavin Hu
To: dev@dpdk.org
Cc: nd@arm.com, david.marchand@redhat.com, thomas@monjalon.net,
	rasland@mellanox.com, drc@linux.vnet.ibm.com,
	bruce.richardson@intel.com, konstantin.ananyev@intel.com,
	matan@mellanox.com, shahafs@mellanox.com, viacheslavo@mellanox.com,
	jerinj@marvell.com, Honnappa.Nagarahalli@arm.com,
	ruifeng.wang@arm.com, phil.yang@arm.com, joyce.kong@arm.com,
	steve.capper@arm.com
Date: Sat, 11 Apr 2020 00:41:21 +0800
Message-Id: <20200410164127.54229-2-gavin.hu@arm.com>
X-Mailer: git-send-email 2.17.1
In-Reply-To: <20200410164127.54229-1-gavin.hu@arm.com>
References: <20200410164127.54229-1-gavin.hu@arm.com>
In-Reply-To: <20200213123854.203566-1-gavin.hu@arm.com>
References: <20200213123854.203566-1-gavin.hu@arm.com>
Subject: [dpdk-dev] [PATCH RFC v2 1/7] eal: introduce new class of barriers
	for DMA use cases
X-BeenThere: dev@dpdk.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: DPDK patches and discussions
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dev-bounces@dpdk.org
Sender: "dev"

In DPDK we use rte_*mb barriers to ensure that
memory accesses to DMA regions are observed before MMIO accesses to
hardware registers.

On AArch64, the rte_*mb barriers are implemented by "DSB" (Data
Synchronisation Barrier) style instructions which are the strongest
barriers possible.

Recently, however, it has been realised [1] that, for devices where the
MMIO regions are shared between all CPUs, it is possible to relax this
memory barrier.

There are cases where we wish to retain the strength of the rte_*mb
memory barriers; thus rather than relax rte_*mb we opt instead to
introduce a new class of barrier rte_dma_*mb.

For AArch64, rte_dma_*mb will be implemented by a relaxed "DMB OSH"
style of barrier.

For other architectures, we implement rte_dma_*mb as rte_*mb so this
should not result in any functional changes.

[1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=22ec71615d824f4f11d38d0e55a88d8956b7e45f

Signed-off-by: Gavin Hu
Reviewed-by: Steve Capper
---
 lib/librte_eal/arm/include/rte_atomic_32.h  |  6 ++++
 lib/librte_eal/arm/include/rte_atomic_64.h  |  6 ++++
 lib/librte_eal/include/generic/rte_atomic.h | 31 +++++++++++++++++++++
 lib/librte_eal/ppc/include/rte_atomic.h     |  6 ++++
 lib/librte_eal/x86/include/rte_atomic.h     |  6 ++++
 5 files changed, 55 insertions(+)

diff --git a/lib/librte_eal/arm/include/rte_atomic_32.h b/lib/librte_eal/arm/include/rte_atomic_32.h
index 7dc0d06d1..80208467e 100644
--- a/lib/librte_eal/arm/include/rte_atomic_32.h
+++ b/lib/librte_eal/arm/include/rte_atomic_32.h
@@ -33,6 +33,12 @@ extern "C" {
 
 #define rte_io_rmb() rte_rmb()
 
+#define rte_dma_mb() rte_mb()
+
+#define rte_dma_wmb() rte_wmb()
+
+#define rte_dma_rmb() rte_rmb()
+
 #define rte_cio_wmb() rte_wmb()
 
 #define rte_cio_rmb() rte_rmb()
diff --git a/lib/librte_eal/arm/include/rte_atomic_64.h b/lib/librte_eal/arm/include/rte_atomic_64.h
index 7b7099cdc..608726c29 100644
--- a/lib/librte_eal/arm/include/rte_atomic_64.h
+++ b/lib/librte_eal/arm/include/rte_atomic_64.h
@@ -37,6 +37,12 @@ extern "C" {
 
 #define rte_io_rmb() rte_rmb()
 
+#define rte_dma_mb() asm volatile("dmb osh" : : : "memory")
+
+#define rte_dma_wmb() asm volatile("dmb oshst" : : : "memory")
+
+#define rte_dma_rmb() asm volatile("dmb oshld" : : : "memory")
+
 #define rte_cio_wmb() asm volatile("dmb oshst" : : : "memory")
 
 #define rte_cio_rmb() asm volatile("dmb oshld" : : : "memory")
diff --git a/lib/librte_eal/include/generic/rte_atomic.h b/lib/librte_eal/include/generic/rte_atomic.h
index e6ab15a97..042264c7e 100644
--- a/lib/librte_eal/include/generic/rte_atomic.h
+++ b/lib/librte_eal/include/generic/rte_atomic.h
@@ -107,6 +107,37 @@ static inline void rte_io_wmb(void);
 static inline void rte_io_rmb(void);
 ///@}
 
+/** @name DMA Memory Barrier
+ */
+///@{
+/**
+ * Memory barrier for DMA use cases
+ *
+ * Guarantees that the LOAD and STORE operations that precede the rte_dma_mb()
+ * call are visible to CPU and I/O device that is shared between all CPUs
+ * before the LOAD and STORE operations that follow it.
+ */
+static inline void rte_dma_mb(void);
+
+/**
+ * Write memory barrier for DMA use cases
+ *
+ * Guarantees that the STORE operations that precede the rte_dma_wmb() call are
+ * visible to CPU and I/O device that is shared between all CPUs before the
+ * STORE operations that follow it.
+ */
+static inline void rte_dma_wmb(void);
+
+/**
+ * Read memory barrier for DMA use cases
+ *
+ * Guarantees that the LOAD operations that precede the rte_dma_rmb() call are
+ * visible to CPU and I/O device that is shared between all CPUs before the LOAD
+ * operations that follow it.
+ */
+static inline void rte_dma_rmb(void);
+///@}
+
 /** @name Coherent I/O Memory Barrier
  *
  * Coherent I/O memory barrier is a lightweight version of I/O memory
diff --git a/lib/librte_eal/ppc/include/rte_atomic.h b/lib/librte_eal/ppc/include/rte_atomic.h
index 7e3e13118..faa36bb76 100644
--- a/lib/librte_eal/ppc/include/rte_atomic.h
+++ b/lib/librte_eal/ppc/include/rte_atomic.h
@@ -36,6 +36,12 @@ extern "C" {
 
 #define rte_io_rmb() rte_rmb()
 
+#define rte_dma_mb() rte_mb()
+
+#define rte_dma_wmb() rte_wmb()
+
+#define rte_dma_rmb() rte_rmb()
+
 #define rte_cio_wmb() rte_wmb()
 
 #define rte_cio_rmb() rte_rmb()
diff --git a/lib/librte_eal/x86/include/rte_atomic.h b/lib/librte_eal/x86/include/rte_atomic.h
index 148398f50..0b1d452f3 100644
--- a/lib/librte_eal/x86/include/rte_atomic.h
+++ b/lib/librte_eal/x86/include/rte_atomic.h
@@ -79,6 +79,12 @@ rte_smp_mb(void)
 
 #define rte_io_rmb() rte_compiler_barrier()
 
+#define rte_dma_mb() rte_mb()
+
+#define rte_dma_wmb() rte_wmb()
+
+#define rte_dma_rmb() rte_rmb()
+
 #define rte_cio_wmb() rte_compiler_barrier()
 
 #define rte_cio_rmb() rte_compiler_barrier()
-- 
2.17.1
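
The ordering described in the commit message (stores to DMA memory must be
observed by the device before the MMIO doorbell write) can be sketched as
below. This is a minimal standalone illustration and not part of the patch.
sketch_dma_wmb(), struct tx_desc, struct tx_queue and tx_post() are
hypothetical names. The AArch64 branch copies the "dmb oshst" instruction
from the rte_dma_wmb() definition in the diff; the fallback is a
conservative C11 fence assumed here only so the sketch builds on other
targets.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the proposed rte_dma_wmb(); not a DPDK API. */
#if defined(__aarch64__)
/* Same instruction as the AArch64 rte_dma_wmb() in the patch. */
#define sketch_dma_wmb() __asm__ volatile("dmb oshst" : : : "memory")
#else
/* Assumption: a full C11 fence as a conservative substitute elsewhere. */
#define sketch_dma_wmb() atomic_thread_fence(memory_order_seq_cst)
#endif

#define RING_SIZE 8u

/* Hypothetical descriptor layout that a device would read via DMA. */
struct tx_desc {
	uint64_t addr;
	uint32_t len;
	uint32_t flags;
};

/* Hypothetical queue: descriptor ring in DMA memory plus an MMIO doorbell. */
struct tx_queue {
	struct tx_desc ring[RING_SIZE];
	volatile uint32_t *doorbell;
	uint32_t tail;
};

/*
 * Fill one descriptor, then publish the new tail through the doorbell.
 * The barrier sits between the descriptor stores and the doorbell store
 * so the descriptor contents are ordered before the doorbell write.
 * Returns the new tail value.
 */
static inline uint32_t
tx_post(struct tx_queue *q, uint64_t addr, uint32_t len)
{
	struct tx_desc *d;

	assert(q != NULL && q->doorbell != NULL);

	d = &q->ring[q->tail % RING_SIZE];
	d->addr = addr;
	d->len = len;
	d->flags = 1;
	q->tail++;

	sketch_dma_wmb();
	*q->doorbell = q->tail;

	return q->tail;
}
```

In a real driver the doorbell pointer would refer to a mapped device
register; in a unit test it can point at an ordinary variable, which
exercises the store ordering in program order only.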