From mboxrd@z Thu Jan 1 00:00:00 1970
From: "Ananyev, Konstantin"
To: Olivier Matz, "dev@dpdk.org"
Thread-Topic: [dpdk-dev] [PATCH] atomic: clarify use of memory barriers
Date: Tue, 20 May 2014 10:05:11 +0000
Message-ID: <2601191342CEEE43887BDE71AB9772580EFA776F@IRSMSX105.ger.corp.intel.com>
References: <1400578588-21137-1-git-send-email-olivier.matz@6wind.com>
In-Reply-To: <1400578588-21137-1-git-send-email-olivier.matz@6wind.com>
Content-Type: text/plain; charset="us-ascii"
Subject: Re: [dpdk-dev] [PATCH] atomic: clarify use of memory barriers
List-Id: patches and discussions about DPDK

Hi Olivier,

> Note that on x86 CPUs, memory barriers between different cores can be
> guaranteed by a
> simple compiler barrier.

I don't think this is totally correct. Yes, for Intel CPUs a memory
barrier can be avoided in many cases, thanks to their nearly strict
memory ordering. There are, however, a few cases where reordering is
possible, and there fence instructions would be needed.

To me:

+#define rte_smp_rmb() rte_compiler_barrier()

seems a bit misleading, as there is no real fence. So I suggest we keep
the rte_compiler_barrier() naming and usage.

Thanks
Konstantin

-----Original Message-----
From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Olivier Matz
Sent: Tuesday, May 20, 2014 10:36 AM
To: dev@dpdk.org
Subject: [dpdk-dev] [PATCH] atomic: clarify use of memory barriers

This commit introduces rte_smp_mb(), rte_smp_wmb() and rte_smp_rmb(), in
order to differentiate the memory barriers used between lcores from the
memory barriers used between an lcore and a device.

The patch does not provide any functional change; the goal is to make the
calls more explicit and the code more readable.

Note that on x86 CPUs, memory barriers between different cores can be
guaranteed by a simple compiler barrier. If the memory is also accessed
by a device, a real memory barrier instruction has to be used.
Signed-off-by: Olivier Matz
---
 app/test/test_mbuf.c                       |  2 +-
 lib/librte_eal/bsdapp/eal/eal_thread.c     |  2 +-
 lib/librte_eal/common/eal_common_launch.c  |  2 +-
 lib/librte_eal/common/include/rte_atomic.h | 24 ++++++++++++++++++++++++
 lib/librte_eal/linuxapp/eal/eal_thread.c   |  2 +-
 lib/librte_pmd_virtio/virtqueue.h          |  6 +++---
 lib/librte_pmd_xenvirt/rte_eth_xenvirt.c   |  4 ++--
 lib/librte_pmd_xenvirt/virtqueue.h         |  2 +-
 lib/librte_ring/rte_ring.h                 |  8 ++++----
 lib/librte_timer/rte_timer.c               |  8 ++++----
 10 files changed, 42 insertions(+), 18 deletions(-)

diff --git a/app/test/test_mbuf.c b/app/test/test_mbuf.c
index ba2f2d9..7945069 100644
--- a/app/test/test_mbuf.c
+++ b/app/test/test_mbuf.c
@@ -603,7 +603,7 @@ test_refcnt_master(void)
 		test_refcnt_iter(lcore, i);
 
 	refcnt_stop_slaves = 1;
-	rte_wmb();
+	rte_smp_wmb();
 
 	printf("%s finished at lcore %u\n", __func__, lcore);
 	return (0);
diff --git a/lib/librte_eal/bsdapp/eal/eal_thread.c b/lib/librte_eal/bsdapp/eal/eal_thread.c
index d2bec2e..00e1647 100644
--- a/lib/librte_eal/bsdapp/eal/eal_thread.c
+++ b/lib/librte_eal/bsdapp/eal/eal_thread.c
@@ -223,7 +223,7 @@ eal_thread_loop(__attribute__((unused)) void *arg)
 		fct_arg = lcore_config[lcore_id].arg;
 		ret = lcore_config[lcore_id].f(fct_arg);
 		lcore_config[lcore_id].ret = ret;
-		rte_wmb();
+		rte_smp_wmb();
 		lcore_config[lcore_id].state = FINISHED;
 	}
 
diff --git a/lib/librte_eal/common/eal_common_launch.c b/lib/librte_eal/common/eal_common_launch.c
index 6f7c696..a3cc282 100644
--- a/lib/librte_eal/common/eal_common_launch.c
+++ b/lib/librte_eal/common/eal_common_launch.c
@@ -57,7 +57,7 @@ rte_eal_wait_lcore(unsigned slave_id)
 	while (lcore_config[slave_id].state != WAIT &&
 	       lcore_config[slave_id].state != FINISHED);
 
-	rte_rmb();
+	rte_smp_rmb();
 
 	/* we are in finished state, go to wait state */
 	lcore_config[slave_id].state = WAIT;
diff --git a/lib/librte_eal/common/include/rte_atomic.h b/lib/librte_eal/common/include/rte_atomic.h
index ef95160..4056e53 100644
--- a/lib/librte_eal/common/include/rte_atomic.h
+++ b/lib/librte_eal/common/include/rte_atomic.h
@@ -82,6 +82,30 @@ extern "C" {
 #define rte_rmb() _mm_lfence()
 
 /**
+ * General memory barrier between 2 lcores.
+ *
+ * On Intel CPUs, it's already guaranteed by the hardware so only a
+ * compiler barrier is needed.
+ */
+#define rte_smp_mb() rte_compiler_barrier()
+
+/**
+ * Write memory barrier between 2 lcores.
+ *
+ * On Intel CPUs, it's already guaranteed by the hardware so only a
+ * compiler barrier is needed.
+ */
+#define rte_smp_wmb() rte_compiler_barrier()
+
+/**
+ * Read memory barrier between 2 lcores.
+ *
+ * On Intel CPUs, it's already guaranteed by the hardware so only a
+ * compiler barrier is needed.
+ */
+#define rte_smp_rmb() rte_compiler_barrier()
+
+/**
  * Compiler barrier.
  *
  * Guarantees that operation reordering does not occur at compile time
diff --git a/lib/librte_eal/linuxapp/eal/eal_thread.c b/lib/librte_eal/linuxapp/eal/eal_thread.c
index bf77873..2d839e6 100644
--- a/lib/librte_eal/linuxapp/eal/eal_thread.c
+++ b/lib/librte_eal/linuxapp/eal/eal_thread.c
@@ -223,7 +223,7 @@ eal_thread_loop(__attribute__((unused)) void *arg)
 		fct_arg = lcore_config[lcore_id].arg;
 		ret = lcore_config[lcore_id].f(fct_arg);
 		lcore_config[lcore_id].ret = ret;
-		rte_wmb();
+		rte_smp_wmb();
 		lcore_config[lcore_id].state = FINISHED;
 	}
 
diff --git a/lib/librte_pmd_virtio/virtqueue.h b/lib/librte_pmd_virtio/virtqueue.h
index 11f908c..f45dd47 100644
--- a/lib/librte_pmd_virtio/virtqueue.h
+++ b/lib/librte_pmd_virtio/virtqueue.h
@@ -46,9 +46,9 @@
 #include "virtio_ring.h"
 #include "virtio_logs.h"
 
-#define mb() rte_mb()
-#define wmb() rte_wmb()
-#define rmb() rte_rmb()
+#define mb() rte_smp_mb()
+#define wmb() rte_smp_wmb()
+#define rmb() rte_smp_rmb()
 
 #ifdef RTE_PMD_PACKET_PREFETCH
 #define rte_packet_prefetch(p) rte_prefetch1(p)
diff --git a/lib/librte_pmd_xenvirt/rte_eth_xenvirt.c b/lib/librte_pmd_xenvirt/rte_eth_xenvirt.c
index 4f8b712..7ebde3b 100644
--- a/lib/librte_pmd_xenvirt/rte_eth_xenvirt.c
+++ b/lib/librte_pmd_xenvirt/rte_eth_xenvirt.c
@@ -98,7 +98,7 @@ eth_xenvirt_rx(void *q, struct rte_mbuf **rx_pkts, uint16_t nb_pkts)
 
 	nb_used = VIRTQUEUE_NUSED(rxvq);
 
-	rte_compiler_barrier(); /* rmb */
+	rte_smp_rmb();
 	num = (uint16_t)(likely(nb_used <= nb_pkts) ? nb_used : nb_pkts);
 	num = (uint16_t)(likely(num <= VIRTIO_MBUF_BURST_SZ) ? num : VIRTIO_MBUF_BURST_SZ);
 	if (unlikely(num == 0)) return 0;
@@ -149,7 +149,7 @@ eth_xenvirt_tx(void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t nb_pkts)
 	PMD_TX_LOG(DEBUG, "%d packets to xmit", nb_pkts);
 	nb_used = VIRTQUEUE_NUSED(txvq);
 
-	rte_compiler_barrier(); /* rmb */
+	rte_smp_rmb();
 
 	num = (uint16_t)(likely(nb_used <= VIRTIO_MBUF_BURST_SZ) ? nb_used : VIRTIO_MBUF_BURST_SZ);
 	num = virtqueue_dequeue_burst(txvq, snd_pkts, len, num);
diff --git a/lib/librte_pmd_xenvirt/virtqueue.h b/lib/librte_pmd_xenvirt/virtqueue.h
index f36030f..59b4469 100644
--- a/lib/librte_pmd_xenvirt/virtqueue.h
+++ b/lib/librte_pmd_xenvirt/virtqueue.h
@@ -150,7 +150,7 @@ vq_ring_update_avail(struct virtqueue *vq, uint16_t desc_idx)
  */
 	avail_idx = (uint16_t)(vq->vq_ring.avail->idx & (vq->vq_nentries - 1));
 	vq->vq_ring.avail->ring[avail_idx] = desc_idx;
-	rte_compiler_barrier(); /* wmb , for IA memory model barrier is enough*/
+	rte_smp_wmb();
 	vq->vq_ring.avail->idx++;
 }
 
diff --git a/lib/librte_ring/rte_ring.h b/lib/librte_ring/rte_ring.h
index da54e34..1cbf8a9 100644
--- a/lib/librte_ring/rte_ring.h
+++ b/lib/librte_ring/rte_ring.h
@@ -451,7 +451,7 @@ __rte_ring_mp_do_enqueue(struct rte_ring *r, void * const *obj_table,
 
 	/* write entries in ring */
 	ENQUEUE_PTRS();
-	rte_compiler_barrier();
+	rte_smp_wmb();
 
 	/* if we exceed the watermark */
 	if (unlikely(((mask + 1) - free_entries + n) > r->prod.watermark)) {
@@ -537,7 +537,7 @@ __rte_ring_sp_do_enqueue(struct rte_ring *r, void * const *obj_table,
 
 	/* write entries in ring */
 	ENQUEUE_PTRS();
-	rte_compiler_barrier();
+	rte_smp_wmb();
 
 	/* if we exceed the watermark */
 	if (unlikely(((mask + 1) - free_entries + n) > r->prod.watermark)) {
@@ -628,7 +628,7 @@ __rte_ring_mc_do_dequeue(struct rte_ring *r, void **obj_table,
 
 	/* copy in table */
 	DEQUEUE_PTRS();
-	rte_compiler_barrier();
+	rte_smp_rmb();
 
 	/*
 	 * If there are other dequeues in progress that preceded us,
@@ -703,7 +703,7 @@ __rte_ring_sc_do_dequeue(struct rte_ring *r, void **obj_table,
 
 	/* copy in table */
 	DEQUEUE_PTRS();
-	rte_compiler_barrier();
+	rte_smp_rmb();
 
 	__RING_STAT_ADD(r, deq_success, n);
 	r->cons.tail = cons_next;
diff --git a/lib/librte_timer/rte_timer.c b/lib/librte_timer/rte_timer.c
index 884ee0e..520360e 100755
--- a/lib/librte_timer/rte_timer.c
+++ b/lib/librte_timer/rte_timer.c
@@ -396,7 +396,7 @@ __rte_timer_reset(struct rte_timer *tim, uint64_t expire,
 
 	/* update state: as we are in CONFIG state, only us can modify
	 * the state so we don't need to use cmpset() here */
-	rte_wmb();
+	rte_smp_wmb();
 	status.state = RTE_TIMER_PENDING;
 	status.owner = (int16_t)tim_lcore;
 	tim->status.u32 = status.u32;
@@ -462,7 +462,7 @@ rte_timer_stop(struct rte_timer *tim)
 	}
 
 	/* mark timer as stopped */
-	rte_wmb();
+	rte_smp_wmb();
 	status.state = RTE_TIMER_STOP;
 	status.owner = RTE_TIMER_NO_OWNER;
 	tim->status.u32 = status.u32;
@@ -558,14 +558,14 @@ void rte_timer_manage(void)
 			__TIMER_STAT_ADD(pending, -1);
 			status.state = RTE_TIMER_STOP;
 			status.owner = RTE_TIMER_NO_OWNER;
-			rte_wmb();
+			rte_smp_wmb();
 			tim->status.u32 = status.u32;
 		}
 		else {
 			/* keep it in list and mark timer as pending */
 			status.state = RTE_TIMER_PENDING;
 			status.owner = (int16_t)lcore_id;
-			rte_wmb();
+			rte_smp_wmb();
 			tim->status.u32 = status.u32;
 			__rte_timer_reset(tim, cur_time + tim->period,
				tim->period, lcore_id, tim->f, tim->arg, 1);
-- 
1.9.2