From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <pablo.de.lara.guarch@intel.com>
Received: from mga06.intel.com (mga06.intel.com [134.134.136.31])
 by dpdk.org (Postfix) with ESMTP id 3978B376C
 for <dev@dpdk.org>; Mon,  4 Sep 2017 16:24:16 +0200 (CEST)
Received: from orsmga003.jf.intel.com ([10.7.209.27])
 by orsmga104.jf.intel.com with ESMTP; 04 Sep 2017 07:24:15 -0700
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="5.41,475,1498546800"; d="scan'208";a="1010799513"
Received: from irsmsx151.ger.corp.intel.com ([163.33.192.59])
 by orsmga003.jf.intel.com with ESMTP; 04 Sep 2017 07:24:14 -0700
Received: from irsmsx108.ger.corp.intel.com ([169.254.11.167]) by
 IRSMSX151.ger.corp.intel.com ([169.254.4.108]) with mapi id 14.03.0319.002;
 Mon, 4 Sep 2017 15:24:13 +0100
From: "De Lara Guarch, Pablo" <pablo.de.lara.guarch@intel.com>
To: "Burakov, Anatoly" <anatoly.burakov@intel.com>, "dev@dpdk.org"
 <dev@dpdk.org>
CC: "Doherty, Declan" <declan.doherty@intel.com>
Thread-Topic: [PATCH 3/4] test-crypto-perf: add new PMD benchmarking mode
Thread-Index: AQHTHMaYPcI/KQyq4kerzQTaAC0dPKKkzOFA
Date: Mon, 4 Sep 2017 14:24:12 +0000
Message-ID: <E115CCD9D858EF4F90C690B0DCB4D8976CC153B1@IRSMSX108.ger.corp.intel.com>
References: <cover.1503566892.git.anatoly.burakov@intel.com>
 <74e5903a777894da0c170bfdb7e6b1e425503342.1503566892.git.anatoly.burakov@intel.com>
In-Reply-To: <74e5903a777894da0c170bfdb7e6b1e425503342.1503566892.git.anatoly.burakov@intel.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
dlp-product: dlpe-windows
dlp-version: 11.0.0.116
dlp-reaction: no-action
x-originating-ip: [163.33.239.181]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Subject: Re: [dpdk-dev] [PATCH 3/4] test-crypto-perf: add new PMD
	benchmarking mode
X-BeenThere: dev@dpdk.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: DPDK patches and discussions <dev.dpdk.org>
List-Unsubscribe: <http://dpdk.org/ml/options/dev>,
 <mailto:dev-request@dpdk.org?subject=unsubscribe>
List-Archive: <http://dpdk.org/ml/archives/dev/>
List-Post: <mailto:dev@dpdk.org>
List-Help: <mailto:dev-request@dpdk.org?subject=help>
List-Subscribe: <http://dpdk.org/ml/listinfo/dev>,
 <mailto:dev-request@dpdk.org?subject=subscribe>
X-List-Received-Date: Mon, 04 Sep 2017 14:24:16 -0000


> -----Original Message-----
> From: Burakov, Anatoly
> Sent: Thursday, August 24, 2017 11:48 AM
> To: dev@dpdk.org
> Cc: De Lara Guarch, Pablo <pablo.de.lara.guarch@intel.com>; Doherty,
> Declan <declan.doherty@intel.com>; Burakov, Anatoly
> <anatoly.burakov@intel.com>
> Subject: [PATCH 3/4] test-crypto-perf: add new PMD benchmarking mode
>=20
> This patch adds a new benchmarking mode, which is intended for
> microbenchmarking individual parts of the cryptodev framework,
> specifically crypto ops alloc-build-free, cryptodev PMD enqueue and
> cryptodev PMD dequeue.
>=20
> It works by first benchmarking crypto operation alloc-build-free loop (no
> enqueues/dequeues happening), and then benchmarking enqueue and
> dequeue separately, by first completely filling up the TX queue, and then
> completely draining the RX queue.
>=20
> Results are shown as cycle counts per alloc/build/free, PMD enqueue and
> PMD dequeue.
>=20
> One new test mode is added: "pmd-cyclecount"
>   (called with --ptest=3Dpmd-cyclecount)
>=20
> New command-line argument is also added:
>   --pmd-cyclecount-delay-ms: this is a pmd-cyclecount-specific parameter
>       that controls the delay between enqueue and dequeue. This is
>       useful for benchmarking hardware acceleration, as hardware may
>       not be able to keep up with enqueued packets. This parameter
>       can be increased if there are large amounts of dequeue
>       retries.
>=20
> Signed-off-by: Burakov, Anatoly <anatoly.burakov@intel.com>

Git-check-log is complaining about this tag. Actually, the author is "Anato=
ly Burakov",
but here is "Burakov, Anatoly". Looks like this is something to fix.

> ---

...

>=20
> diff --git a/app/test-crypto-perf/cperf_test_pmd_cyclecount.c b/app/test-
> crypto-perf/cperf_test_pmd_cyclecount.c
> new file mode 100644
> index 0000000..ef7fa83

...


> +void *
> +cperf_pmd_cyclecount_test_constructor(struct rte_mempool *sess_mp,
> +			uint8_t dev_id, uint16_t qp_id,
> +			const struct cperf_options *options,
> +			const struct cperf_test_vector *test_vector,
> +			const struct cperf_op_fns *op_fns)
> +{
> +	struct cperf_pmd_cyclecount_ctx *ctx =3D NULL;
> +	unsigned int mbuf_idx =3D 0;
> +	char pool_name[32] =3D "";
> +
> +	/* preallocate buffers for crypto ops as they can get quite big */
> +	size_t alloc_sz =3D sizeof(struct rte_crypto_op *) *
> +			options->nb_descriptors;
> +
> +	ctx =3D rte_malloc(NULL, sizeof(struct cperf_pmd_cyclecount_ctx), 0);
> +	if (ctx =3D=3D NULL)
> +		goto err;
> +
> +	ctx->dev_id =3D dev_id;
> +	ctx->qp_id =3D qp_id;
> +
> +	ctx->populate_ops =3D op_fns->populate_ops;
> +	ctx->options =3D options;
> +	ctx->test_vector =3D test_vector;
> +
> +	/* IV goes at the end of the cryptop operation */

Typo here: "crypto"

> +	uint16_t iv_offset =3D sizeof(struct rte_crypto_op) +
> +		sizeof(struct rte_crypto_sym_op);
> +

...

> +
> +	snprintf(pool_name, sizeof(pool_name), "cperf_op_pool_cdev_%d",
> +			dev_id);
> +
> +	uint16_t priv_size =3D test_vector->cipher_iv.length +
> +		test_vector->auth_iv.length;

Missing "+ test_vector->aead_iv.length", added in a patch at the end of the=
 previous release.

> +

...

> +/* benchmark alloc-build-free of ops */ static inline int
> +pmd_cyclecount_bench_ops(struct pmd_cyclecount_state *state,
> uint32_t cur_op,
> +			 uint16_t test_burst_size)

Use two tabs for the next line, instead of aligning with the parenthesis ab=
ove.
Look at other functions that have the same issue.

> +{
> +	uint32_t iter_ops_left =3D state->opts->total_ops - cur_op;
> +	uint32_t iter_ops_needed =3D RTE_MIN(state->opts->nb_descriptors,
> +					   iter_ops_left);
> +	uint32_t cur_iter_op;
> +
> +	for (cur_iter_op =3D 0; cur_iter_op < iter_ops_needed;
> +	     cur_iter_op +=3D test_burst_size) {

Same comment as above, about the alignment.

> +		uint32_t burst_size =3D RTE_MIN(state->opts->total_ops -
> cur_op,
> +					      test_burst_size);
> +		struct rte_crypto_op **ops =3D &state->ctx-
> >ops[cur_iter_op];
> +
> +		if (burst_size !=3D rte_crypto_op_bulk_alloc(
> +			state->ctx->crypto_op_pool,
> +			RTE_CRYPTO_OP_TYPE_SYMMETRIC,
> +			ops, burst_size))

Same as above, add an extra tab.

> +			return -1;
> +
> +		/* Setup crypto op, attach mbuf etc */
> +		(state->ctx->populate_ops)(ops,
> +				&state->ctx->mbufs_in[cur_iter_op],
> +				&state->ctx->mbufs_out[cur_iter_op],
> +				burst_size, state->ctx->sess,
> +				state->opts,
> +				state->ctx->test_vector, iv_offset);
> +
> +#ifdef CPERF_LINEARIZATION_ENABLE
> +		/* Check if source mbufs require coalescing */
> +		if (state->linearize) {
> +			uint8_t i;
> +			for (i =3D 0; i < burst_size; i++) {
> +				struct rte_mbuf *src =3D
> +						ops[i]->sym->m_src;
> +				rte_pktmbuf_linearize(src);
> +			}
> +		}
> +#endif /* CPERF_LINEARIZATION_ENABLE */
> +		rte_mempool_put_bulk(state->ctx->crypto_op_pool,
> +				(void **)ops, burst_size);
> +	}
> +
> +	return 0;
> +}
> +

...

> +
> +/* benchmark enqueue, returns number of ops enqueued */ static
> uint32_t
> +pmd_cyclecount_bench_enq(struct pmd_cyclecount_state *state,
> +			 uint32_t iter_ops_needed, uint16_t
> test_burst_size) {
> +	/* Enqueue full descriptor ring of ops on crypto device */
> +	uint32_t cur_iter_op =3D 0;
> +	while (cur_iter_op < iter_ops_needed) {
> +		uint32_t burst_size =3D
> +				RTE_MIN(iter_ops_needed - cur_iter_op,
> +					test_burst_size);
> +		struct rte_crypto_op **ops =3D
> +				&state->ctx->ops[cur_iter_op];
> +		uint32_t burst_enqd;
> +
> +		burst_enqd =3D

No need to place the function in a separate line, I think.

> +				rte_cryptodev_enqueue_burst(
> +					state->ctx->dev_id,
> +					state->ctx->qp_id,
> +					ops, burst_size);
> +
> +		/* if we couldn't enqueue anything, the queue is full */
> +		if (!burst_enqd) {
> +			/* don't try to dequeue anything we didn't enqueue
> */
> +			return cur_iter_op;
> +		}
> +
> +		if (burst_enqd < burst_size)
> +			state->ops_enq_retries++;
> +		state->ops_enqd +=3D burst_enqd;
> +		cur_iter_op +=3D burst_enqd;
> +	}
> +	return iter_ops_needed;
> +}
> +