From: fengchengwen <fengchengwen@huawei.com>
To: "Morten Brørup" <mb@smartsharesystems.com>,
andrew.rybchenko@oktetlabs.ru
Cc: <dev@dpdk.org>
Subject: Re: [PATCH v2] mempool: test performance with larger bursts
Date: Wed, 24 Jan 2024 10:41:40 +0800 [thread overview]
Message-ID: <6f36e290-1cf5-144c-3886-6016907b7b25@huawei.com> (raw)
In-Reply-To: <20240122143445.45276-1-mb@smartsharesystems.com>

Hi Morten,

On 2024/1/22 22:34, Morten Brørup wrote:
> Bursts of up to 64 or 128 packets are not uncommon, so increase the
> maximum tested get and put burst sizes from 32 to 128.
>
> Some applications keep more than 512 objects, so increase the maximum
> number of kept objects from 512 to 8192, still in jumps of factor four.
> This exceeds the typical mempool cache size of 512 objects, so the test
> also exercises the mempool driver.
>
> Signed-off-by: Morten Brørup <mb@smartsharesystems.com>
>
> ---
>
> v2: Addressed feedback by Chengwen Feng
> * Added get and put burst sizes of 64 packets, which is probably also not
> uncommon.
> * Fixed list of number of kept objects so list remains in jumps of factor
> four.
> * Added three derivative test cases, for faster testing.
> ---
> app/test/test_mempool_perf.c | 107 ++++++++++++++++++++---------------
> 1 file changed, 62 insertions(+), 45 deletions(-)
>
> diff --git a/app/test/test_mempool_perf.c b/app/test/test_mempool_perf.c
> index 96de347f04..a5a7d43608 100644
> --- a/app/test/test_mempool_perf.c
> +++ b/app/test/test_mempool_perf.c
> @@ -1,6 +1,6 @@
> /* SPDX-License-Identifier: BSD-3-Clause
> * Copyright(c) 2010-2014 Intel Corporation
> - * Copyright(c) 2022 SmartShare Systems
> + * Copyright(c) 2022-2024 SmartShare Systems
> */
>
> #include <string.h>
> @@ -54,22 +54,24 @@
> *
> * - Bulk size (*n_get_bulk*, *n_put_bulk*)
> *
> - * - Bulk get from 1 to 32
> - * - Bulk put from 1 to 32
> - * - Bulk get and put from 1 to 32, compile time constant
> + * - Bulk get from 1 to 128
> + * - Bulk put from 1 to 128
> + * - Bulk get and put from 1 to 128, compile time constant
> *
> * - Number of kept objects (*n_keep*)
> *
> * - 32
> * - 128
> * - 512
> + * - 2048
> + * - 8192
> */
>
> #define N 65536
> #define TIME_S 5
> #define MEMPOOL_ELT_SIZE 2048
> -#define MAX_KEEP 512
> -#define MEMPOOL_SIZE ((rte_lcore_count()*(MAX_KEEP+RTE_MEMPOOL_CACHE_MAX_SIZE))-1)
> +#define MAX_KEEP 8192
> +#define MEMPOOL_SIZE ((rte_lcore_count()*(MAX_KEEP+RTE_MEMPOOL_CACHE_MAX_SIZE*2))-1)
>
> /* Number of pointers fitting into one cache line. */
> #define CACHE_LINE_BURST (RTE_CACHE_LINE_SIZE / sizeof(uintptr_t))
> @@ -204,6 +206,10 @@ per_lcore_mempool_test(void *arg)
> CACHE_LINE_BURST, CACHE_LINE_BURST);
> else if (n_get_bulk == 32)
> ret = test_loop(mp, cache, n_keep, 32, 32);
> + else if (n_get_bulk == 64)
> + ret = test_loop(mp, cache, n_keep, 64, 64);
> + else if (n_get_bulk == 128)
> + ret = test_loop(mp, cache, n_keep, 128, 128);
> else
> ret = -1;
>
> @@ -289,9 +295,9 @@ launch_cores(struct rte_mempool *mp, unsigned int cores)
> static int
> do_one_mempool_test(struct rte_mempool *mp, unsigned int cores)
> {
> - unsigned int bulk_tab_get[] = { 1, 4, CACHE_LINE_BURST, 32, 0 };
> - unsigned int bulk_tab_put[] = { 1, 4, CACHE_LINE_BURST, 32, 0 };
> - unsigned int keep_tab[] = { 32, 128, 512, 0 };
> + unsigned int bulk_tab_get[] = { 1, 4, CACHE_LINE_BURST, 32, 64, 128, 0 };
> + unsigned int bulk_tab_put[] = { 1, 4, CACHE_LINE_BURST, 32, 64, 128, 0 };
> + unsigned int keep_tab[] = { 32, 128, 512, 2048, 8192, 0 };
> unsigned *get_bulk_ptr;
> unsigned *put_bulk_ptr;
> unsigned *keep_ptr;
> @@ -301,6 +307,9 @@ do_one_mempool_test(struct rte_mempool *mp, unsigned int cores)
> for (put_bulk_ptr = bulk_tab_put; *put_bulk_ptr; put_bulk_ptr++) {
> for (keep_ptr = keep_tab; *keep_ptr; keep_ptr++) {
>
> + if (*keep_ptr < *get_bulk_ptr || *keep_ptr < *put_bulk_ptr)
> + continue;
> +
> use_constant_values = 0;
> n_get_bulk = *get_bulk_ptr;
> n_put_bulk = *put_bulk_ptr;
> @@ -323,7 +332,7 @@ do_one_mempool_test(struct rte_mempool *mp, unsigned int cores)
> }
>
> static int
> -test_mempool_perf(void)
> +do_all_mempool_perf_tests(unsigned int cores)
> {
> struct rte_mempool *mp_cache = NULL;
> struct rte_mempool *mp_nocache = NULL;
> @@ -376,65 +385,73 @@ test_mempool_perf(void)
>
> rte_mempool_obj_iter(default_pool, my_obj_init, NULL);
>
> - /* performance test with 1, 2 and max cores */
> printf("start performance test (without cache)\n");
> -
> - if (do_one_mempool_test(mp_nocache, 1) < 0)
> + if (do_one_mempool_test(mp_nocache, cores) < 0)
> goto err;
>
> - if (do_one_mempool_test(mp_nocache, 2) < 0)
> - goto err;
> -
> - if (do_one_mempool_test(mp_nocache, rte_lcore_count()) < 0)
> - goto err;
> -
> - /* performance test with 1, 2 and max cores */
> printf("start performance test for %s (without cache)\n",
> default_pool_ops);
> -
> - if (do_one_mempool_test(default_pool, 1) < 0)
> + if (do_one_mempool_test(default_pool, cores) < 0)
> goto err;
>
> - if (do_one_mempool_test(default_pool, 2) < 0)
> + printf("start performance test (with cache)\n");
> + if (do_one_mempool_test(mp_cache, cores) < 0)
> goto err;
>
> - if (do_one_mempool_test(default_pool, rte_lcore_count()) < 0)
> + printf("start performance test (with user-owned cache)\n");
> + use_external_cache = 1;
This variable should be reset to zero after the next test, because the test command may be executed repeatedly, and a stale flag would affect later runs.
I think the original code already has this bug; I suggest adding a separate bugfix patch first and then rebasing this commit on top of it.
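Something like the following (an untested sketch, reusing only names already in this patch) is what I mean:

	use_external_cache = 1;
	if (do_one_mempool_test(mp_nocache, cores) < 0)
		goto err;
	/* reset the flag so a repeated run does not keep using the external cache */
	use_external_cache = 0;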
> + if (do_one_mempool_test(mp_nocache, cores) < 0)
> goto err;
>
> - /* performance test with 1, 2 and max cores */
> - printf("start performance test (with cache)\n");
> + rte_mempool_list_dump(stdout);
>
> - if (do_one_mempool_test(mp_cache, 1) < 0)
> - goto err;
> + ret = 0;
>
> - if (do_one_mempool_test(mp_cache, 2) < 0)
> - goto err;
> +err:
> + rte_mempool_free(mp_cache);
> + rte_mempool_free(mp_nocache);
> + rte_mempool_free(default_pool);
> + return ret;
> +}
>
> - if (do_one_mempool_test(mp_cache, rte_lcore_count()) < 0)
> - goto err;
> +static int
> +test_mempool_perf_1core(void)
> +{
> + return do_all_mempool_perf_tests(1);
> +}
>
> - /* performance test with 1, 2 and max cores */
> - printf("start performance test (with user-owned cache)\n");
> - use_external_cache = 1;
> +static int
> +test_mempool_perf_2cores(void)
> +{
> + return do_all_mempool_perf_tests(2);
> +}
>
> - if (do_one_mempool_test(mp_nocache, 1) < 0)
> - goto err;
> +static int
> +test_mempool_perf_allcores(void)
> +{
> + return do_all_mempool_perf_tests(rte_lcore_count());
> +}
>
> - if (do_one_mempool_test(mp_nocache, 2) < 0)
> - goto err;
> +static int
> +test_mempool_perf(void)
> +{
> + int ret = -1;
>
> - if (do_one_mempool_test(mp_nocache, rte_lcore_count()) < 0)
> + /* performance test with 1, 2 and max cores */
> + if (do_all_mempool_perf_tests(1) < 0)
> + goto err;
> + if (do_all_mempool_perf_tests(2) < 0)
> + goto err;
> + if (do_all_mempool_perf_tests(rte_lcore_count()) < 0)
> goto err;
> -
> - rte_mempool_list_dump(stdout);
>
> ret = 0;
>
> err:
> - rte_mempool_free(mp_cache);
> - rte_mempool_free(mp_nocache);
> - rte_mempool_free(default_pool);
> return ret;
> }
>
> REGISTER_PERF_TEST(mempool_perf_autotest, test_mempool_perf);
> +REGISTER_PERF_TEST(mempool_perf_autotest_1core, test_mempool_perf_1core);
> +REGISTER_PERF_TEST(mempool_perf_autotest_2cores, test_mempool_perf_2cores);
> +REGISTER_PERF_TEST(mempool_perf_autotest_allcores, test_mempool_perf_allcores);
I'm OK with the derivative tests split by core count.
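They can then be run individually, e.g. with something like this (assuming the usual meson build layout; adjust the dpdk-test path for your build directory):

	DPDK_TEST=mempool_perf_autotest_1core ./build/app/test/dpdk-test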
With the above bug fixed,
Acked-by: Chengwen Feng <fengchengwen@huawei.com>
Thanks
>
Thread overview: 28+ messages
2024-01-21 4:52 [PATCH] " Morten Brørup
2024-01-22 7:10 ` fengchengwen
2024-01-22 14:34 ` [PATCH v2] " Morten Brørup
2024-01-24 2:41 ` fengchengwen [this message]
2024-01-24 8:58 ` [PATCH v3] " Morten Brørup
2024-01-24 9:10 ` [PATCH v4] " Morten Brørup
2024-01-24 11:21 ` [PATCH v5] " Morten Brørup
2024-02-18 18:03 ` Thomas Monjalon
2024-02-20 13:49 ` Morten Brørup
2024-02-21 10:22 ` Thomas Monjalon
2024-02-21 10:38 ` Morten Brørup
2024-02-21 10:40 ` Bruce Richardson
2024-02-20 14:01 ` [PATCH v6] " Morten Brørup
2024-03-02 20:04 ` [PATCH v7] " Morten Brørup
2024-04-04 9:26 ` Morten Brørup
2024-06-10 8:56 ` Morten Brørup
2024-06-18 13:21 ` Bruce Richardson
2024-06-18 13:48 ` Morten Brørup
2024-09-13 14:58 ` Morten Brørup
2024-09-16 12:40 ` Thomas Monjalon
2024-09-16 13:08 ` Morten Brørup
2024-09-16 14:04 ` Thomas Monjalon
2024-09-16 15:37 ` [PATCH v8] " Morten Brørup
2024-09-17 8:10 ` [PATCH v9] " Morten Brørup
2024-10-08 9:14 ` Morten Brørup
2024-10-11 12:50 ` David Marchand
2024-01-21 5:32 [RFC] mbuf: performance optimization Morten Brørup
2024-01-22 14:27 ` [PATCH v2] mempool: test performance with larger bursts Morten Brørup
2024-01-22 14:39 ` Morten Brørup