* Re: [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone
2020-04-13 9:09 [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone Christos Ricudis
@ 2020-04-13 9:45 ` Jerin Jacob
2020-04-14 11:23 ` Ananyev, Konstantin
2020-04-27 16:51 ` Pavan Nikhilesh Bhagavatula
2 siblings, 0 replies; 4+ messages in thread
From: Jerin Jacob @ 2020-04-13 9:45 UTC (permalink / raw)
To: Christos Ricudis; +Cc: Anatoly Burakov, dpdk-dev
On Mon, Apr 13, 2020 at 2:39 PM Christos Ricudis <ricudis@niometrics.com> wrote:
>
> Reserve a per-lcore 4MB memzone and allocate thread stack of EAL threads there for better NUMA locality of stack-allocated variables
It looks like a good idea to me.
Some questions/feedback.
1) It is better to get the stack size from OS through
pthread_attr_getstack() rather than DPDK defining it.
2) There is an element of security issue here as one can get the lcore
stack using rte_memzone_lookup().
Why we need to use memzone, just rte_malloc_socket() is enough here.
Right? This will avoid the security issue.
3) Need to handle EAL --no-huge case as well.
>
> Signed-off-by: Christos Ricudis <ricudis@niometrics.com>
> ---
> lib/librte_eal/linux/eal.c | 20 ++++++++++++++++++++
> 1 file changed, 20 insertions(+)
>
> diff --git a/lib/librte_eal/linux/eal.c b/lib/librte_eal/linux/eal.c
> index 9530ee5..e047107 100644
> --- a/lib/librte_eal/linux/eal.c
> +++ b/lib/librte_eal/linux/eal.c
> @@ -68,6 +68,8 @@
>
> #define KERNEL_IOMMU_GROUPS_PATH "/sys/kernel/iommu_groups"
>
> +#define THREAD_STACK_SIZE_DEFAULT (4ULL * 1024ULL * 1024ULL)
> +#include <rte_memzone.h>
> /* Allow the application to print its usage message too if set */
> static rte_usage_hook_t rte_application_usage_hook = NULL;
>
> @@ -1224,6 +1226,24 @@ static void rte_eal_init_alert(const char *msg)
>
> lcore_config[i].state = WAIT;
>
> + pthread_attr_t attr;
> + pthread_attr_init(&attr);
> + size_t thread_stack_size = THREAD_STACK_SIZE_DEFAULT;
> + char thread_stack_name[64];
> + snprintf(thread_stack_name, sizeof thread_stack_name, "rte:lcore:%s:%d:threadstack", rte_eal_process_type() == RTE_PROC_PRIMARY ? "p" : "s", i);
> + const struct rte_memzone *mz = rte_memzone_lookup(thread_stack_name);
> + if (mz == NULL) {
> + if ((mz = rte_memzone_reserve(thread_stack_name, thread_stack_size, lcore_config[i].socket_id, 0)) == NULL) {
> + rte_panic("Cannot allocate memzone for thread stack");
> + }
> + }
> + void *thread_stack = mz->addr;
> +
> + if (pthread_attr_setstack(&attr, thread_stack, thread_stack_size) < 0) {
> + rte_panic("Cannot set thread stack\n");
> + }
> + RTE_LOG(DEBUG, EAL, "Thread stack for lcore %d on socket %d set to %p\n", i, lcore_config[i].socket_id, thread_stack);
> +
> /* create a thread for each lcore */
> ret = pthread_create(&lcore_config[i].thread_id, NULL,
> eal_thread_loop, NULL);
> --
> 1.8.3.1
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone
2020-04-13 9:09 [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone Christos Ricudis
2020-04-13 9:45 ` Jerin Jacob
@ 2020-04-14 11:23 ` Ananyev, Konstantin
2020-04-27 16:51 ` Pavan Nikhilesh Bhagavatula
2 siblings, 0 replies; 4+ messages in thread
From: Ananyev, Konstantin @ 2020-04-14 11:23 UTC (permalink / raw)
To: Christos Ricudis, Burakov, Anatoly; +Cc: dev
Hi,
>
> Reserve a per-lcore 4MB memzone and allocate thread stack of EAL threads there for better NUMA locality of stack-allocated variables
I wonder if there any real performance improvement seen with that change?
Any case (existing DPDK app/example) that can demonstrate it?
Konstantin
>
> Signed-off-by: Christos Ricudis <ricudis@niometrics.com>
> ---
> lib/librte_eal/linux/eal.c | 20 ++++++++++++++++++++
> 1 file changed, 20 insertions(+)
>
> diff --git a/lib/librte_eal/linux/eal.c b/lib/librte_eal/linux/eal.c
> index 9530ee5..e047107 100644
> --- a/lib/librte_eal/linux/eal.c
> +++ b/lib/librte_eal/linux/eal.c
> @@ -68,6 +68,8 @@
>
> #define KERNEL_IOMMU_GROUPS_PATH "/sys/kernel/iommu_groups"
>
> +#define THREAD_STACK_SIZE_DEFAULT (4ULL * 1024ULL * 1024ULL)
> +#include <rte_memzone.h>
> /* Allow the application to print its usage message too if set */
> static rte_usage_hook_t rte_application_usage_hook = NULL;
>
> @@ -1224,6 +1226,24 @@ static void rte_eal_init_alert(const char *msg)
>
> lcore_config[i].state = WAIT;
>
> + pthread_attr_t attr;
> + pthread_attr_init(&attr);
> + size_t thread_stack_size = THREAD_STACK_SIZE_DEFAULT;
> + char thread_stack_name[64];
> + snprintf(thread_stack_name, sizeof thread_stack_name, "rte:lcore:%s:%d:threadstack", rte_eal_process_type() ==
> RTE_PROC_PRIMARY ? "p" : "s", i);
> + const struct rte_memzone *mz = rte_memzone_lookup(thread_stack_name);
> + if (mz == NULL) {
> + if ((mz = rte_memzone_reserve(thread_stack_name, thread_stack_size, lcore_config[i].socket_id, 0)) == NULL) {
> + rte_panic("Cannot allocate memzone for thread stack");
> + }
> + }
> + void *thread_stack = mz->addr;
> +
> + if (pthread_attr_setstack(&attr, thread_stack, thread_stack_size) < 0) {
> + rte_panic("Cannot set thread stack\n");
> + }
> + RTE_LOG(DEBUG, EAL, "Thread stack for lcore %d on socket %d set to %p\n", i, lcore_config[i].socket_id, thread_stack);
> +
> /* create a thread for each lcore */
> ret = pthread_create(&lcore_config[i].thread_id, NULL,
> eal_thread_loop, NULL);
> --
> 1.8.3.1
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone
2020-04-13 9:09 [dpdk-dev] [PATCH] eal: Place EAL thread stack in a reserved per-lcore memzone Christos Ricudis
2020-04-13 9:45 ` Jerin Jacob
2020-04-14 11:23 ` Ananyev, Konstantin
@ 2020-04-27 16:51 ` Pavan Nikhilesh Bhagavatula
2 siblings, 0 replies; 4+ messages in thread
From: Pavan Nikhilesh Bhagavatula @ 2020-04-27 16:51 UTC (permalink / raw)
To: Christos Ricudis, Anatoly Burakov, Konstantin Ananyev,
Jerin Jacob Kollanukkaran
Cc: dev
> lib/librte_eal/linux/eal.c | 20 ++++++++++++++++++++
> 1 file changed, 20 insertions(+)
>
>diff --git a/lib/librte_eal/linux/eal.c b/lib/librte_eal/linux/eal.c
>index 9530ee5..e047107 100644
>--- a/lib/librte_eal/linux/eal.c
>+++ b/lib/librte_eal/linux/eal.c
>@@ -68,6 +68,8 @@
>
> #define KERNEL_IOMMU_GROUPS_PATH
>"/sys/kernel/iommu_groups"
>
>+#define THREAD_STACK_SIZE_DEFAULT (4ULL * 1024ULL * 1024ULL)
>+#include <rte_memzone.h>
> /* Allow the application to print its usage message too if set */
> static rte_usage_hook_t rte_application_usage_hook = NULL;
>
>@@ -1224,6 +1226,24 @@ static void rte_eal_init_alert(const char
>*msg)
>
> lcore_config[i].state = WAIT;
>
>+ pthread_attr_t attr;
>+ pthread_attr_init(&attr);
>+ size_t thread_stack_size =
>THREAD_STACK_SIZE_DEFAULT;
>+ char thread_stack_name[64];
>+ snprintf(thread_stack_name, sizeof
>thread_stack_name, "rte:lcore:%s:%d:threadstack",
>rte_eal_process_type() == RTE_PROC_PRIMARY ? "p" : "s", i);
>+ const struct rte_memzone *mz =
>rte_memzone_lookup(thread_stack_name);
>+ if (mz == NULL) {
>+ if ((mz =
>rte_memzone_reserve(thread_stack_name, thread_stack_size,
>lcore_config[i].socket_id, 0)) == NULL) {
>+ rte_panic("Cannot allocate memzone
>for thread stack");
>+ }
>+ }
>+ void *thread_stack = mz->addr;
>+
>+ if (pthread_attr_setstack(&attr, thread_stack,
>thread_stack_size) < 0) {
>+ rte_panic("Cannot set thread stack\n");
>+ }
>+ RTE_LOG(DEBUG, EAL, "Thread stack for lcore %d on
>socket %d set to %p\n", i, lcore_config[i].socket_id, thread_stack);
>+
> /* create a thread for each lcore */
> ret = pthread_create(&lcore_config[i].thread_id, NULL,
> eal_thread_loop, NULL);
Don't we need to pass attr struct created above to ptherad_create as 2nd argument?.
Also, since there is no way to modify master_lcore stack space most of DPDK testsuite wouldn't show any difference.
Pavan.
>--
>1.8.3.1
^ permalink raw reply [flat|nested] 4+ messages in thread