DPDK patches and discussions
 help / color / mirror / Atom feed
* [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
@ 2021-01-29 15:29 Anatoly Burakov
  2021-01-29 15:40 ` Thomas Monjalon
  0 siblings, 1 reply; 4+ messages in thread
From: Anatoly Burakov @ 2021-01-29 15:29 UTC (permalink / raw)
  To: dev; +Cc: thomas, james.r.harris

Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
heap was an external heap. However, that API is thread-safe, and when
we're inside the allocation process, we're already write-locked, so
calling `rte_malloc_heap_socket_is_external()` will result in a
deadlock followed by a timeout.

Fix it by replacing the API call with a check against maximum number of
NUMA nodes, because external heaps always have higher socket ID's.

Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")

Reported-by: Jim Harris <james.r.harris@intel.com>

Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
---
 lib/librte_eal/common/malloc_mp.c | 9 +++++++--
 1 file changed, 7 insertions(+), 2 deletions(-)

diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
index 0b19d4d5fb..b1f7f7824b 100644
--- a/lib/librte_eal/common/malloc_mp.c
+++ b/lib/librte_eal/common/malloc_mp.c
@@ -241,8 +241,13 @@ handle_alloc_request(const struct malloc_mp_req *m,
 
 	heap = &mcfg->malloc_heaps[ar->malloc_heap_idx];
 
-	/* for allocations, we must only use internal heaps */
-	if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
+	/*
+	 * for allocations, we must only use internal heaps, but since the
+	 * rte_malloc_heap_socket_is_external() is thread-safe and we're already
+	 * read-locked, we'll have to take advantage of the fac that internal
+	 * socket ID's are always lower than RTE_MAX_NUMA_NODES.
+	 */
+	if (heap->socket_id >= RTE_MAX_NUMA_NODES) {
 		RTE_LOG(ERR, EAL, "Attempting to allocate from external heap\n");
 		return -1;
 	}
-- 
2.25.1

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
  2021-01-29 15:29 [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation Anatoly Burakov
@ 2021-01-29 15:40 ` Thomas Monjalon
  2021-01-29 16:07   ` Burakov, Anatoly
  0 siblings, 1 reply; 4+ messages in thread
From: Thomas Monjalon @ 2021-01-29 15:40 UTC (permalink / raw)
  To: Anatoly Burakov; +Cc: dev, james.r.harris

29/01/2021 16:29, Anatoly Burakov:
> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
> heap was an external heap. However, that API is thread-safe, and when
> we're inside the allocation process, we're already write-locked, so
> calling `rte_malloc_heap_socket_is_external()` will result in a
> deadlock followed by a timeout.
> 
> Fix it by replacing the API call with a check against maximum number of
> NUMA nodes, because external heaps always have higher socket ID's.

Is there some unit tests for such thing?

> 
> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
> 
> Reported-by: Jim Harris <james.r.harris@intel.com>
> 

No need of blank line here.

> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
> ---
>  lib/librte_eal/common/malloc_mp.c | 9 +++++++--
>  1 file changed, 7 insertions(+), 2 deletions(-)
> 
> diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
> index 0b19d4d5fb..b1f7f7824b 100644
> --- a/lib/librte_eal/common/malloc_mp.c
> +++ b/lib/librte_eal/common/malloc_mp.c
> -	/* for allocations, we must only use internal heaps */
> -	if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
> +	/*
> +	 * for allocations, we must only use internal heaps, but since the
> +	 * rte_malloc_heap_socket_is_external() is thread-safe and we're already
> +	 * read-locked, we'll have to take advantage of the fac that internal

fac -> fact?

> +	 * socket ID's are always lower than RTE_MAX_NUMA_NODES.
> +	 */
> +	if (heap->socket_id >= RTE_MAX_NUMA_NODES) {





^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
  2021-01-29 15:40 ` Thomas Monjalon
@ 2021-01-29 16:07   ` Burakov, Anatoly
  2021-01-29 23:37     ` Thomas Monjalon
  0 siblings, 1 reply; 4+ messages in thread
From: Burakov, Anatoly @ 2021-01-29 16:07 UTC (permalink / raw)
  To: Thomas Monjalon; +Cc: dev, james.r.harris

On 29-Jan-21 3:40 PM, Thomas Monjalon wrote:
> 29/01/2021 16:29, Anatoly Burakov:
>> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
>> heap was an external heap. However, that API is thread-safe, and when
>> we're inside the allocation process, we're already write-locked, so
>> calling `rte_malloc_heap_socket_is_external()` will result in a
>> deadlock followed by a timeout.
>>
>> Fix it by replacing the API call with a check against maximum number of
>> NUMA nodes, because external heaps always have higher socket ID's.
> 
> Is there some unit tests for such thing?

I couldn't reproduce this using autotests, but Jim has SPDK tests which 
triggered this error.

Since this is dependent upon secondary process, any test would 
necessarily have to be manual here, i think.

> 
>>
>> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
>>
>> Reported-by: Jim Harris <james.r.harris@intel.com>
>>
> 
> No need of blank line here.

Need to update my scripts :P

> 
>> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
>> ---
>>   lib/librte_eal/common/malloc_mp.c | 9 +++++++--
>>   1 file changed, 7 insertions(+), 2 deletions(-)
>>
>> diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
>> index 0b19d4d5fb..b1f7f7824b 100644
>> --- a/lib/librte_eal/common/malloc_mp.c
>> +++ b/lib/librte_eal/common/malloc_mp.c
>> -	/* for allocations, we must only use internal heaps */
>> -	if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
>> +	/*
>> +	 * for allocations, we must only use internal heaps, but since the
>> +	 * rte_malloc_heap_socket_is_external() is thread-safe and we're already
>> +	 * read-locked, we'll have to take advantage of the fac that internal
> 
> fac -> fact?
> 

Yes.

>> +	 * socket ID's are always lower than RTE_MAX_NUMA_NODES.
>> +	 */
>> +	if (heap->socket_id >= RTE_MAX_NUMA_NODES) {
> 
> 
> 
> 


-- 
Thanks,
Anatoly

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
  2021-01-29 16:07   ` Burakov, Anatoly
@ 2021-01-29 23:37     ` Thomas Monjalon
  0 siblings, 0 replies; 4+ messages in thread
From: Thomas Monjalon @ 2021-01-29 23:37 UTC (permalink / raw)
  To: Burakov, Anatoly; +Cc: dev, james.r.harris

29/01/2021 17:07, Burakov, Anatoly:
> On 29-Jan-21 3:40 PM, Thomas Monjalon wrote:
> > 29/01/2021 16:29, Anatoly Burakov:
> >> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
> >> heap was an external heap. However, that API is thread-safe, and when
> >> we're inside the allocation process, we're already write-locked, so
> >> calling `rte_malloc_heap_socket_is_external()` will result in a
> >> deadlock followed by a timeout.
> >>
> >> Fix it by replacing the API call with a check against maximum number of
> >> NUMA nodes, because external heaps always have higher socket ID's.
> > 
> > Is there some unit tests for such thing?
> 
> I couldn't reproduce this using autotests, but Jim has SPDK tests which 
> triggered this error.
> 
> Since this is dependent upon secondary process, any test would 
> necessarily have to be manual here, i think.
> 
> >> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
> >>
> >> Reported-by: Jim Harris <james.r.harris@intel.com>
> >>
> > 
> > No need of blank line here.
> 
> Need to update my scripts :P
> 
> >> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
> >> ---
[...]
> >> +	/*
> >> +	 * for allocations, we must only use internal heaps, but since the
> >> +	 * rte_malloc_heap_socket_is_external() is thread-safe and we're already
> >> +	 * read-locked, we'll have to take advantage of the fac that internal
> > 
> > fac -> fact?
> 
> Yes.
> 
> >> +	 * socket ID's are always lower than RTE_MAX_NUMA_NODES.
> >> +	 */

Applied with minor changes, thanks.



^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2021-01-29 23:37 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-01-29 15:29 [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation Anatoly Burakov
2021-01-29 15:40 ` Thomas Monjalon
2021-01-29 16:07   ` Burakov, Anatoly
2021-01-29 23:37     ` Thomas Monjalon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).