* [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
@ 2021-01-29 15:29 Anatoly Burakov
2021-01-29 15:40 ` Thomas Monjalon
0 siblings, 1 reply; 4+ messages in thread
From: Anatoly Burakov @ 2021-01-29 15:29 UTC (permalink / raw)
To: dev; +Cc: thomas, james.r.harris
Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
heap was an external heap. However, that API is thread-safe, and when
we're inside the allocation process, we're already write-locked, so
calling `rte_malloc_heap_socket_is_external()` will result in a
deadlock followed by a timeout.
Fix it by replacing the API call with a check against maximum number of
NUMA nodes, because external heaps always have higher socket ID's.
Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
Reported-by: Jim Harris <james.r.harris@intel.com>
Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
---
lib/librte_eal/common/malloc_mp.c | 9 +++++++--
1 file changed, 7 insertions(+), 2 deletions(-)
diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
index 0b19d4d5fb..b1f7f7824b 100644
--- a/lib/librte_eal/common/malloc_mp.c
+++ b/lib/librte_eal/common/malloc_mp.c
@@ -241,8 +241,13 @@ handle_alloc_request(const struct malloc_mp_req *m,
heap = &mcfg->malloc_heaps[ar->malloc_heap_idx];
- /* for allocations, we must only use internal heaps */
- if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
+ /*
+ * for allocations, we must only use internal heaps, but since the
+ * rte_malloc_heap_socket_is_external() is thread-safe and we're already
+ * read-locked, we'll have to take advantage of the fac that internal
+ * socket ID's are always lower than RTE_MAX_NUMA_NODES.
+ */
+ if (heap->socket_id >= RTE_MAX_NUMA_NODES) {
RTE_LOG(ERR, EAL, "Attempting to allocate from external heap\n");
return -1;
}
--
2.25.1
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
2021-01-29 15:29 [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation Anatoly Burakov
@ 2021-01-29 15:40 ` Thomas Monjalon
2021-01-29 16:07 ` Burakov, Anatoly
0 siblings, 1 reply; 4+ messages in thread
From: Thomas Monjalon @ 2021-01-29 15:40 UTC (permalink / raw)
To: Anatoly Burakov; +Cc: dev, james.r.harris
29/01/2021 16:29, Anatoly Burakov:
> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
> heap was an external heap. However, that API is thread-safe, and when
> we're inside the allocation process, we're already write-locked, so
> calling `rte_malloc_heap_socket_is_external()` will result in a
> deadlock followed by a timeout.
>
> Fix it by replacing the API call with a check against maximum number of
> NUMA nodes, because external heaps always have higher socket ID's.
Is there some unit tests for such thing?
>
> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
>
> Reported-by: Jim Harris <james.r.harris@intel.com>
>
No need of blank line here.
> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
> ---
> lib/librte_eal/common/malloc_mp.c | 9 +++++++--
> 1 file changed, 7 insertions(+), 2 deletions(-)
>
> diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
> index 0b19d4d5fb..b1f7f7824b 100644
> --- a/lib/librte_eal/common/malloc_mp.c
> +++ b/lib/librte_eal/common/malloc_mp.c
> - /* for allocations, we must only use internal heaps */
> - if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
> + /*
> + * for allocations, we must only use internal heaps, but since the
> + * rte_malloc_heap_socket_is_external() is thread-safe and we're already
> + * read-locked, we'll have to take advantage of the fac that internal
fac -> fact?
> + * socket ID's are always lower than RTE_MAX_NUMA_NODES.
> + */
> + if (heap->socket_id >= RTE_MAX_NUMA_NODES) {
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
2021-01-29 15:40 ` Thomas Monjalon
@ 2021-01-29 16:07 ` Burakov, Anatoly
2021-01-29 23:37 ` Thomas Monjalon
0 siblings, 1 reply; 4+ messages in thread
From: Burakov, Anatoly @ 2021-01-29 16:07 UTC (permalink / raw)
To: Thomas Monjalon; +Cc: dev, james.r.harris
On 29-Jan-21 3:40 PM, Thomas Monjalon wrote:
> 29/01/2021 16:29, Anatoly Burakov:
>> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
>> heap was an external heap. However, that API is thread-safe, and when
>> we're inside the allocation process, we're already write-locked, so
>> calling `rte_malloc_heap_socket_is_external()` will result in a
>> deadlock followed by a timeout.
>>
>> Fix it by replacing the API call with a check against maximum number of
>> NUMA nodes, because external heaps always have higher socket ID's.
>
> Is there some unit tests for such thing?
I couldn't reproduce this using autotests, but Jim has SPDK tests which
triggered this error.
Since this is dependent upon secondary process, any test would
necessarily have to be manual here, i think.
>
>>
>> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
>>
>> Reported-by: Jim Harris <james.r.harris@intel.com>
>>
>
> No need of blank line here.
Need to update my scripts :P
>
>> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
>> ---
>> lib/librte_eal/common/malloc_mp.c | 9 +++++++--
>> 1 file changed, 7 insertions(+), 2 deletions(-)
>>
>> diff --git a/lib/librte_eal/common/malloc_mp.c b/lib/librte_eal/common/malloc_mp.c
>> index 0b19d4d5fb..b1f7f7824b 100644
>> --- a/lib/librte_eal/common/malloc_mp.c
>> +++ b/lib/librte_eal/common/malloc_mp.c
>> - /* for allocations, we must only use internal heaps */
>> - if (rte_malloc_heap_socket_is_external(heap->socket_id)) {
>> + /*
>> + * for allocations, we must only use internal heaps, but since the
>> + * rte_malloc_heap_socket_is_external() is thread-safe and we're already
>> + * read-locked, we'll have to take advantage of the fac that internal
>
> fac -> fact?
>
Yes.
>> + * socket ID's are always lower than RTE_MAX_NUMA_NODES.
>> + */
>> + if (heap->socket_id >= RTE_MAX_NUMA_NODES) {
>
>
>
>
--
Thanks,
Anatoly
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation
2021-01-29 16:07 ` Burakov, Anatoly
@ 2021-01-29 23:37 ` Thomas Monjalon
0 siblings, 0 replies; 4+ messages in thread
From: Thomas Monjalon @ 2021-01-29 23:37 UTC (permalink / raw)
To: Burakov, Anatoly; +Cc: dev, james.r.harris
29/01/2021 17:07, Burakov, Anatoly:
> On 29-Jan-21 3:40 PM, Thomas Monjalon wrote:
> > 29/01/2021 16:29, Anatoly Burakov:
> >> Previous fix used `rte_malloc_heap_socket_is_external()` to check if the
> >> heap was an external heap. However, that API is thread-safe, and when
> >> we're inside the allocation process, we're already write-locked, so
> >> calling `rte_malloc_heap_socket_is_external()` will result in a
> >> deadlock followed by a timeout.
> >>
> >> Fix it by replacing the API call with a check against maximum number of
> >> NUMA nodes, because external heaps always have higher socket ID's.
> >
> > Is there some unit tests for such thing?
>
> I couldn't reproduce this using autotests, but Jim has SPDK tests which
> triggered this error.
>
> Since this is dependent upon secondary process, any test would
> necessarily have to be manual here, i think.
>
> >> Fixes: 7ac31e82bc8f ("mem: improve parameter checking on memory hotplug")
> >>
> >> Reported-by: Jim Harris <james.r.harris@intel.com>
> >>
> >
> > No need of blank line here.
>
> Need to update my scripts :P
>
> >> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>
> >> ---
[...]
> >> + /*
> >> + * for allocations, we must only use internal heaps, but since the
> >> + * rte_malloc_heap_socket_is_external() is thread-safe and we're already
> >> + * read-locked, we'll have to take advantage of the fac that internal
> >
> > fac -> fact?
>
> Yes.
>
> >> + * socket ID's are always lower than RTE_MAX_NUMA_NODES.
> >> + */
Applied with minor changes, thanks.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2021-01-29 23:37 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-01-29 15:29 [dpdk-dev] [PATCH] mem: fix deadlock on secondary allocation Anatoly Burakov
2021-01-29 15:40 ` Thomas Monjalon
2021-01-29 16:07 ` Burakov, Anatoly
2021-01-29 23:37 ` Thomas Monjalon
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).