* [PATCH] Fix unused memseg length for memory issue
@ 2024-12-26 8:10 Yang Ming
2025-01-01 12:34 ` Dmitry Kozlyuk
2025-01-02 8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
0 siblings, 2 replies; 6+ messages in thread
From: Yang Ming @ 2024-12-26 8:10 UTC (permalink / raw)
To: Anatoly Burakov; +Cc: dev, Yang Ming, stable
Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg lists to 0.

When eal_legacy_hugepage_init releases the VA space for unused
memseg lists, it does not reset their length to 0. As a result,
mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
memory. This can lead to mlx5_free calling rte_free on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

The issue is intermittent and occurs when the DPDK program’s
memory map places the heap address range between 0 and len (32G).
In such cases, malloc may return an address less than len,
causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org
Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
---
lib/eal/linux/eal_memory.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
mem_sz = msl->len;
munmap(msl->base_va, mem_sz);
msl->base_va = NULL;
+ msl->len = 0;
msl->heap = 0;
/* destroy backing fbarray */
--
2.34.1
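
The effect of the one-line change can be sketched outside DPDK. This is a minimal illustration, assuming a hypothetical `struct msl_sketch` that holds only the three fields touched by the hunk and a hypothetical helper `release_unused_msl_sketch()`; it is not the actual `eal_legacy_hugepage_init()` code.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stddef.h>
#include <sys/mman.h>

/* Hypothetical stand-in holding only the fields the hunk touches. */
struct msl_sketch {
	void *base_va;     /* start of the reserved VA range */
	size_t len;        /* length of the reserved VA range */
	unsigned int heap; /* nonzero if the list backs the DPDK heap */
};

/*
 * Release the VA space of an unused list and clear every field that
 * described it, so that a later range check cannot match the stale range.
 * Returns the munmap() result (0 on success, -1 on error).
 */
static int
release_unused_msl_sketch(struct msl_sketch *msl)
{
	assert(msl != NULL);

	int ret = munmap(msl->base_va, msl->len);

	msl->base_va = NULL;
	msl->len = 0; /* the line added by the patch */
	msl->heap = 0;
	return ret;
}
```

After the call, the list describes an empty range, so an address can no longer fall inside it.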
* Re: [PATCH] Fix unused memseg length for memory issue
2024-12-26 8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
@ 2025-01-01 12:34 ` Dmitry Kozlyuk
2025-01-02 6:59 ` Yang Ming
2025-01-02 8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
1 sibling, 1 reply; 6+ messages in thread
From: Dmitry Kozlyuk @ 2025-01-01 12:34 UTC (permalink / raw)
To: Yang Ming; +Cc: Anatoly Burakov, dev, stable
2024-12-26 16:10 (UTC+0800), Yang Ming:
> Fix the issue where OS memory is mistakenly freed with rte_free
> by setting the length (len) of unused memseg to 0.
>
> When eal_legacy_hugepage_init releases the VA space for unused
> memseg lists, it does not reset their length to 0. As a result,
> mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
> memory. This can lead to mlx5_free calling rte_free on OS memory,
> causing an "EAL: Error: Invalid memory" log and failing to free
> the OS memory.
>
> This issue is occasional and occurs when the DPDK program’s
> memory map places the heap address range between 0 and len(32G).
> In such cases, malloc may return an address less than len,
> causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.
>
> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
> Cc: anatoly.burakov@intel.com
> Cc: stable@dpdk.org
>
> Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
> ---
> lib/eal/linux/eal_memory.c | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
> index 45879ca743..9dda60c0e1 100644
> --- a/lib/eal/linux/eal_memory.c
> +++ b/lib/eal/linux/eal_memory.c
> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
> mem_sz = msl->len;
> munmap(msl->base_va, mem_sz);
> msl->base_va = NULL;
> + msl->len = 0;
> msl->heap = 0;
>
> /* destroy backing fbarray */
Hi Yang,

It seems the bug affects more than just the mlx5 PMD.

Consider how the MSL with `base_va == NULL` ends up in `mlx5_mem_is_rte()`.
It comes from `rte_mem_virt2memseg_list()`, which iterates MSLs
and checks that an address belongs to [`base_va`; `base_va+len`)
without checking whether `base_va == NULL`, i.e. that the MSL is inactive.
Your patch also corrects `rte_mem_virt2memseg_list()` behavior.
Please mention this in the commit message.

Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
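
The lookup described in this reply can be modeled with a short standalone sketch. It assumes a hypothetical `struct msl_sketch` and a hypothetical `find_msl_sketch()` that mimic only the range check described above; the real `rte_mem_virt2memseg_list()` works on DPDK's shared memory configuration and may differ in detail.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for a memseg list (MSL). */
struct msl_sketch {
	void *base_va; /* start of the VA range, NULL once released */
	uint64_t len;  /* length of the VA range in bytes */
};

/*
 * Return the index of the first list whose [base_va, base_va + len)
 * contains addr, or -1 if none does. As in the description above,
 * there is deliberately no check for base_va == NULL.
 */
static int
find_msl_sketch(const struct msl_sketch *lists, size_t n, const void *addr)
{
	assert(lists != NULL);

	uint64_t a = (uint64_t)(uintptr_t)addr;

	for (size_t i = 0; i < n; i++) {
		uint64_t start = (uint64_t)(uintptr_t)lists[i].base_va;
		uint64_t end = start + lists[i].len;

		if (a >= start && a < end)
			return (int)i;
	}
	return -1;
}
```

With a released list left as `{ NULL, 32ULL << 30 }`, a low heap pointer such as `0x10000000` falls inside [0; 32G) and is reported as belonging to that list. With `len` reset to 0 the range is empty and the lookup returns -1.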
* Re: [PATCH] Fix unused memseg length for memory issue
2025-01-01 12:34 ` Dmitry Kozlyuk
@ 2025-01-02 6:59 ` Yang Ming
0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-02 6:59 UTC (permalink / raw)
To: Dmitry Kozlyuk; +Cc: Anatoly Burakov, dev, stable
On 2025/1/1 20:34, Dmitry Kozlyuk wrote:
> 2024-12-26 16:10 (UTC+0800), Yang Ming:
>> Fix the issue where OS memory is mistakenly freed with rte_free
>> by setting the length (len) of unused memseg to 0.
>>
>> When eal_legacy_hugepage_init releases the VA space for unused
>> memseg lists, it does not reset their length to 0. As a result,
>> mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
>> memory. This can lead to mlx5_free calling rte_free on OS memory,
>> causing an "EAL: Error: Invalid memory" log and failing to free
>> the OS memory.
>>
>> This issue is occasional and occurs when the DPDK program’s
>> memory map places the heap address range between 0 and len(32G).
>> In such cases, malloc may return an address less than len,
>> causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.
>>
>> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
>> Cc: anatoly.burakov@intel.com
>> Cc: stable@dpdk.org
>>
>> Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
>> ---
>> lib/eal/linux/eal_memory.c | 1 +
>> 1 file changed, 1 insertion(+)
>>
>> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
>> index 45879ca743..9dda60c0e1 100644
>> --- a/lib/eal/linux/eal_memory.c
>> +++ b/lib/eal/linux/eal_memory.c
>> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
>> mem_sz = msl->len;
>> munmap(msl->base_va, mem_sz);
>> msl->base_va = NULL;
>> + msl->len = 0;
>> msl->heap = 0;
>>
>> /* destroy backing fbarray */
> Hi Yang,
>
> It seems the bug affects more than just mlx5 PMD.
>
> Consider how the MSL with `base_va == NULL` ends up in `mlx5_mem_is_rte()`.
> It comes from `rte_mem_virt2memseg_list()` which iterates MSLs
> and checks that an address belongs to [`base_va`; `base_va+len`)
> without checking whether `base_va == NULL` i.e. that the MSL is inactive.
> Your patch also corrects `rte_mem_virt2memseg_list()` behavior.
> Please mention this in the commit message.
>
> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
Hi Dmitry,

Thanks. I will send a new version (v2) of this patch that adds this
information to the commit log.
* [PATCH v2] eal: fix unused memseg length
2024-12-26 8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
2025-01-01 12:34 ` Dmitry Kozlyuk
@ 2025-01-02 8:58 ` Yang Ming
2025-01-22 3:14 ` Yang Ming
1 sibling, 1 reply; 6+ messages in thread
From: Yang Ming @ 2025-01-02 8:58 UTC (permalink / raw)
To: dev; +Cc: Yang Ming, anatoly.burakov, stable, Dmitry Kozlyuk
Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg lists to 0.

When `eal_legacy_hugepage_init()` releases the VA space for
unused memseg lists (MSLs), it does not reset their length to 0.
As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
memory as rte memory.
This can lead to `mlx5_free()` calling `rte_free()` on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

The issue is intermittent and occurs when the DPDK program’s
memory map places the heap address range between 0 and len (32G).
In such cases, malloc may return an address less than len,
causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
memory.

The MSL with `base_va == NULL` reaches `mlx5_mem_is_rte()`
through `rte_mem_virt2memseg_list()`, which iterates MSLs and
checks that an address belongs to [`base_va`; `base_va+len`)
without checking whether `base_va == NULL`, i.e. that the MSL
is inactive. So this patch also corrects
`rte_mem_virt2memseg_list()` behavior.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org
Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
---
lib/eal/linux/eal_memory.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
mem_sz = msl->len;
munmap(msl->base_va, mem_sz);
msl->base_va = NULL;
+ msl->len = 0;
msl->heap = 0;
/* destroy backing fbarray */
--
2.34.1
* Re: [PATCH v2] eal: fix unused memseg length
2025-01-02 8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
@ 2025-01-22 3:14 ` Yang Ming
0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-22 3:14 UTC (permalink / raw)
To: dev, Thomas Monjalon; +Cc: anatoly.burakov, stable, Dmitry Kozlyuk
Hi experts, could this patch be reviewed and accepted?

On 2025/1/2 16:58, Yang Ming wrote:
> Fix the issue where OS memory is mistakenly freed with rte_free
> by setting the length (len) of unused memseg to 0.
>
> When `eal_legacy_hugepage_init()` releases the VA space for
> unused memseg lists(MSLs), it does not reset MSLs' length to 0.
> As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
> memory as rte memory.
> This can lead to `mlx5_free()` calling `rte_free()` on OS memory,
> causing an "EAL: Error: Invalid memory" log and failing to free
> the OS memory.
>
> This issue is occasional and occurs when the DPDK program’s
> memory map places the heap address range between 0 and len(32G).
> In such cases, malloc may return an address less than len,
> causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
> memory.
>
> Also, consider how the MSL with `base_va == NULL` ends up in
> `mlx5_mem_is_rte()`. It comes from `rte_mem_virt2memseg_list()`
> which iterates MSLs and checks that an address belongs to
> [`base_va`; `base_va+len`) without checking whether
> `base_va == NULL` i.e. that the MSL is inactive. So this patch
> also corrects `rte_mem_virt2memseg_list()` behavior.
>
> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
> Cc: anatoly.burakov@intel.com
> Cc: stable@dpdk.org
>
> Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
> ---
> lib/eal/linux/eal_memory.c | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
> index 45879ca743..9dda60c0e1 100644
> --- a/lib/eal/linux/eal_memory.c
> +++ b/lib/eal/linux/eal_memory.c
> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
> mem_sz = msl->len;
> munmap(msl->base_va, mem_sz);
> msl->base_va = NULL;
> + msl->len = 0;
> msl->heap = 0;
>
> /* destroy backing fbarray */
* [PATCH v2] eal: fix unused memseg length
@ 2025-01-02 7:26 Yang Ming
0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-02 7:26 UTC (permalink / raw)
To: Anatoly Burakov; +Cc: dev, Yang Ming, stable, Dmitry Kozlyuk
Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg lists to 0.

When `eal_legacy_hugepage_init()` releases the VA space for
unused memseg lists (MSLs), it does not reset their length to 0.
As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
memory as rte memory.
This can lead to `mlx5_free()` calling `rte_free()` on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

The issue is intermittent and occurs when the DPDK program’s
memory map places the heap address range between 0 and len (32G).
In such cases, malloc may return an address less than len,
causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
memory.

The MSL with `base_va == NULL` reaches `mlx5_mem_is_rte()`
through `rte_mem_virt2memseg_list()`, which iterates MSLs and
checks that an address belongs to [`base_va`; `base_va+len`)
without checking whether `base_va == NULL`, i.e. that the MSL
is inactive. So this patch also corrects
`rte_mem_virt2memseg_list()` behavior.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org
Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
---
lib/eal/linux/eal_memory.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
mem_sz = msl->len;
munmap(msl->base_va, mem_sz);
msl->base_va = NULL;
+ msl->len = 0;
msl->heap = 0;
/* destroy backing fbarray */
--
2.34.1
Thread overview: 6+ messages
2024-12-26 8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
2025-01-01 12:34 ` Dmitry Kozlyuk
2025-01-02 6:59 ` Yang Ming
2025-01-02 8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
2025-01-22 3:14 ` Yang Ming
2025-01-02 7:26 Yang Ming