patches for DPDK stable branches
 help / color / mirror / Atom feed
* [PATCH] Fix unused memseg length for memory issue
@ 2024-12-26  8:10 Yang Ming
  2025-01-01 12:34 ` Dmitry Kozlyuk
  2025-01-02  8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
  0 siblings, 2 replies; 6+ messages in thread
From: Yang Ming @ 2024-12-26  8:10 UTC (permalink / raw)
  To: Anatoly Burakov; +Cc: dev, Yang Ming, stable

Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg to 0.

When eal_legacy_hugepage_init releases the VA space for unused
memseg lists, it does not reset their length to 0. As a result,
mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
memory. This can lead to mlx_free calling rte_free on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

This issue is occasional and occurs when the DPDK program’s
memory map places the heap address range between 0 and len(32G).
In such cases, malloc may return an address less than len,
causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org

Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
---
 lib/eal/linux/eal_memory.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
 		mem_sz = msl->len;
 		munmap(msl->base_va, mem_sz);
 		msl->base_va = NULL;
+		msl->len = 0;
 		msl->heap = 0;
 
 		/* destroy backing fbarray */
-- 
2.34.1


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] Fix unused memseg length for memory issue
  2024-12-26  8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
@ 2025-01-01 12:34 ` Dmitry Kozlyuk
  2025-01-02  6:59   ` Yang Ming
  2025-01-02  8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
  1 sibling, 1 reply; 6+ messages in thread
From: Dmitry Kozlyuk @ 2025-01-01 12:34 UTC (permalink / raw)
  To: Yang Ming; +Cc: Anatoly Burakov, dev, stable

2024-12-26 16:10 (UTC+0800), Yang Ming:
> Fix the issue where OS memory is mistakenly freed with rte_free
> by setting the length (len) of unused memseg to 0.
> 
> When eal_legacy_hugepage_init releases the VA space for unused
> memseg lists, it does not reset their length to 0. As a result,
> mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
> memory. This can lead to mlx_free calling rte_free on OS memory,
> causing an "EAL: Error: Invalid memory" log and failing to free
> the OS memory.
> 
> This issue is occasional and occurs when the DPDK program’s
> memory map places the heap address range between 0 and len(32G).
> In such cases, malloc may return an address less than len,
> causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.
> 
> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
> Cc: anatoly.burakov@intel.com
> Cc: stable@dpdk.org
> 
> Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
> ---
>  lib/eal/linux/eal_memory.c | 1 +
>  1 file changed, 1 insertion(+)
> 
> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
> index 45879ca743..9dda60c0e1 100644
> --- a/lib/eal/linux/eal_memory.c
> +++ b/lib/eal/linux/eal_memory.c
> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
>  		mem_sz = msl->len;
>  		munmap(msl->base_va, mem_sz);
>  		msl->base_va = NULL;
> +		msl->len = 0;
>  		msl->heap = 0;
>  
>  		/* destroy backing fbarray */

Hi Yang,

It seems the bug affects more than just mlx5 PMD.

Consider how the MSL with `base_va == NULL` ends up in `mlx5_mem_is_rte()`.
It comes from `rte_mem_virt2memseg_list()` which iterates MSLs
and checks that an address belongs to [`base_va`; `base_va+len`)
without checking whether `base_va == NULL` i.e. that the MSL is inactive.
Your patch also corrects `rte_mem_virt2memseg_list()` behavior.
Please mention this in the commit message.

Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] Fix unused memseg length for memory issue
  2025-01-01 12:34 ` Dmitry Kozlyuk
@ 2025-01-02  6:59   ` Yang Ming
  0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-02  6:59 UTC (permalink / raw)
  To: Dmitry Kozlyuk; +Cc: Anatoly Burakov, dev, stable

[-- Attachment #1: Type: text/plain, Size: 2132 bytes --]

On 2025/1/1 20:34, Dmitry Kozlyuk wrote:
> 2024-12-26 16:10 (UTC+0800), Yang Ming:
>> Fix the issue where OS memory is mistakenly freed with rte_free
>> by setting the length (len) of unused memseg to 0.
>>
>> When eal_legacy_hugepage_init releases the VA space for unused
>> memseg lists, it does not reset their length to 0. As a result,
>> mlx5_mem_is_rte may incorrectly identify OS memory as DPDK
>> memory. This can lead to mlx_free calling rte_free on OS memory,
>> causing an "EAL: Error: Invalid memory" log and failing to free
>> the OS memory.
>>
>> This issue is occasional and occurs when the DPDK program’s
>> memory map places the heap address range between 0 and len(32G).
>> In such cases, malloc may return an address less than len,
>> causing mlx5_mem_is_rte to incorrectly treat it as DPDK memory.
>>
>> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
>> Cc:anatoly.burakov@intel.com
>> Cc:stable@dpdk.org
>>
>> Signed-off-by: Yang Ming<ming.1.yang@nokia-sbell.com>
>> ---
>>   lib/eal/linux/eal_memory.c | 1 +
>>   1 file changed, 1 insertion(+)
>>
>> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
>> index 45879ca743..9dda60c0e1 100644
>> --- a/lib/eal/linux/eal_memory.c
>> +++ b/lib/eal/linux/eal_memory.c
>> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
>>   		mem_sz = msl->len;
>>   		munmap(msl->base_va, mem_sz);
>>   		msl->base_va = NULL;
>> +		msl->len = 0;
>>   		msl->heap = 0;
>>   
>>   		/* destroy backing fbarray */
> Hi Yang,
>
> It seems the bug affects more than just mlx5 PMD.
>
> Consider how the MSL with `base_va == NULL` ends up in `mlx5_mem_is_rte()`.
> It comes from `rte_mem_virt2memseg_list()` which iterates MSLs
> and checks that an address belongs to [`base_va`; `base_va+len`)
> without checking whether `base_va == NULL` i.e. that the MSL is inactive.
> Your patch also corrects `rte_mem_virt2memseg_list()` behavior.
> Please mention this in the commit message.
>
> Acked-by: Dmitry Kozlyuk<dmitry.kozliuk@gmail.com>

Hi Dmitry,

Thanks. I will update this patch within new version (v2) to add these 
content to the commit log.


[-- Attachment #2: Type: text/html, Size: 2913 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [PATCH v2] eal: fix unused memseg length
  2024-12-26  8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
  2025-01-01 12:34 ` Dmitry Kozlyuk
@ 2025-01-02  8:58 ` Yang Ming
  2025-01-22  3:14   ` Yang Ming
  1 sibling, 1 reply; 6+ messages in thread
From: Yang Ming @ 2025-01-02  8:58 UTC (permalink / raw)
  To: dev; +Cc: Yang Ming, anatoly.burakov, stable, Dmitry Kozlyuk

Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg to 0.

When `eal_legacy_hugepage_init()` releases the VA space for
unused memseg lists(MSLs), it does not reset MSLs' length to 0.
As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
memory as rte memory.
This can lead to `mlx_free()` calling `rte_free()` on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

This issue is occasional and occurs when the DPDK program’s
memory map places the heap address range between 0 and len(32G).
In such cases, malloc may return an address less than len,
causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
memory.

Also, consider how the MSL with `base_va == NULL` ends up in
`mlx5_mem_is_rte()`. It comes from `rte_mem_virt2memseg_list()`
which iterates MSLs and checks that an address belongs to
[`base_va`; `base_va+len`) without checking whether
`base_va == NULL` i.e. that the MSL is inactive. So this patch
also corrects `rte_mem_virt2memseg_list()` behavior.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org

Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
---
 lib/eal/linux/eal_memory.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
 		mem_sz = msl->len;
 		munmap(msl->base_va, mem_sz);
 		msl->base_va = NULL;
+		msl->len = 0;
 		msl->heap = 0;
 
 		/* destroy backing fbarray */
-- 
2.34.1


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] eal: fix unused memseg length
  2025-01-02  8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
@ 2025-01-22  3:14   ` Yang Ming
  0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-22  3:14 UTC (permalink / raw)
  To: dev, Thomas Monjalon; +Cc: anatoly.burakov, stable, Dmitry Kozlyuk

Hi experts, is there any chance to review and accept this patch?

On 2025/1/2 16:58, Yang Ming wrote:
> Fix the issue where OS memory is mistakenly freed with rte_free
> by setting the length (len) of unused memseg to 0.
>
> When `eal_legacy_hugepage_init()` releases the VA space for
> unused memseg lists(MSLs), it does not reset MSLs' length to 0.
> As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
> memory as rte memory.
> This can lead to `mlx_free()` calling `rte_free()` on OS memory,
> causing an "EAL: Error: Invalid memory" log and failing to free
> the OS memory.
>
> This issue is occasional and occurs when the DPDK program’s
> memory map places the heap address range between 0 and len(32G).
> In such cases, malloc may return an address less than len,
> causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
> memory.
>
> Also, consider how the MSL with `base_va == NULL` ends up in
> `mlx5_mem_is_rte()`. It comes from `rte_mem_virt2memseg_list()`
> which iterates MSLs and checks that an address belongs to
> [`base_va`; `base_va+len`) without checking whether
> `base_va == NULL` i.e. that the MSL is inactive. So this patch
> also corrects `rte_mem_virt2memseg_list()` behavior.
>
> Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
> Cc: anatoly.burakov@intel.com
> Cc: stable@dpdk.org
>
> Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
> ---
>   lib/eal/linux/eal_memory.c | 1 +
>   1 file changed, 1 insertion(+)
>
> diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
> index 45879ca743..9dda60c0e1 100644
> --- a/lib/eal/linux/eal_memory.c
> +++ b/lib/eal/linux/eal_memory.c
> @@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
>   		mem_sz = msl->len;
>   		munmap(msl->base_va, mem_sz);
>   		msl->base_va = NULL;
> +		msl->len = 0;
>   		msl->heap = 0;
>   
>   		/* destroy backing fbarray */

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [PATCH v2] eal: fix unused memseg length
@ 2025-01-02  7:26 Yang Ming
  0 siblings, 0 replies; 6+ messages in thread
From: Yang Ming @ 2025-01-02  7:26 UTC (permalink / raw)
  To: Anatoly Burakov; +Cc: dev, Yang Ming, stable, Dmitry Kozlyuk

Fix the issue where OS memory is mistakenly freed with rte_free
by setting the length (len) of unused memseg to 0.

When `eal_legacy_hugepage_init()` releases the VA space for
unused memseg lists(MSLs), it does not reset MSLs' length to 0.
As a result, `mlx5_mem_is_rte()` may incorrectly identify OS
memory as rte memory.
This can lead to `mlx_free()` calling `rte_free()` on OS memory,
causing an "EAL: Error: Invalid memory" log and failing to free
the OS memory.

This issue is occasional and occurs when the DPDK program’s
memory map places the heap address range between 0 and len(32G).
In such cases, malloc may return an address less than len,
causing `mlx5_mem_is_rte()` to incorrectly treat it as rte
memory.

Also, consider how the MSL with `base_va == NULL` ends up in
`mlx5_mem_is_rte()`. It comes from `rte_mem_virt2memseg_list()`
which iterates MSLs and checks that an address belongs to
[`base_va`; `base_va+len`) without checking whether
`base_va == NULL` i.e. that the MSL is inactive. So this patch
also corrects `rte_mem_virt2memseg_list()` behavior.

Fixes: 66cc45e293ed ("mem: replace memseg with memseg lists")
Cc: anatoly.burakov@intel.com
Cc: stable@dpdk.org

Signed-off-by: Yang Ming <ming.1.yang@nokia-sbell.com>
Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>
---
 lib/eal/linux/eal_memory.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/lib/eal/linux/eal_memory.c b/lib/eal/linux/eal_memory.c
index 45879ca743..9dda60c0e1 100644
--- a/lib/eal/linux/eal_memory.c
+++ b/lib/eal/linux/eal_memory.c
@@ -1472,6 +1472,7 @@ eal_legacy_hugepage_init(void)
 		mem_sz = msl->len;
 		munmap(msl->base_va, mem_sz);
 		msl->base_va = NULL;
+		msl->len = 0;
 		msl->heap = 0;
 
 		/* destroy backing fbarray */
-- 
2.34.1


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2025-01-22  3:14 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-12-26  8:10 [PATCH] Fix unused memseg length for memory issue Yang Ming
2025-01-01 12:34 ` Dmitry Kozlyuk
2025-01-02  6:59   ` Yang Ming
2025-01-02  8:58 ` [PATCH v2] eal: fix unused memseg length Yang Ming
2025-01-22  3:14   ` Yang Ming
2025-01-02  7:26 Yang Ming

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).