DPDK patches and discussions
* [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped
@ 2020-04-24 10:50 Li Feng
  2020-04-24 11:23 ` Burakov, Anatoly
                   ` (3 more replies)
  0 siblings, 4 replies; 11+ messages in thread
From: Li Feng @ 2020-04-24 10:50 UTC (permalink / raw)
  To: Anatoly Burakov
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand,
	bruce.richardson, Li Feng

Currently, even though memory is mapped with PROT_NONE, this does not
cause it to be excluded from core dumps. This is counter-productive,
because in a lot of cases, this memory will go unused (e.g. when the
memory subsystem preallocates VA space but hasn't yet mapped physical
pages into it).

Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
unmapped memory from being dumped.

Signed-off-by: Li Feng <fengli@smartx.com>
---
V2:
- add support for freebsd.
- when free_seg is called, mark the memory MADV_DONTDUMP.
- when alloc_seg is called, mark the memory MADV_DODUMP.

 lib/librte_eal/common/eal_common_memory.c | 26 ++++++++++++++++++++++++++
 lib/librte_eal/linux/eal_memalloc.c       |  6 ++++++
 2 files changed, 32 insertions(+)

diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
index cc7d54e0c..83be94a20 100644
--- a/lib/librte_eal/common/eal_common_memory.c
+++ b/lib/librte_eal/common/eal_common_memory.c
@@ -177,6 +177,32 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
 		after_len = RTE_PTR_DIFF(map_end, aligned_end);
 		if (after_len > 0)
 			munmap(aligned_end, after_len);
+
+		/*
+		 * Exclude this pages from a core dump.
+		 */
+#ifdef RTE_EXEC_ENV_LINUX
+		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+#elif RTE_EXEC_ENV_FREEBSD
+		if (madvise(aligned_addr, *size, MADV_NOCORE) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+#endif
+	} else {
+		/*
+		 * Exclude this pages from a core dump.
+		 */
+#ifdef RTE_EXEC_ENV_LINUX
+		if (madvise(mapped_addr, map_sz, MADV_DONTDUMP) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+#elif RTE_EXEC_ENV_FREEBSD
+		if (madvise(mapped_addr, map_sz, MADV_NOCORE) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+#endif
 	}
 
 	return aligned_addr;
diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
index af6d0d023..9d2a6fc6f 100644
--- a/lib/librte_eal/linux/eal_memalloc.c
+++ b/lib/librte_eal/linux/eal_memalloc.c
@@ -571,6 +571,9 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
 		goto resized;
 	}
 
+	if (madvise(addr, alloc_sz, MADV_DODUMP) != 0)
+		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
+
 	/* In linux, hugetlb limitations, like cgroup, are
 	 * enforced at fault time instead of mmap(), even
 	 * with the option of MAP_POPULATE. Kernel will send
@@ -687,6 +690,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
 		return -1;
 	}
 
+	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
+		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
+
 	exit_early = false;
 
 	/* if we're using anonymous hugepages, nothing to be done */
-- 
2.11.0


-- 
The SmartX email address is only for business purpose. Any sent message 
that is not related to the business is not authorized or permitted by 
SmartX.
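This mailbox is the work mailbox of Beijing Zhiling Haina Technology Co., Ltd. (SmartX). If an email sent from this mailbox is unrelated to work, it has not received any express or implied authorization from the company.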




* Re: [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 10:50 [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped Li Feng
@ 2020-04-24 11:23 ` Burakov, Anatoly
  2020-04-24 11:36   ` Bruce Richardson
  2020-04-24 12:37   ` Li Feng
  2020-04-24 12:39 ` [dpdk-dev] [PATCH v4] " Li Feng
                   ` (2 subsequent siblings)
  3 siblings, 2 replies; 11+ messages in thread
From: Burakov, Anatoly @ 2020-04-24 11:23 UTC (permalink / raw)
  To: Li Feng
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand, bruce.richardson

On 24-Apr-20 11:50 AM, Li Feng wrote:
> Currently, even though memory is mapped with PROT_NONE, this does not
> cause it to be excluded from core dumps. This is counter-productive,
> because in a lot of cases, this memory will go unused (e.g. when the
> memory subsystem preallocates VA space but hasn't yet mapped physical
> pages into it).
> 
> Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> unmapped memory from being dumped.
> 
> Signed-off-by: Li Feng <fengli@smartx.com>
> ---
> V2:
> - add support for freebsd.
> - when free_seg is called, mark the memory MADV_DONTDUMP.
> - when alloc_seg is called, mark the memory MADV_DODUMP.

Isn't this v3 now?

> 
>   lib/librte_eal/common/eal_common_memory.c | 26 ++++++++++++++++++++++++++
>   lib/librte_eal/linux/eal_memalloc.c       |  6 ++++++
>   2 files changed, 32 insertions(+)
> 
> diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> index cc7d54e0c..83be94a20 100644
> --- a/lib/librte_eal/common/eal_common_memory.c
> +++ b/lib/librte_eal/common/eal_common_memory.c
> @@ -177,6 +177,32 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
>   		after_len = RTE_PTR_DIFF(map_end, aligned_end);
>   		if (after_len > 0)
>   			munmap(aligned_end, after_len);
> +
> +		/*
> +		 * Exclude this pages from a core dump.
> +		 */
> +#ifdef RTE_EXEC_ENV_LINUX
> +		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +#elif RTE_EXEC_ENV_FREEBSD
> +		if (madvise(aligned_addr, *size, MADV_NOCORE) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +#endif
> +	} else {
> +		/*
> +		 * Exclude this pages from a core dump.
> +		 */
> +#ifdef RTE_EXEC_ENV_LINUX
> +		if (madvise(mapped_addr, map_sz, MADV_DONTDUMP) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +#elif RTE_EXEC_ENV_FREEBSD
> +		if (madvise(mapped_addr, map_sz, MADV_NOCORE) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +#endif
>   	}

DRY (Don't Repeat Yourself) :) Probably easier to do "if (!unmap) ..." 
than putting this into two places.
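
For illustration only, here is a minimal standalone sketch of the single call site this comment seems to ask for. It is not the EAL code: `sketch_reserve_va()` and its `advice` parameter are hypothetical names, and it assumes that `unmap` means the reservation is released again before the function returns. The per-OS flag is passed in by the caller so that the sketch needs no #ifdef.

```c
#ifndef _DEFAULT_SOURCE
#define _DEFAULT_SOURCE 1 /* expose MAP_ANONYMOUS and MADV_* on glibc */
#endif
#include <errno.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>

/*
 * Hypothetical helper, not part of the patch.
 * Reserve `size` bytes of address space with PROT_NONE. If `unmap` is true,
 * release the reservation again (the caller only probed for an address);
 * otherwise keep it and apply `advice` (e.g. MADV_DONTDUMP on Linux).
 * Returns the address, or NULL if mmap() failed.
 */
static void *
sketch_reserve_va(size_t size, bool unmap, int advice)
{
	void *addr = mmap(NULL, size, PROT_NONE,
			MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

	if (addr == MAP_FAILED)
		return NULL;

	if (unmap)
		munmap(addr, size);
	/* alignment trimming of a kept mapping would happen here */

	/* One call site, reached only when the mapping is kept. */
	if (!unmap) {
		if (madvise(addr, size, advice) != 0)
			fprintf(stderr, "madvise failed: %s\n",
				strerror(errno));
	}

	return addr;
}
```

A failed madvise() is only logged here, mirroring the patch, since the advice affects dump size and not correctness.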

>   
>   	return aligned_addr;
> diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
> index af6d0d023..9d2a6fc6f 100644
> --- a/lib/librte_eal/linux/eal_memalloc.c
> +++ b/lib/librte_eal/linux/eal_memalloc.c
> @@ -571,6 +571,9 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
>   		goto resized;
>   	}
>   
> +	if (madvise(addr, alloc_sz, MADV_DODUMP) != 0)
> +		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> +

See the discussion for v2; I believe this is unnecessary.

>   	/* In linux, hugetlb limitations, like cgroup, are
>   	 * enforced at fault time instead of mmap(), even
>   	 * with the option of MAP_POPULATE. Kernel will send
> @@ -687,6 +690,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
>   		return -1;
>   	}
>   
> +	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
> +		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> +
>   	exit_early = false;
>   
>   	/* if we're using anonymous hugepages, nothing to be done */
> 


-- 
Thanks,
Anatoly


* Re: [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 11:23 ` Burakov, Anatoly
@ 2020-04-24 11:36   ` Bruce Richardson
  2020-04-24 11:46     ` Burakov, Anatoly
  2020-04-24 12:37   ` Li Feng
  1 sibling, 1 reply; 11+ messages in thread
From: Bruce Richardson @ 2020-04-24 11:36 UTC (permalink / raw)
  To: Burakov, Anatoly
  Cc: Li Feng, dev, stable, kyle, lifeng1519, fanyang, david.marchand

On Fri, Apr 24, 2020 at 12:23:44PM +0100, Burakov, Anatoly wrote:
> On 24-Apr-20 11:50 AM, Li Feng wrote:
> > Currently, even though memory is mapped with PROT_NONE, this does not
> > cause it to be excluded from core dumps. This is counter-productive,
> > because in a lot of cases, this memory will go unused (e.g. when the
> > memory subsystem preallocates VA space but hasn't yet mapped physical
> > pages into it).
> > 
> > Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> > unmapped memory from being dumped.
> > 
> > Signed-off-by: Li Feng <fengli@smartx.com>
> > ---
> > V2:
> > - add support for freebsd.
> > - when free_seg is called, mark the memory MADV_DONTDUMP.
> > - when alloc_seg is called, mark the memory MADV_DODUMP.
> 
> Isn't this v3 now?
> 
> > 
> >   lib/librte_eal/common/eal_common_memory.c | 26 ++++++++++++++++++++++++++
> >   lib/librte_eal/linux/eal_memalloc.c       |  6 ++++++
> >   2 files changed, 32 insertions(+)
> > 
> > diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> > index cc7d54e0c..83be94a20 100644
> > --- a/lib/librte_eal/common/eal_common_memory.c
> > +++ b/lib/librte_eal/common/eal_common_memory.c
> > @@ -177,6 +177,32 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
> >   		after_len = RTE_PTR_DIFF(map_end, aligned_end);
> >   		if (after_len > 0)
> >   			munmap(aligned_end, after_len);
> > +
> > +		/*
> > +		 * Exclude this pages from a core dump.
> > +		 */
> > +#ifdef RTE_EXEC_ENV_LINUX
> > +		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
> > +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +				strerror(errno));
> > +#elif RTE_EXEC_ENV_FREEBSD
> > +		if (madvise(aligned_addr, *size, MADV_NOCORE) != 0)
> > +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +				strerror(errno));
> > +#endif
> > +	} else {
> > +		/*
> > +		 * Exclude this pages from a core dump.
> > +		 */
> > +#ifdef RTE_EXEC_ENV_LINUX
> > +		if (madvise(mapped_addr, map_sz, MADV_DONTDUMP) != 0)
> > +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +				strerror(errno));
> > +#elif RTE_EXEC_ENV_FREEBSD
> > +		if (madvise(mapped_addr, map_sz, MADV_NOCORE) != 0)
> > +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +				strerror(errno));
> > +#endif
> >   	}
> 
> DRY (Don't Repeat Yourself) :) Probably easier to do "if (!unmap) ..." than
> putting this into two places.
> 
Can I also suggest putting this at the top of the file:
#ifdef RTE_EXEC_ENV_LINUX
#define RTE_DONTDUMP MADV_DONTDUMP
#elif RTE_EXEC_ENV_FREEBSD
#define RTE_DONTDUMP MADV_NOCORE
#else
#error ....
#endif

and thereafter using RTE_DONTDUMP flag and avoiding any #ifdefs inline in
the code.
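
As a rough standalone sketch of that idea (an illustration, not the code that was merged): the flag is selected once near the top of the file and every later call uses the alias. The `SKETCH_DONTDUMP` and `sketch_exclude_from_dump()` names are hypothetical, and the sketch tests the compiler-predefined `__linux__` and `__FreeBSD__` macros instead of DPDK's RTE_EXEC_ENV_* so that it builds outside the DPDK tree.

```c
#ifndef _DEFAULT_SOURCE
#define _DEFAULT_SOURCE 1 /* expose MADV_* on glibc */
#endif
#include <errno.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>

/* Pick the per-OS advice value once; hypothetical alias name. */
#if defined(__linux__)
#define SKETCH_DONTDUMP MADV_DONTDUMP
#elif defined(__FreeBSD__)
#define SKETCH_DONTDUMP MADV_NOCORE
#else
#error "no known madvise() advice for excluding memory from core dumps"
#endif

/*
 * Hypothetical helper: ask the kernel to leave [addr, addr + len) out of
 * core dumps. Returns 0 on success and -1 if madvise() failed.
 */
static int
sketch_exclude_from_dump(void *addr, size_t len)
{
	if (madvise(addr, len, SKETCH_DONTDUMP) != 0) {
		fprintf(stderr, "madvise failed: %s\n", strerror(errno));
		return -1;
	}
	return 0;
}
```

With the alias in place, call sites contain no inline #ifdef, and an unsupported OS fails at compile time instead of silently skipping the call.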


* Re: [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 11:36   ` Bruce Richardson
@ 2020-04-24 11:46     ` Burakov, Anatoly
  0 siblings, 0 replies; 11+ messages in thread
From: Burakov, Anatoly @ 2020-04-24 11:46 UTC (permalink / raw)
  To: Bruce Richardson
  Cc: Li Feng, dev, stable, kyle, lifeng1519, fanyang, david.marchand

On 24-Apr-20 12:36 PM, Bruce Richardson wrote:
> On Fri, Apr 24, 2020 at 12:23:44PM +0100, Burakov, Anatoly wrote:
>> On 24-Apr-20 11:50 AM, Li Feng wrote:
>>> Currently, even though memory is mapped with PROT_NONE, this does not
>>> cause it to be excluded from core dumps. This is counter-productive,
>>> because in a lot of cases, this memory will go unused (e.g. when the
>>> memory subsystem preallocates VA space but hasn't yet mapped physical
>>> pages into it).
>>>
>>> Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
>>> unmapped memory from being dumped.
>>>
>>> Signed-off-by: Li Feng <fengli@smartx.com>
>>> ---
>>> V2:
>>> - add support for freebsd.
>>> - when free_seg is called, mark the memory MADV_DONTDUMP.
>>> - when alloc_seg is called, mark the memory MADV_DODUMP.
>>
>> Isn't this v3 now?
>>
>>>
>>>    lib/librte_eal/common/eal_common_memory.c | 26 ++++++++++++++++++++++++++
>>>    lib/librte_eal/linux/eal_memalloc.c       |  6 ++++++
>>>    2 files changed, 32 insertions(+)
>>>
>>> diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
>>> index cc7d54e0c..83be94a20 100644
>>> --- a/lib/librte_eal/common/eal_common_memory.c
>>> +++ b/lib/librte_eal/common/eal_common_memory.c
>>> @@ -177,6 +177,32 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
>>>    		after_len = RTE_PTR_DIFF(map_end, aligned_end);
>>>    		if (after_len > 0)
>>>    			munmap(aligned_end, after_len);
>>> +
>>> +		/*
>>> +		 * Exclude this pages from a core dump.
>>> +		 */
>>> +#ifdef RTE_EXEC_ENV_LINUX
>>> +		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
>>> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
>>> +				strerror(errno));
>>> +#elif RTE_EXEC_ENV_FREEBSD
>>> +		if (madvise(aligned_addr, *size, MADV_NOCORE) != 0)
>>> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
>>> +				strerror(errno));
>>> +#endif
>>> +	} else {
>>> +		/*
>>> +		 * Exclude this pages from a core dump.
>>> +		 */
>>> +#ifdef RTE_EXEC_ENV_LINUX
>>> +		if (madvise(mapped_addr, map_sz, MADV_DONTDUMP) != 0)
>>> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
>>> +				strerror(errno));
>>> +#elif RTE_EXEC_ENV_FREEBSD
>>> +		if (madvise(mapped_addr, map_sz, MADV_NOCORE) != 0)
>>> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
>>> +				strerror(errno));
>>> +#endif
>>>    	}
>>
>> DRY (Don't Repeat Yourself) :) Probably easier to do "if (!unmap) ..." than
>> putting this into two places.
>>
> Can I also suggest putting this at the top of the file:
> #ifdef RTE_EXEC_ENV_LINUX
> #define RTE_DONTDUMP MADV_DONTDUMP
> #elif RTE_EXEC_ENV_FREEBSD
> #define RTE_DONTDUMP MADV_NOCORE
> #else
> #error ....
> #endif
> 
> and thereafter using RTE_DONTDUMP flag and avoiding any #ifdefs inline in
> the code.
> 
+1

-- 
Thanks,
Anatoly


* Re: [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 11:23 ` Burakov, Anatoly
  2020-04-24 11:36   ` Bruce Richardson
@ 2020-04-24 12:37   ` Li Feng
  1 sibling, 0 replies; 11+ messages in thread
From: Li Feng @ 2020-04-24 12:37 UTC (permalink / raw)
  To: Burakov, Anatoly
  Cc: dev, stable, Kyle Zhang, Feng Li, Yang Fan, David Marchand,
	Bruce Richardson

Thanks,

Feng Li

Burakov, Anatoly <anatoly.burakov@intel.com> wrote on Fri, Apr 24, 2020 at 7:23 PM:
>
> On 24-Apr-20 11:50 AM, Li Feng wrote:
> > Currently, even though memory is mapped with PROT_NONE, this does not
> > cause it to be excluded from core dumps. This is counter-productive,
> > because in a lot of cases, this memory will go unused (e.g. when the
> > memory subsystem preallocates VA space but hasn't yet mapped physical
> > pages into it).
> >
> > Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> > unmapped memory from being dumped.
> >
> > Signed-off-by: Li Feng <fengli@smartx.com>
> > ---
> > V2:
> > - add support for freebsd.
> > - when free_seg is called, mark the memory MADV_DONTDUMP.
> > - when alloc_seg is called, mark the memory MADV_DODUMP.
>
> Isn't this v3 now?
My mistake. I will use v4 for the next version.
>
> >
> >   lib/librte_eal/common/eal_common_memory.c | 26 ++++++++++++++++++++++++++
> >   lib/librte_eal/linux/eal_memalloc.c       |  6 ++++++
> >   2 files changed, 32 insertions(+)
> >
> > diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> > index cc7d54e0c..83be94a20 100644
> > --- a/lib/librte_eal/common/eal_common_memory.c
> > +++ b/lib/librte_eal/common/eal_common_memory.c
> > @@ -177,6 +177,32 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
> >               after_len = RTE_PTR_DIFF(map_end, aligned_end);
> >               if (after_len > 0)
> >                       munmap(aligned_end, after_len);
> > +
> > +             /*
> > +              * Exclude this pages from a core dump.
> > +              */
> > +#ifdef RTE_EXEC_ENV_LINUX
> > +             if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
> > +                     RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +                             strerror(errno));
> > +#elif RTE_EXEC_ENV_FREEBSD
> > +             if (madvise(aligned_addr, *size, MADV_NOCORE) != 0)
> > +                     RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +                             strerror(errno));
> > +#endif
> > +     } else {
> > +             /*
> > +              * Exclude this pages from a core dump.
> > +              */
> > +#ifdef RTE_EXEC_ENV_LINUX
> > +             if (madvise(mapped_addr, map_sz, MADV_DONTDUMP) != 0)
> > +                     RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +                             strerror(errno));
> > +#elif RTE_EXEC_ENV_FREEBSD
> > +             if (madvise(mapped_addr, map_sz, MADV_NOCORE) != 0)
> > +                     RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> > +                             strerror(errno));
> > +#endif
> >       }
>
> DRY (Don't Repeat Yourself) :) Probably easier to do "if (!unmap) ..."
> than putting this into two places.

OK. I put this into two places, because the size is more accurate in this place.

>
> >
> >       return aligned_addr;
> > diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
> > index af6d0d023..9d2a6fc6f 100644
> > --- a/lib/librte_eal/linux/eal_memalloc.c
> > +++ b/lib/librte_eal/linux/eal_memalloc.c
> > @@ -571,6 +571,9 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
> >               goto resized;
> >       }
> >
> > +     if (madvise(addr, alloc_sz, MADV_DODUMP) != 0)
> > +             RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> > +
>
> See discussion for v2, i believe this is unnecessary.
>
> >       /* In linux, hugetlb limitations, like cgroup, are
> >        * enforced at fault time instead of mmap(), even
> >        * with the option of MAP_POPULATE. Kernel will send
> > @@ -687,6 +690,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
> >               return -1;
> >       }
> >
> > +     if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
> > +             RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> > +
> >       exit_early = false;
> >
> >       /* if we're using anonymous hugepages, nothing to be done */
> >
>
>
> --
> Thanks,
> Anatoly


* [dpdk-dev] [PATCH v4] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 10:50 [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped Li Feng
  2020-04-24 11:23 ` Burakov, Anatoly
@ 2020-04-24 12:39 ` Li Feng
  2020-04-24 12:53 ` [dpdk-dev] [PATCH v5] " Li Feng
  2020-04-24 13:19 ` [dpdk-dev] [PATCH v6] " Li Feng
  3 siblings, 0 replies; 11+ messages in thread
From: Li Feng @ 2020-04-24 12:39 UTC (permalink / raw)
  To: Anatoly Burakov
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand,
	bruce.richardson, Li Feng

Currently, even though memory is mapped with PROT_NONE, this does not
cause it to be excluded from core dumps. This is counter-productive,
because in a lot of cases, this memory will go unused (e.g. when the
memory subsystem preallocates VA space but hasn't yet mapped physical
pages into it).

Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
unmapped memory from being dumped.

Signed-off-by: Li Feng <fengli@smartx.com>
---
V4:
- simplify the code;
- remove the undo code in V3;

V3:
- add support for freebsd.
- when free_seg is called, mark the memory MADV_DONTDUMP.
- when alloc_seg is called, mark the memory MADV_DODUMP.

 lib/librte_eal/common/eal_common_memory.c | 17 +++++++++++++++++
 lib/librte_eal/linux/eal_memalloc.c       |  3 +++
 2 files changed, 20 insertions(+)

diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
index cc7d54e0c..2fea1bdb2 100644
--- a/lib/librte_eal/common/eal_common_memory.c
+++ b/lib/librte_eal/common/eal_common_memory.c
@@ -40,6 +40,14 @@
 static void *next_baseaddr;
 static uint64_t system_page_sz;
 
+#ifdef RTE_EXEC_ENV_LINUX
+#define RTE_DONTDUMP MADV_DONTDUMP
+#elif RTE_EXEC_ENV_FREEBSD
+#define RTE_DONTDUMP MADV_NOCORE
+#else
+#error "madvise doesn't support this OS"
+#endif
+
 #define MAX_MMAP_WITH_DEFINED_ADDR_TRIES 5
 void *
 eal_get_virtual_area(void *requested_addr, size_t *size,
@@ -179,6 +187,15 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
 			munmap(aligned_end, after_len);
 	}
 
+    if (!unmap) {
+		/*
+		 * Exclude this pages from a core dump.
+		 */
+		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+	}
+
 	return aligned_addr;
 }
 
diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
index af6d0d023..f628e0991 100644
--- a/lib/librte_eal/linux/eal_memalloc.c
+++ b/lib/librte_eal/linux/eal_memalloc.c
@@ -687,6 +687,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
 		return -1;
 	}
 
+	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
+		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
+
 	exit_early = false;
 
 	/* if we're using anonymous hugepages, nothing to be done */
-- 
2.11.0



* [dpdk-dev] [PATCH v5] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 10:50 [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped Li Feng
  2020-04-24 11:23 ` Burakov, Anatoly
  2020-04-24 12:39 ` [dpdk-dev] [PATCH v4] " Li Feng
@ 2020-04-24 12:53 ` Li Feng
  2020-04-24 13:02   ` Bruce Richardson
  2020-04-24 13:19 ` [dpdk-dev] [PATCH v6] " Li Feng
  3 siblings, 1 reply; 11+ messages in thread
From: Li Feng @ 2020-04-24 12:53 UTC (permalink / raw)
  To: Anatoly Burakov
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand,
	bruce.richardson, Li Feng

Currently, even though memory is mapped with PROT_NONE, this does not
cause it to be excluded from core dumps. This is counter-productive,
because in a lot of cases, this memory will go unused (e.g. when the
memory subsystem preallocates VA space but hasn't yet mapped physical
pages into it).

Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
unmapped memory from being dumped.

Signed-off-by: Li Feng <fengli@smartx.com>
---
v5:
- fix a code style.

V4:
- simplify the code;
- remove the undo code in V3;

V3:
- add support for freebsd.
- when free_seg is called, mark the memory MADV_DONTDUMP.
- when alloc_seg is called, mark the memory MADV_DODUMP.

 lib/librte_eal/common/eal_common_memory.c | 17 +++++++++++++++++
 lib/librte_eal/linux/eal_memalloc.c       |  3 +++
 2 files changed, 20 insertions(+)

diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
index cc7d54e0c..161e49825 100644
--- a/lib/librte_eal/common/eal_common_memory.c
+++ b/lib/librte_eal/common/eal_common_memory.c
@@ -40,6 +40,14 @@
 static void *next_baseaddr;
 static uint64_t system_page_sz;
 
+#ifdef RTE_EXEC_ENV_LINUX
+#define RTE_DONTDUMP MADV_DONTDUMP
+#elif RTE_EXEC_ENV_FREEBSD
+#define RTE_DONTDUMP MADV_NOCORE
+#else
+#error "madvise doesn't support this OS"
+#endif
+
 #define MAX_MMAP_WITH_DEFINED_ADDR_TRIES 5
 void *
 eal_get_virtual_area(void *requested_addr, size_t *size,
@@ -179,6 +187,15 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
 			munmap(aligned_end, after_len);
 	}
 
+	if (!unmap) {
+		/*
+		 * Exclude this pages from a core dump.
+		 */
+		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+	}
+
 	return aligned_addr;
 }
 
diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
index af6d0d023..f628e0991 100644
--- a/lib/librte_eal/linux/eal_memalloc.c
+++ b/lib/librte_eal/linux/eal_memalloc.c
@@ -687,6 +687,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
 		return -1;
 	}
 
+	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
+		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
+
 	exit_early = false;
 
 	/* if we're using anonymous hugepages, nothing to be done */
-- 
2.11.0



* Re: [dpdk-dev] [PATCH v5] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 12:53 ` [dpdk-dev] [PATCH v5] " Li Feng
@ 2020-04-24 13:02   ` Bruce Richardson
  0 siblings, 0 replies; 11+ messages in thread
From: Bruce Richardson @ 2020-04-24 13:02 UTC (permalink / raw)
  To: Li Feng
  Cc: Anatoly Burakov, dev, stable, kyle, lifeng1519, fanyang, david.marchand

On Fri, Apr 24, 2020 at 08:53:11PM +0800, Li Feng wrote:
> Currently, even though memory is mapped with PROT_NONE, this does not
> cause it to be excluded from core dumps. This is counter-productive,
> because in a lot of cases, this memory will go unused (e.g. when the
> memory subsystem preallocates VA space but hasn't yet mapped physical
> pages into it).
> 
> Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> unmapped memory from being dumped.
> 
> Signed-off-by: Li Feng <fengli@smartx.com>
> ---
> v5:
> - fix a code style.
> 
> V4:
> - simplify the code;
> - remove the undo code in V3;
> 
> V3:
> - add support for freebsd.
> - when free_seg is called, mark the memory MADV_DONTDUMP.
> - when alloc_seg is called, mark the memory MADV_DODUMP.
> 
>  lib/librte_eal/common/eal_common_memory.c | 17 +++++++++++++++++
>  lib/librte_eal/linux/eal_memalloc.c       |  3 +++
>  2 files changed, 20 insertions(+)
> 
> diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> index cc7d54e0c..161e49825 100644
> --- a/lib/librte_eal/common/eal_common_memory.c
> +++ b/lib/librte_eal/common/eal_common_memory.c
> @@ -40,6 +40,14 @@
>  static void *next_baseaddr;
>  static uint64_t system_page_sz;
>  
> +#ifdef RTE_EXEC_ENV_LINUX
> +#define RTE_DONTDUMP MADV_DONTDUMP
> +#elif RTE_EXEC_ENV_FREEBSD
> +#define RTE_DONTDUMP MADV_NOCORE
> +#else
> +#error "madvise doesn't support this OS"
> +#endif
> +

You forgot to use RTE_DONTDUMP in place of MADV_DONTDUMP below.

>  #define MAX_MMAP_WITH_DEFINED_ADDR_TRIES 5
>  void *
>  eal_get_virtual_area(void *requested_addr, size_t *size,
> @@ -179,6 +187,15 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
>  			munmap(aligned_end, after_len);
>  	}
>  
> +	if (!unmap) {
> +		/*
> +		 * Exclude this pages from a core dump.
> +		 */
> +		if (madvise(aligned_addr, *size, MADV_DONTDUMP) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +	}
> +
>  	return aligned_addr;
>  }
>  
> diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
> index af6d0d023..f628e0991 100644
> --- a/lib/librte_eal/linux/eal_memalloc.c
> +++ b/lib/librte_eal/linux/eal_memalloc.c
> @@ -687,6 +687,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
>  		return -1;
>  	}
>  
> +	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
> +		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> +
>  	exit_early = false;
>  
>  	/* if we're using anonymous hugepages, nothing to be done */
> -- 
> 2.11.0
> 
> 


* [dpdk-dev] [PATCH v6] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 10:50 [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped Li Feng
                   ` (2 preceding siblings ...)
  2020-04-24 12:53 ` [dpdk-dev] [PATCH v5] " Li Feng
@ 2020-04-24 13:19 ` Li Feng
  2020-04-24 15:20   ` Burakov, Anatoly
  2020-04-24 17:33   ` David Marchand
  3 siblings, 2 replies; 11+ messages in thread
From: Li Feng @ 2020-04-24 13:19 UTC (permalink / raw)
  To: Anatoly Burakov
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand,
	bruce.richardson, Li Feng

Currently, even though memory is mapped with PROT_NONE, this does not
cause it to be excluded from core dumps. This is counter-productive,
because in a lot of cases, this memory will go unused (e.g. when the
memory subsystem preallocates VA space but hasn't yet mapped physical
pages into it).

Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
unmapped memory from being dumped.

Signed-off-by: Li Feng <fengli@smartx.com>
---
v6:
- use RTE_DONTDUMP in right place.

v5:
- fix a code style.

V4:
- simplify the code;
- remove the undo code in V3;

V3:
- add support for freebsd.
- when free_seg is called, mark the memory MADV_DONTDUMP.
- when alloc_seg is called, mark the memory MADV_DODUMP.

 lib/librte_eal/common/eal_common_memory.c | 17 +++++++++++++++++
 lib/librte_eal/linux/eal_memalloc.c       |  3 +++
 2 files changed, 20 insertions(+)

diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
index cc7d54e0c..cd20672ce 100644
--- a/lib/librte_eal/common/eal_common_memory.c
+++ b/lib/librte_eal/common/eal_common_memory.c
@@ -40,6 +40,14 @@
 static void *next_baseaddr;
 static uint64_t system_page_sz;
 
+#ifdef RTE_EXEC_ENV_LINUX
+#define RTE_DONTDUMP MADV_DONTDUMP
+#elif RTE_EXEC_ENV_FREEBSD
+#define RTE_DONTDUMP MADV_NOCORE
+#else
+#error "madvise doesn't support this OS"
+#endif
+
 #define MAX_MMAP_WITH_DEFINED_ADDR_TRIES 5
 void *
 eal_get_virtual_area(void *requested_addr, size_t *size,
@@ -179,6 +187,15 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
 			munmap(aligned_end, after_len);
 	}
 
+	if (!unmap) {
+		/*
+		 * Exclude this pages from a core dump.
+		 */
+		if (madvise(aligned_addr, *size, RTE_DONTDUMP) != 0)
+			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
+				strerror(errno));
+	}
+
 	return aligned_addr;
 }
 
diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
index af6d0d023..f628e0991 100644
--- a/lib/librte_eal/linux/eal_memalloc.c
+++ b/lib/librte_eal/linux/eal_memalloc.c
@@ -687,6 +687,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
 		return -1;
 	}
 
+	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
+		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
+
 	exit_early = false;
 
 	/* if we're using anonymous hugepages, nothing to be done */
-- 
2.11.0



* Re: [dpdk-dev] [PATCH v6] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 13:19 ` [dpdk-dev] [PATCH v6] " Li Feng
@ 2020-04-24 15:20   ` Burakov, Anatoly
  2020-04-24 17:33   ` David Marchand
  1 sibling, 0 replies; 11+ messages in thread
From: Burakov, Anatoly @ 2020-04-24 15:20 UTC (permalink / raw)
  To: Li Feng
  Cc: dev, stable, kyle, lifeng1519, fanyang, david.marchand, bruce.richardson

On 24-Apr-20 2:19 PM, Li Feng wrote:
> Currently, even though memory is mapped with PROT_NONE, this does not
> cause it to be excluded from core dumps. This is counter-productive,
> because in a lot of cases, this memory will go unused (e.g. when the
> memory subsystem preallocates VA space but hasn't yet mapped physical
> pages into it).
> 
> Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> unmapped memory from being dumped.
> 
> Signed-off-by: Li Feng <fengli@smartx.com>
> ---
> v6:
> - use RTE_DONTDUMP in right place.
> 
> v5:
> - fix a code style.
> 
> V4:
> - simplify the code;
> - remove the undo code in V3;
> 
> V3:
> - add support for freebsd.
> - when free_seg is called, mark the memory MADV_DONTDUMP.
> - when alloc_seg is called, mark the memory MADV_DODUMP.
> 
>   lib/librte_eal/common/eal_common_memory.c | 17 +++++++++++++++++
>   lib/librte_eal/linux/eal_memalloc.c       |  3 +++
>   2 files changed, 20 insertions(+)
> 
> diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> index cc7d54e0c..cd20672ce 100644
> --- a/lib/librte_eal/common/eal_common_memory.c
> +++ b/lib/librte_eal/common/eal_common_memory.c
> @@ -40,6 +40,14 @@
>   static void *next_baseaddr;
>   static uint64_t system_page_sz;
>   
> +#ifdef RTE_EXEC_ENV_LINUX
> +#define RTE_DONTDUMP MADV_DONTDUMP
> +#elif RTE_EXEC_ENV_FREEBSD
> +#define RTE_DONTDUMP MADV_NOCORE
> +#else
> +#error "madvise doesn't support this OS"
> +#endif
> +
>   #define MAX_MMAP_WITH_DEFINED_ADDR_TRIES 5
>   void *
>   eal_get_virtual_area(void *requested_addr, size_t *size,
> @@ -179,6 +187,15 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
>   			munmap(aligned_end, after_len);
>   	}
>   
> +	if (!unmap) {
> +		/*
> +		 * Exclude these pages from a core dump.
> +		 */
> +		if (madvise(aligned_addr, *size, RTE_DONTDUMP) != 0)
> +			RTE_LOG(DEBUG, EAL, "madvise failed: %s\n",
> +				strerror(errno));
> +	}
> +
>   	return aligned_addr;
>   }
>   
> diff --git a/lib/librte_eal/linux/eal_memalloc.c b/lib/librte_eal/linux/eal_memalloc.c
> index af6d0d023..f628e0991 100644
> --- a/lib/librte_eal/linux/eal_memalloc.c
> +++ b/lib/librte_eal/linux/eal_memalloc.c
> @@ -687,6 +687,9 @@ free_seg(struct rte_memseg *ms, struct hugepage_info *hi,
>   		return -1;
>   	}
>   
> +	if (madvise(ms->addr, ms->len, MADV_DONTDUMP) != 0)
> +		RTE_LOG(DEBUG, EAL, "madvise failed: %s\n", strerror(errno));
> +
>   	exit_early = false;
>   
>   	/* if we're using anonymous hugepages, nothing to be done */
> 
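
The V3 changelog items quoted above (DONTDUMP on free_seg, DODUMP on
alloc_seg) describe an explicit toggle that V4 then removed; the v6 diff
keeps only the DONTDUMP call in free_seg. As a rough illustration of
what such a toggle looks like on Linux, here is a sketch with two
hypothetical helpers, seg_mark_nodump() and seg_mark_dumpable(). They are
not DPDK functions, and MADV_DODUMP is assumed to be available (it is a
Linux-specific flag).

```c
#define _DEFAULT_SOURCE
#include <stddef.h>
#include <sys/mman.h>

/* Hypothetical helper: exclude [addr, addr + len) from core dumps,
 * e.g. after a segment has been freed. Returns 0 on success, -1 with
 * errno set on failure, exactly as madvise() does. */
static int
seg_mark_nodump(void *addr, size_t len)
{
	return madvise(addr, len, MADV_DONTDUMP);
}

/* Hypothetical helper: undo the effect of MADV_DONTDUMP so the range
 * is written to core dumps again, e.g. once a segment is allocated. */
static int
seg_mark_dumpable(void *addr, size_t len)
{
	return madvise(addr, len, MADV_DODUMP);
}
```

Both helpers expect a page-aligned address, as madvise() itself does.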

Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>

-- 
Thanks,
Anatoly


* Re: [dpdk-dev] [PATCH v6] eal: use madvise to exclude unmapped memory from being dumped
  2020-04-24 13:19 ` [dpdk-dev] [PATCH v6] " Li Feng
  2020-04-24 15:20   ` Burakov, Anatoly
@ 2020-04-24 17:33   ` David Marchand
  1 sibling, 0 replies; 11+ messages in thread
From: David Marchand @ 2020-04-24 17:33 UTC (permalink / raw)
  To: Li Feng
  Cc: Anatoly Burakov, dev, dpdk stable, Kyle Zhang, Feng Li, fanyang,
	Bruce Richardson

On Fri, Apr 24, 2020 at 3:19 PM Li Feng <fengli@smartx.com> wrote:
>
> Currently, even though memory is mapped with PROT_NONE, this does not
> cause it to be excluded from core dumps. This is counter-productive,
> because in a lot of cases, this memory will go unused (e.g. when the
> memory subsystem preallocates VA space but hasn't yet mapped physical
> pages into it).
>
> Use `madvise()` call with MADV_DONTDUMP parameter to exclude the
> unmapped memory from being dumped.
>
> Signed-off-by: Li Feng <fengli@smartx.com>

Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>

Applied, thanks.


-- 
David Marchand



Thread overview: 11+ messages
2020-04-24 10:50 [dpdk-dev] [PATCH v2] eal: use madvise to exclude unmapped memory from being dumped Li Feng
2020-04-24 11:23 ` Burakov, Anatoly
2020-04-24 11:36   ` Bruce Richardson
2020-04-24 11:46     ` Burakov, Anatoly
2020-04-24 12:37   ` Li Feng
2020-04-24 12:39 ` [dpdk-dev] [PATCH v4] " Li Feng
2020-04-24 12:53 ` [dpdk-dev] [PATCH v5] " Li Feng
2020-04-24 13:02   ` Bruce Richardson
2020-04-24 13:19 ` [dpdk-dev] [PATCH v6] " Li Feng
2020-04-24 15:20   ` Burakov, Anatoly
2020-04-24 17:33   ` David Marchand
