* [PATCH v2] eal/linux: enable the hugepage mem dump
       [not found] <20220308094125.2716847-1-fengli@smartx.comOD>
@ 2022-04-01  9:10 ` Li Feng
  2022-04-05 22:46   ` Stephen Hemminger
  2024-10-02 16:46   ` Stephen Hemminger
  0 siblings, 2 replies; 6+ messages in thread
From: Li Feng @ 2022-04-01  9:10 UTC
  To: Anatoly Burakov; +Cc: dev, Li Feng

These hugepages include important structures. We should dump these
hugepages into a coredump file for debugging when generating a coredump.

Signed-off-by: Li Feng <fengli@smartx.com>
---
 lib/eal/linux/eal_memalloc.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/lib/eal/linux/eal_memalloc.c b/lib/eal/linux/eal_memalloc.c
index f8b1588cae..93c4f396cf 100644
--- a/lib/eal/linux/eal_memalloc.c
+++ b/lib/eal/linux/eal_memalloc.c
@@ -677,6 +677,8 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
 			__func__);
 #endif
 
+	eal_mem_set_dump(addr, alloc_sz, true);
+
 	huge_recover_sigbus();
 
 	ms->addr = addr;
-- 
2.35.1
* Re: [PATCH v2] eal/linux: enable the hugepage mem dump
  2022-04-01  9:10 ` [PATCH v2] eal/linux: enable the hugepage mem dump Li Feng
@ 2022-04-05 22:46   ` Stephen Hemminger
  2022-04-05 23:14     ` Dmitry Kozlyuk
  2022-04-06  2:11     ` Li Feng
  2024-10-02 16:46   ` Stephen Hemminger
  1 sibling, 2 replies; 6+ messages in thread
From: Stephen Hemminger @ 2022-04-05 22:46 UTC
  To: Li Feng; +Cc: Anatoly Burakov, dev

On Fri, 1 Apr 2022 17:10:04 +0800
Li Feng <fengli@smartx.com> wrote:

> These hugepages include important structures. We should dump these
> hugepages into a coredump file for debugging when generating a coredump.
>
> Signed-off-by: Li Feng <fengli@smartx.com>
> ---
>  lib/eal/linux/eal_memalloc.c | 2 ++
>  1 file changed, 2 insertions(+)
>
> diff --git a/lib/eal/linux/eal_memalloc.c b/lib/eal/linux/eal_memalloc.c
> index f8b1588cae..93c4f396cf 100644
> --- a/lib/eal/linux/eal_memalloc.c
> +++ b/lib/eal/linux/eal_memalloc.c
> @@ -677,6 +677,8 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
>  			__func__);
>  #endif
>
> +	eal_mem_set_dump(addr, alloc_sz, true);
> +
>  	huge_recover_sigbus();
>
>  	ms->addr = addr;

Please don't merge this patch as is; it would cause a lot of pain
in a cloud environment.

In our environment core dumps are collected (via systemd) and uploaded
to a central server. With this kind of change the processing would get
overloaded by multi-gigabyte core dumps. We probably couldn't even
save a core dump on these kinds of smart NICs.

This needs to be optional (enabled from the command line) and default to
the current behavior (not dumping huge pages).
* Re: [PATCH v2] eal/linux: enable the hugepage mem dump
  2022-04-05 22:46   ` Stephen Hemminger
@ 2022-04-05 23:14     ` Dmitry Kozlyuk
  2023-07-05 23:19       ` Stephen Hemminger
  2022-04-06  2:11     ` Li Feng
  1 sibling, 1 reply; 6+ messages in thread
From: Dmitry Kozlyuk @ 2022-04-05 23:14 UTC
  To: Stephen Hemminger; +Cc: Li Feng, Anatoly Burakov, dev

2022-04-05 15:46 (UTC-0700), Stephen Hemminger:
> On Fri, 1 Apr 2022 17:10:04 +0800
> Li Feng <fengli@smartx.com> wrote:
>
> > These hugepages include important structures. We should dump these
> > hugepages into a coredump file for debugging when generating a coredump.
> >
> > Signed-off-by: Li Feng <fengli@smartx.com>
> > ---
> >  lib/eal/linux/eal_memalloc.c | 2 ++
> >  1 file changed, 2 insertions(+)
> >
> > diff --git a/lib/eal/linux/eal_memalloc.c b/lib/eal/linux/eal_memalloc.c
> > index f8b1588cae..93c4f396cf 100644
> > --- a/lib/eal/linux/eal_memalloc.c
> > +++ b/lib/eal/linux/eal_memalloc.c
> > @@ -677,6 +677,8 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
> >  			__func__);
> >  #endif
> >
> > +	eal_mem_set_dump(addr, alloc_sz, true);
> > +
> >  	huge_recover_sigbus();
> >
> >  	ms->addr = addr;
>
>
> Please don't merge this patch as is; it would cause a lot of pain
> in a cloud environment.
>
> In our environment core dumps are collected (via systemd) and uploaded
> to a central server. With this kind of change the processing would get
> overloaded by multi-gigabyte core dumps. We probably couldn't even
> save a core dump on these kinds of smart NICs.
>
> This needs to be optional (enabled from the command line) and default to
> the current behavior (not dumping huge pages).

Maybe expose eal_mem_set_dump() as rte_mem_set_dump()?
This would allow the feature to be implemented easily using memory callbacks.
Better still, one could enable dumping of hugepages selectively:
for example, dump some interesting hash tables but skip rings and mempools.
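The per-region toggle suggested above can be illustrated outside DPDK. This is only a minimal sketch, assuming that such a toggle comes down to madvise(2) with MADV_DODUMP / MADV_DONTDUMP on Linux. The names region_alloc() and region_set_dump() are hypothetical and are not DPDK API, and an ordinary anonymous mapping stands in for a hugepage segment.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <sys/mman.h>

/*
 * Hypothetical helper (not DPDK API): map an anonymous private region
 * that stands in for a hugepage-backed segment in this sketch.
 * Returns NULL on failure.
 */
void *
region_alloc(size_t len)
{
    void *p = mmap(NULL, len, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

    return p == MAP_FAILED ? NULL : p;
}

/*
 * Hypothetical helper (not DPDK API): include (dump == true) or
 * exclude (dump == false) a page-aligned range from core dumps.
 * Returns 0 on success, -1 with errno set on failure, as madvise(2) does.
 */
int
region_set_dump(void *addr, size_t len, bool dump)
{
    assert(len > 0);
    return madvise(addr, len, dump ? MADV_DODUMP : MADV_DONTDUMP);
}
```

An application could call region_set_dump(table, table_len, true) for a region it wants to see in a core dump and leave the rest excluded. Whether the marked pages actually reach the dump file still depends on the process's coredump_filter.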
* Re: [PATCH v2] eal/linux: enable the hugepage mem dump
  2022-04-05 23:14     ` Dmitry Kozlyuk
@ 2023-07-05 23:19       ` Stephen Hemminger
  0 siblings, 0 replies; 6+ messages in thread
From: Stephen Hemminger @ 2023-07-05 23:19 UTC
  To: Dmitry Kozlyuk; +Cc: Li Feng, Anatoly Burakov, dev

On Wed, 6 Apr 2022 02:14:46 +0300
Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> wrote:

> >
> > Please don't merge this patch as is; it would cause a lot of pain
> > in a cloud environment.
> >
> > In our environment core dumps are collected (via systemd) and uploaded
> > to a central server. With this kind of change the processing would get
> > overloaded by multi-gigabyte core dumps. We probably couldn't even
> > save a core dump on these kinds of smart NICs.
> >
> > This needs to be optional (enabled from the command line) and default to
> > the current behavior (not dumping huge pages).
>
> Maybe expose eal_mem_set_dump() as rte_mem_set_dump()?
> This would allow the feature to be implemented easily using memory callbacks.
> Better still, one could enable dumping of hugepages selectively:
> for example, dump some interesting hash tables but skip rings and mempools.

As was mentioned in the thread, coredump_filter also controls these,
so this won't impact users who do not enable it.

Since the granularity is at the page level, it doesn't make sense to
try to be selective for hash tables, rings, mempools, etc.

Looks good as is, though it might need a rebase.

Acked-by: Stephen Hemminger <stephen@networkplumber.org>
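The page-granularity point can be made concrete with a little arithmetic. The following is a sketch only: struct span and span_round_to_pages() are hypothetical helpers, and the 2 MB hugepage size used in the note below is an assumption for the example. It computes the page-aligned range that would actually be affected when marking a single object for dumping.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical type: a page-aligned address range. */
struct span {
    uintptr_t start;
    size_t len;
};

/*
 * Expand the object range [addr, addr + len) outward to page_sz
 * boundaries. page_sz must be a power of two.
 */
struct span
span_round_to_pages(uintptr_t addr, size_t len, size_t page_sz)
{
    assert(page_sz != 0 && (page_sz & (page_sz - 1)) == 0);

    uintptr_t mask = (uintptr_t)page_sz - 1;
    uintptr_t start = addr & ~mask;                 /* round down */
    uintptr_t end = (addr + len + mask) & ~mask;    /* round up */
    struct span s = { start, (size_t)(end - start) };

    return s;
}
```

With 2 MB pages, a 2 KB object at address 0x601000 expands to the whole page starting at 0x600000, so everything else that shares that hugepage is dumped along with it.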
* Re: [PATCH v2] eal/linux: enable the hugepage mem dump
  2022-04-05 22:46   ` Stephen Hemminger
  2022-04-05 23:14     ` Dmitry Kozlyuk
@ 2022-04-06  2:11     ` Li Feng
  1 sibling, 0 replies; 6+ messages in thread
From: Li Feng @ 2022-04-06  2:11 UTC
  To: Stephen Hemminger; +Cc: Anatoly Burakov, dev

On Wed, Apr 6, 2022 at 6:46 AM Stephen Hemminger
<stephen@networkplumber.org> wrote:
>
> On Fri, 1 Apr 2022 17:10:04 +0800
> Li Feng <fengli@smartx.com> wrote:
>
> > These hugepages include important structures. We should dump these
> > hugepages into a coredump file for debugging when generating a coredump.
> >
> > Signed-off-by: Li Feng <fengli@smartx.com>
> > ---
> >  lib/eal/linux/eal_memalloc.c | 2 ++
> >  1 file changed, 2 insertions(+)
> >
> > diff --git a/lib/eal/linux/eal_memalloc.c b/lib/eal/linux/eal_memalloc.c
> > index f8b1588cae..93c4f396cf 100644
> > --- a/lib/eal/linux/eal_memalloc.c
> > +++ b/lib/eal/linux/eal_memalloc.c
> > @@ -677,6 +677,8 @@ alloc_seg(struct rte_memseg *ms, void *addr, int socket_id,
> >  			__func__);
> >  #endif
> >
> > +	eal_mem_set_dump(addr, alloc_sz, true);
> > +
> >  	huge_recover_sigbus();
> >
> >  	ms->addr = addr;
>
>
> Please don't merge this patch as is; it would cause a lot of pain
> in a cloud environment.
>
> In our environment core dumps are collected (via systemd) and uploaded
> to a central server. With this kind of change the processing would get
> overloaded by multi-gigabyte core dumps. We probably couldn't even
> save a core dump on these kinds of smart NICs.
>
> This needs to be optional (enabled from the command line) and default to
> the current behavior (not dumping huge pages).

On Linux, this patch alone does not make the coredump include these
hugepages, because they are shared. To include them, one must also write
0x73 to /proc/self/coredump_filter.
This is the coredump_filter explanation:

Since kernel 2.6.23, the Linux-specific /proc/[pid]/coredump_filter
file can be used to control which memory segments are written to the
core dump file in the event that a core dump is performed for the
process with the corresponding process ID.

The value in the file is a bit mask of memory mapping types (see
mmap(2)). If a bit is set in the mask, then memory mappings of the
corresponding type are dumped; otherwise they are not dumped. The
bits in this file have the following meanings:

    bit 0  Dump anonymous private mappings.
    bit 1  Dump anonymous shared mappings.
    bit 2  Dump file-backed private mappings.
    bit 3  Dump file-backed shared mappings.
    bit 4  (since Linux 2.6.24) Dump ELF headers.
    bit 5  (since Linux 2.6.28) Dump private huge pages.
    bit 6  (since Linux 2.6.28) Dump shared huge pages.
    bit 7  (since Linux 4.4) Dump private DAX pages.
    bit 8  (since Linux 4.4) Dump shared DAX pages.

By default, the following bits are set: 0, 1, 4 (if the
CONFIG_CORE_DUMP_DEFAULT_ELF_HEADERS kernel configuration option is
enabled), and 5. This default can be modified at boot time using the
coredump_filter boot option.
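The bit mask described above can be sketched in C. This is an illustrative sketch rather than part of the patch: the enum and the filter_* helper names are hypothetical, and the bit positions are taken from the quoted description. Setting bit 6 on top of the default bits 0, 1, 4 and 5 (0x33) gives the 0x73 value mentioned earlier.

```c
#include <assert.h>
#include <stdio.h>

/* Bit positions as listed in the coredump_filter description. */
enum {
    COREDUMP_BIT_ANON_PRIVATE = 0,
    COREDUMP_BIT_ANON_SHARED  = 1,
    COREDUMP_BIT_FILE_PRIVATE = 2,
    COREDUMP_BIT_FILE_SHARED  = 3,
    COREDUMP_BIT_ELF_HEADERS  = 4,
    COREDUMP_BIT_HUGE_PRIVATE = 5,
    COREDUMP_BIT_HUGE_SHARED  = 6,
    COREDUMP_BIT_DAX_PRIVATE  = 7,
    COREDUMP_BIT_DAX_SHARED   = 8,
};

/* Return mask with the given filter bit set (hypothetical helper). */
unsigned int
filter_set_bit(unsigned int mask, unsigned int bit)
{
    assert(bit <= COREDUMP_BIT_DAX_SHARED);
    return mask | (1u << bit);
}

/* Read the calling process's filter. Returns 0 on success, -1 on error. */
int
filter_read(unsigned int *mask)
{
    FILE *f = fopen("/proc/self/coredump_filter", "r");
    int ok;

    if (f == NULL)
        return -1;
    ok = fscanf(f, "%x", mask) == 1;    /* the kernel prints hex */
    fclose(f);
    return ok ? 0 : -1;
}

/* Write a new filter value. Returns 0 on success, -1 on error. */
int
filter_write(unsigned int mask)
{
    FILE *f = fopen("/proc/self/coredump_filter", "w");
    int ok;

    if (f == NULL)
        return -1;
    ok = fprintf(f, "0x%x\n", mask) > 0;
    if (fclose(f) != 0)
        ok = 0;
    return ok ? 0 : -1;
}

/* Enable dumping of shared huge pages, keeping all other bits as they are. */
int
filter_enable_shared_hugepages(void)
{
    unsigned int mask;

    if (filter_read(&mask) != 0)
        return -1;
    return filter_write(filter_set_bit(mask, COREDUMP_BIT_HUGE_SHARED));
}
```

Reading the current value and setting one bit, rather than writing a fixed 0x73, preserves whatever filter an administrator has already configured for the process.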
* Re: [PATCH v2] eal/linux: enable the hugepage mem dump
  2022-04-01  9:10 ` [PATCH v2] eal/linux: enable the hugepage mem dump Li Feng
  2022-04-05 22:46   ` Stephen Hemminger
@ 2024-10-02 16:46   ` Stephen Hemminger
  1 sibling, 0 replies; 6+ messages in thread
From: Stephen Hemminger @ 2024-10-02 16:46 UTC
  To: Li Feng; +Cc: Anatoly Burakov, dev

On Fri, 1 Apr 2022 17:10:04 +0800
Li Feng <fengli@smartx.com> wrote:

> These hugepages include important structures. We should dump these
> hugepages into a coredump file for debugging when generating a coredump.
>
> Signed-off-by: Li Feng <fengli@smartx.com>

This needs to be a config option to avoid surprising existing users.