From: "Burakov, Anatoly" <anatoly.burakov@intel.com>
To: Jonas Pfefferle1 <JPF@zurich.ibm.com>
Cc: bruce.richardson@intel.com, chaozhu@linux.vnet.ibm.com, dev@dpdk.org
Subject: Re: [dpdk-dev] Huge mapping secondary process linux
Date: Fri, 27 Oct 2017 15:44:52 +0100 [thread overview]
Message-ID: <a6bfa126-8e7d-8e72-1dee-98ea4d0a2090@intel.com> (raw)
In-Reply-To: <OF071E97C0.CC8DFD8D-ONC12581C6.004E6E02-C12581C6.004F7897@notes.na.collabserv.com>
On 27-Oct-17 3:28 PM, Jonas Pfefferle1 wrote:
> "Burakov, Anatoly" <anatoly.burakov@intel.com> wrote on 10/27/2017
> 04:06:44 PM:
>
> > From: "Burakov, Anatoly" <anatoly.burakov@intel.com>
> > To: Jonas Pfefferle1 <JPF@zurich.ibm.com>, dev@dpdk.org
> > Cc: chaozhu@linux.vnet.ibm.com, bruce.richardson@intel.com
> > Date: 10/27/2017 04:06 PM
> > Subject: Re: [dpdk-dev] Huge mapping secondary process linux
> >
> > On 27-Oct-17 1:43 PM, Jonas Pfefferle1 wrote:
> > >
> > >
> > > Hi @all,
> > >
> > > I'm trying to make sense of the hugepage memory mappings in
> > > librte_eal/linuxapp/eal/eal_memory.c:
> > > * In rte_eal_hugepage_attach (line 1347) when we try to do a private
> > > mapping on /dev/zero (line 1393) why do we not use MAP_FIXED if we
> need the
> > > addresses to be identical with the primary process?
> > > * On POWER we have this weird business going on where we use
> MAP_HUGETLB
> > > because according to this commit:
> > >
> > > commit 284ae3e9ff9a92575c28c858efd2c85c8de6d440
> > > Author: Chao Zhu <chaozhu@linux.vnet.ibm.com>
> > > Date: Thu Apr 6 15:36:09 2017 +0530
> > >
> > > eal/ppc: fix mmap for memory initialization
> > >
> > > On IBM POWER platform, when mapping /dev/zero file to hugepage
> memory
> > > space, mmap will not respect the requested address hint. This will
> > > cause
> > > the memory initialization for the second process fails. This
> patch adds
> > > the required mmap flags to make it work. Beside this, users
> need to set
> > > the nr_overcommit_hugepages to expand the VA range. When
> > > doing the initialization, users need to set both nr_hugepages and
> > > nr_overcommit_hugepages to the same value, like 64, 128, etc.
> > >
> > > mmap address hints are not respected. Looking at the mmap code in the
> > > kernel this is not true entirely however under some circumstances
> the hint
> > > can be ignored (
> > > https://urldefense.proofpoint.com/v2/url?
> >
> u=http-3A__elixir.free-2Delectrons.com_linux_latest_source_arch_powerpc_mm_mmap.c-23L103&d=DwICaQ&c=jf_iaSHvJObTbx-
> > siA1ZOg&r=rOdXhRsgn8Iur7bDE0vgwvo6TC8OpoDN-
> > pXjigIjRW0&m=cttQcHlAYixhsYS3lz-
> > BAdEeg4dpbwGdPnj2R3I8Do0&s=Gp0TIjUtIed05Jgb7XnlocpCYZdFXZXiH0LqIWiNMhA&e=
> > > ). However I believe we can remove the extra case for PPC if we use
> > > MAP_FIXED when doing the secondary process mappings because we need
> them to
> > > be identical anyway. We could also use MAP_FIXED when doing the primary
> > > process mappings resp. get_virtual_area if we want to have any
> guarantees
> > > when specifying a base address. Any thoughts?
> > >
> > > Thanks,
> > > Jonas
> > >
> > hi Jonas,
> >
> > MAP_FIXED is not used because it's dangerous, it unmaps anything that is
> > already mapped into that space. We would rather know that we can't map
> > something than unwittingly unmap something that was mapped before.
>
> Ok, I see. Maybe we can add a check to the primary process's memory
> mappings whether the hint has been respected or not? At least warn if it
> hasn't.
Hi Jonas,
I'm unfamiliar with POWER platform, so i'm afraid you'd have to explain
a bit more what you mean by "hint has been respected" :)
--
Thanks,
Anatoly
next prev parent reply other threads:[~2017-10-27 14:44 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-10-27 12:43 Jonas Pfefferle1
2017-10-27 14:06 ` Burakov, Anatoly
2017-10-27 14:28 ` Jonas Pfefferle1
2017-10-27 14:44 ` Burakov, Anatoly [this message]
2017-10-27 14:58 ` Jonas Pfefferle1
2017-10-27 15:16 ` Jonas Pfefferle1
2017-10-27 16:00 ` Burakov, Anatoly
2017-10-27 19:22 ` Jonas Pfefferle1
2017-11-07 8:25 ` Chao Zhu
2017-11-07 10:15 ` Jonas Pfefferle1
2017-11-09 3:08 ` Chao Zhu
2017-11-09 9:54 ` Jonas Pfefferle1
2017-10-27 15:48 ` Tan, Jianfeng
2017-10-27 16:06 ` Burakov, Anatoly
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=a6bfa126-8e7d-8e72-1dee-98ea4d0a2090@intel.com \
--to=anatoly.burakov@intel.com \
--cc=JPF@zurich.ibm.com \
--cc=bruce.richardson@intel.com \
--cc=chaozhu@linux.vnet.ibm.com \
--cc=dev@dpdk.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).