From: Shahaf Shuler <shahafs@mellanox.com>
To: Shahaf Shuler <shahafs@mellanox.com>,
"dev@dpdk.org" <dev@dpdk.org>,
"anatoly.burakov@intel.com" <anatoly.burakov@intel.com>
Cc: Thomas Monjalon <thomas@monjalon.net>,
"stable@dpdk.org" <stable@dpdk.org>,
"alejandro.lucero@netronome.com" <alejandro.lucero@netronome.com>
Subject: Re: [dpdk-dev] [PATCH] mem: don't use address hint for mapping unless requested
Date: Wed, 27 Mar 2019 05:36:04 +0000
Message-ID: <AM0PR0502MB3795413E68ADE90A68C7DFD1C3580@AM0PR0502MB3795.eurprd05.prod.outlook.com>
Message-ID: <20190327053604.636YY7URbuRiMulVjk9Wrizc0R2aGGtxiYAxRdtHF7I@z>
In-Reply-To: <20190321202156.117496-1-shahafs@mellanox.com>
Anatoly, Alejandro,
Any comments on this one?
Thursday, March 21, 2019 10:22 PM, Shahaf Shuler:
> Subject: [dpdk-dev] [PATCH] mem: don't use address hint for mapping
> unless requested
>
> Patch [1] added an address hint as the starting address for 64-bit systems
> in case an explicit base virtual address was not set by the user.
>
> The justification for such a hint was to help devices that work in VA mode
> and have an address range limitation to work smoothly with the EAL memory
> subsystem.
>
> While the selected base address value may work fine for EAL initialization,
> it easily breaks when trying to register external memory using the
> rte_extmem_register API.
>
> Trying to register anonymous memory on an RH x86_64 machine took several
> minutes, during which the function eal_get_virtual_area repeatedly scanned
> for a good VA candidate.
>
> The attempt to guess which virtual address will be free for mapping will
> always result in non-portable, error-prone code:
> * Different applications may use different libraries along with DPDK. One
> can never guess which library was called first and how much virtual
> memory it consumed.
> * External memory can be registered at any time during the application
> run time.
>
> This patch removes the default address hint and uses the address returned
> by mmap.
> Devices with address limitations should suggest a proper base-virtaddr
> (EAL arg) to their users.
>
> Fixes: 1df21702873d ("mem: use address hint for mapping hugepages")
> Cc: stable@dpdk.org
> Cc: alejandro.lucero@netronome.com
>
> [1] commit 1df21702873d ("mem: use address hint for mapping hugepages")
>
> Signed-off-by: Shahaf Shuler <shahafs@mellanox.com>
> ---
> lib/librte_eal/common/eal_common_memory.c | 22 ----------------------
> 1 file changed, 22 deletions(-)
>
> diff --git a/lib/librte_eal/common/eal_common_memory.c b/lib/librte_eal/common/eal_common_memory.c
> index c9da69b164..09108f7a32 100644
> --- a/lib/librte_eal/common/eal_common_memory.c
> +++ b/lib/librte_eal/common/eal_common_memory.c
> @@ -39,23 +39,6 @@
> static void *next_baseaddr;
> static uint64_t system_page_sz;
>
> -#ifdef RTE_ARCH_64
> -/*
> - * Linux kernel uses a really high address as starting address for serving
> - * mmaps calls. If there exists addressing limitations and IOVA mode is VA,
> - * this starting address is likely too high for those devices. However, it
> - * is possible to use a lower address in the process virtual address space
> - * as with 64 bits there is a lot of available space.
> - *
> - * Current known limitations are 39 or 40 bits. Setting the starting address
> - * at 4GB implies there are 508GB or 1020GB for mapping the available
> - * hugepages. This is likely enough for most systems, although a device with
> - * addressing limitations should call rte_mem_check_dma_mask for ensuring all
> - * memory is within supported range.
> - */
> -static uint64_t baseaddr = 0x100000000;
> -#endif
> -
> void *
> eal_get_virtual_area(void *requested_addr, size_t *size,
> size_t page_sz, int flags, int mmap_flags)
> @@ -79,11 +62,6 @@ eal_get_virtual_area(void *requested_addr, size_t *size,
> rte_eal_process_type() == RTE_PROC_PRIMARY)
> next_baseaddr = (void *) internal_config.base_virtaddr;
>
> -#ifdef RTE_ARCH_64
> - if (next_baseaddr == NULL && internal_config.base_virtaddr == 0 &&
> - rte_eal_process_type() == RTE_PROC_PRIMARY)
> - next_baseaddr = (void *) baseaddr;
> -#endif
> if (requested_addr == NULL && next_baseaddr != NULL) {
> requested_addr = next_baseaddr;
> requested_addr = RTE_PTR_ALIGN(requested_addr, page_sz);
> --
> 2.12.0
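
To make the two technical points in the quoted patch concrete, here is a minimal sketch, not the EAL code. It assumes Linux and plain mmap(). The helper names headroom_gib and reserve_va are hypothetical and exist only for illustration. The first helper reproduces the 508GB / 1020GB arithmetic from the removed comment. The second shows that without MAP_FIXED the kernel treats the address passed to mmap() only as a suggestion, which is why a guessed base address can force the caller to keep retrying.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/mman.h>

/*
 * GiB of virtual address space left between 'base' and a device
 * addressing limit of 2^bits. Hypothetical helper, not a DPDK API.
 */
static uint64_t
headroom_gib(uint64_t base, unsigned int bits)
{
	assert(bits > 0 && bits < 64);
	uint64_t limit = UINT64_C(1) << bits;

	return limit > base ? (limit - base) >> 30 : 0;
}

/*
 * Reserve 'len' bytes of address space, optionally passing 'hint' to
 * mmap(). MAP_FIXED is deliberately not used, so the kernel is free to
 * pick another address. '*honored' is set to 1 only when a non-NULL
 * hint was given and the mapping landed exactly there.
 * Hypothetical helper, not a DPDK API. Returns NULL on failure.
 */
static void *
reserve_va(void *hint, size_t len, int *honored)
{
	void *addr = mmap(hint, len, PROT_NONE,
			  MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

	if (addr == MAP_FAILED)
		return NULL;
	if (honored != NULL)
		*honored = (hint != NULL && addr == hint);
	return addr;
}
```

With a 4GB base (0x100000000), headroom_gib() gives 508 for a 39-bit limit and 1020 for a 40-bit limit, matching the removed comment. Calling reserve_va() with a hint that is already occupied returns a different address with *honored set to 0. A caller that insists on a hinted address then has to unmap and try the next candidate, which is the kind of repeated scanning described above. Passing NULL simply takes whatever address mmap() returns.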