DPDK patches and discussions
From: "Nélio Laranjeiro" <nelio.laranjeiro@6wind.com>
To: Xueming Li <xuemingl@mellanox.com>
Cc: Shahaf Shuler <shahafs@mellanox.com>, dev@dpdk.org
Subject: Re: [dpdk-dev] [PATCH] net/mlx5: remmap UAR address for multiple process
Date: Mon, 22 Jan 2018 15:53:21 +0100
Message-ID: <20180122145321.jpyepyvjjlktillp@laranjeiro-vm.dev.6wind.com>
In-Reply-To: <20180119150854.89828-1-xuemingl@mellanox.com>

Hi Xueming,

On Fri, Jan 19, 2018 at 11:08:54PM +0800, Xueming Li wrote:
> UAR(doorbell) is hw resources that have to be same address between
> primary and secondary process, failed to mmap UAR will make TX packets
> invisible to HW.
> Today, UAR address returned from verbs api is mixed in heap and loaded
> library address space, prone to be occupied in secondary process.
> This patch reserves a dedicate UAR address space, both primary and
> secondary process re-mmap UAR pages into this space.
> Below is a brief picture of dpdk app address space allocation:
> 	Before			This patch
> 	------			----------
> 	[stack]			[stack]
> 	[.so, uar, heap]	[.so, heap]
> 	[(empty)]		[(empty)]
> 	[hugepage]		[hugepage]
> 	[? others]		[? others]
> 	[(empty)]		[(empty)]
> 				[uar]
> 				[(empty)]
> To minimize conflicts, UAR address space comes after hugepage space with
> an offset to skip potential usage from other drivers.

This does not seem to be the case when the memory is contiguous,
according to what I see in /proc/<pid>/maps for my testpmd:

 PMD: mlx5.c:523: mlx5_uar_init_primary(): Reserved UAR address space: 0x0x7f4da5800000

And the first huge page is at address 0x7f4fa5800000, so the new UAR
space is before the huge pages, not after them.

With this patch applied, I still get the layout described as "Before".

> Once UAR space reserved successfully, UAR pages are re-mmapped into new
> area to keep UAR address aligned between primary and secondary process.
> 
> Signed-off-by: Xueming Li <xuemingl@mellanox.com>
> ---
>  drivers/net/mlx5/mlx5.c         | 107 ++++++++++++++++++++++++++++++++++++++++
>  drivers/net/mlx5/mlx5.h         |   1 +
>  drivers/net/mlx5/mlx5_defs.h    |  10 ++++
>  drivers/net/mlx5/mlx5_rxtx.h    |   3 +-
>  drivers/net/mlx5/mlx5_trigger.c |   7 ++-
>  drivers/net/mlx5/mlx5_txq.c     |  51 +++++++++++++------
>  6 files changed, 163 insertions(+), 16 deletions(-)
> 
> diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c
> index fc2d59fee..1539ef608 100644
> --- a/drivers/net/mlx5/mlx5.c
> +++ b/drivers/net/mlx5/mlx5.c
> @@ -39,6 +39,7 @@
>  #include <stdlib.h>
>  #include <errno.h>
>  #include <net/if.h>
> +#include <sys/mman.h>
>  
>  /* Verbs header. */
>  /* ISO C doesn't support unnamed structs/unions, disabling -pedantic. */
> @@ -56,6 +57,7 @@
>  #include <rte_pci.h>
>  #include <rte_bus_pci.h>
>  #include <rte_common.h>
> +#include <rte_eal_memconfig.h>
>  #include <rte_kvargs.h>
>  
>  #include "mlx5.h"
> @@ -466,6 +468,101 @@ mlx5_args(struct mlx5_dev_config *config, struct rte_devargs *devargs)
>  
>  static struct rte_pci_driver mlx5_driver;
>  
> +/*
> + * Reserved UAR address space for TXQ UAR(hw doorbell) mapping, process
> + * local resource used by both primary and secondary to avoid duplicate
> + * reservation.
> + * The space has to be available on both primary and secondary process,
> + * TXQ UAR maps to this area using fixed mmap w/o double check.
> + */
> +static void *uar_base;
> +
> +/**
> + * Reserve UAR address space for primary process
> + *
> + * @param[in] priv
> + *   Pointer to private structure.
> + *
> + * @return
> + *   0 on success, negative errno value on failure.
> + */
> +static int
> +mlx5_uar_init_primary(struct priv *priv)
> +{
> +	void *addr = (void *)0;
> +	int i;
> +	const struct rte_mem_config *mcfg;
> +
> +	if (uar_base) { /* UAR address space mapped */
> +		priv->uar_base = uar_base;
> +		return 0;
> +	}
> +	/* find out lower bound of hugepage segments */
> +	mcfg = rte_eal_get_configuration()->mem_config;
> +	for (i = 0; i < RTE_MAX_MEMSEG && mcfg->memseg[i].addr; i++) {
> +		if (addr)
> +			addr = RTE_MIN(addr, mcfg->memseg[i].addr);
> +		else
> +			addr = mcfg->memseg[i].addr;

This if/else is useless as addr is already initialised with the smallest
possible value.

> +	}
> +	/* offset down UAR area */
> +	addr = RTE_PTR_SUB(addr, MLX5_UAR_OFFSET + MLX5_UAR_SIZE);

The error seems to be here: the loop finds the memseg with the lowest
address and then subtracts the UAR offset and size from it, so addr
cannot end up after the huge pages unless this subtraction wraps around.
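
To make the point concrete, here is a minimal sketch of the two
placements. It is not the driver code: struct seg is a hypothetical
stand-in for the memseg entries, and hint_below()/hint_after() are names
I made up for illustration. The gap is passed in as a parameter because
I am not assuming anything about the values in mlx5_defs.h; in my log
the two addresses simply differ by 0x200000000.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a memseg entry: only the fields used here. */
struct seg {
	uint64_t addr; /* start of the segment, 0 terminates the list */
	uint64_t len;  /* length of the segment in bytes */
};

/* Lowest start address of all segments, 0 if the list is empty. */
static uint64_t
seg_lowest(const struct seg *segs, size_t n)
{
	uint64_t low = UINT64_MAX;
	size_t i;

	for (i = 0; i < n && segs[i].addr; i++)
		if (segs[i].addr < low)
			low = segs[i].addr;
	return low == UINT64_MAX ? 0 : low;
}

/* Highest end address (addr + len) of all segments, 0 if the list is empty. */
static uint64_t
seg_highest_end(const struct seg *segs, size_t n)
{
	uint64_t high = 0;
	size_t i;

	for (i = 0; i < n && segs[i].addr; i++)
		if (segs[i].addr + segs[i].len > high)
			high = segs[i].addr + segs[i].len;
	return high;
}

/* What the patch computes: a hint located below the lowest segment. */
static uint64_t
hint_below(const struct seg *segs, size_t n, uint64_t gap)
{
	return seg_lowest(segs, n) - gap;
}

/* What the commit log describes: a hint located after the last segment. */
static uint64_t
hint_after(const struct seg *segs, size_t n, uint64_t gap)
{
	return seg_highest_end(segs, n) + gap;
}
```

With the lowest segment at 0x7f4fa5800000 and a gap of 0x200000000,
hint_below() gives 0x7f4da5800000, which is the address in my log.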

> +	/* anonymous mmap, no real memory consumption */
> +	addr = mmap(addr, MLX5_UAR_SIZE,
> +		    PROT_NONE, MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
> +	if (addr == MAP_FAILED) {
> +		ERROR("Failed to reserve UAR address space, please adjust "
> +		      "MLX5_UAR_SIZE or try --base-virtaddr");

How does a user know how much UAR memory space the NIC needs in order to
adjust MLX5_UAR_SIZE?
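
On the reservation itself: without MAP_FIXED, the address passed to
mmap() is only a hint, so the kernel may return a different range. A
minimal sketch of a reservation that reports when this happens is below.
reserve_va() and its "moved" output are hypothetical names for
illustration, not something the patch or the driver provides.

```c
#define _DEFAULT_SOURCE /* for MAP_ANONYMOUS under strict -std= modes */
#include <stddef.h>
#include <sys/mman.h>

/*
 * Reserve len bytes of address space without backing memory.
 * Returns the reserved address, or NULL on failure.
 * If moved is not NULL, it is set to 1 when a non-NULL hint was
 * given but the kernel picked another address, 0 otherwise.
 */
static void *
reserve_va(void *hint, size_t len, int *moved)
{
	void *addr;

	/* PROT_NONE + anonymous mapping: address space only, no access. */
	addr = mmap(hint, len, PROT_NONE,
		    MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
	if (addr == MAP_FAILED)
		return NULL;
	if (moved)
		*moved = (hint != NULL && addr != hint);
	return addr;
}
```

Logging the "moved" case separately would at least tell the user that
the requested range was occupied.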

> +		return -ENOMEM;
> +	}
> +	/* Accept either same addr or a new addr returned from mmap if target
> +	 * range occupied.
> +	 */
> +	INFO("Reserved UAR address space: 0x%p", addr);

'%p' already prefixes the address with 0x (at least with glibc), which
is why the log above shows 0x0x.
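
A small sketch of the difference, using plain snprintf() rather than the
driver's INFO() macro. Both helper names are made up for illustration,
and the exact text produced by '%p' is implementation-defined, so the
0x prefix is what I expect from glibc rather than a guarantee.

```c
#include <stddef.h>
#include <stdio.h>

/* Format ptr the way the patch does: a literal "0x" followed by %p. */
static int
fmt_with_extra_prefix(char *buf, size_t len, void *ptr)
{
	return snprintf(buf, len, "0x%p", ptr);
}

/* Format ptr with %p alone and let the C library add its own prefix. */
static int
fmt_plain(char *buf, size_t len, void *ptr)
{
	return snprintf(buf, len, "%p", ptr);
}
```

The first helper always yields the second helper's output with an extra
"0x" in front, so dropping the literal prefix from the format string is
enough.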

> +	priv->uar_base = addr; /* for primary and secondary UAR re-mmap */
> +	uar_base = addr; /* process local, don't reserve again */
> +	return 0;
> +}
> +
<snip/>

Regards,

-- 
Nélio Laranjeiro
6WIND
