DPDK usage discussions
 help / color / mirror / Atom feed
From: Ricardo Roldan <rroldan@bequant.com>
To: users@dpdk.org
Subject: [dpdk-users] attach/detach on secondary process
Date: Wed, 13 Dec 2017 17:58:53 +0100	[thread overview]
Message-ID: <7a3a2174-831f-caa8-ed33-0f06133c96a2@bequant.com> (raw)
In-Reply-To: <fb117eef-ddad-67f6-8e0f-6febdbc0aeed@bequant.com>

Hi,

We have a multi-process application and we need to support 
attaching/detaching of ports. We are using the 17.11 version with the 
Intel x520 (ixgbe driver) and virtio.

At the time we initialize our processes there are not any devices binded 
with the DPDK drivers, so we initialize all processes (primary and 
secondaries) with 0 ports.

This seems to work fine only on the primary processes, but on the 
secondary processes we see some problems. In the following paragraphs I 
describe the procedure used to attach/detach interfaces with DPDK.

For the attach procedure (all processes initially have no devices 
attached):

- Bind the devices we want to attach to the DPDK driver (with the script 
dpdk-devbind, from external process)

- Primary process: Call rte_eth_dev_attach

- Primary process: Configure ports using ...

- Secondary processes: Call to rte_eth_dev_attach


Start to send/receive packets from all processes.


For the detach procedure:

- Secondary processes: For each port, call rte_eth_dev_stop(port), 
rte_eth_dev_close(port) and rte_eth_dev_detach(port, dev).

- Primary process: After the secondary processes have detach all their 
ports, for each port call rte_eth_dev_stop(port), 
rte_eth_dev_close(port) and rte_eth_dev_detach(port, dev).

- Bind the device to the original Linux driver (with the script 
dpdk-devbind, from external process)


With this approach we have noticed that when the secondary processes 
call rte_dev_detach there is an error, because it calls the remove 
operation, which ends up calling eth_ixgbe_dev_uninit that returns 
-EPERM (because it does not allow a non primary process to uninitialize 
the driver).


Therefore, the port attach never works again on the secondary processes 
as the function rte_eal_hotplug_add fails because it cannot find the 
device.


dev = bus->find_device(NULL, cmp_detached_dev_name, devname);
         if (dev == NULL) {
                 RTE_LOG(ERR, EAL, "Cannot find unplugged device (%s)\n",
                         devname);
                 ret = -ENODEV;
                 goto err_devarg;
         }


This happens because in order to check unplugged devices, the function 
cmp_detached_dev_name  checks if there is a pointer to the driver and 
fails if it is already set, because the detach procedure never set the 
driver variable to NULL.

static int cmp_detached_dev_name(const struct rte_device *dev,
         const void *_name)
{
         const char *name = _name;

         /* skip attached devices */
         RTE_LOG(ERR, EAL, "cmp_detached_dev_name dev %p name %s driver %p"
         " search %s\n",
         dev, dev->name, dev->driver, name);
         if (dev->driver != NULL)
                 return 1;

         return strcmp(dev->name, name);
}

To fix this behavior we have done the following changes on the DPDK code.

First, in order to prevent cmp_detached_dev_name from failing, 
rte_eal_dev_detach sets driver to NULL.

diff --git a/lib/librte_eal/common/eal_common_dev.c 
b/lib/librte_eal/common/eal_common_dev.c

index dda8f5835..9a363dcf7 100644
--- a/lib/librte_eal/common/eal_common_dev.c
+++ b/lib/librte_eal/common/eal_common_dev.c
@@ -114,6 +114,7 @@ int rte_eal_dev_detach(struct rte_device *dev)
         if (ret)
                 RTE_LOG(ERR, EAL, "Driver cannot detach the device (%s)\n",
                         dev->name);
+       dev->driver = NULL;
         return ret;
  }/*
*/


Then, in the rte_eth_dev_pci_generic_remove function, the call to 
dev_uninit does not consider -EPERM an error, because when detaching a 
port, some drivers return 0 and other drivers return -EPERM to indicate 
that it is called from a secondary process.

diff --git a/lib/librte_ether/rte_ethdev_pci.h 
b/lib/librte_ether/rte_ethdev_pci.h
@@ -184,7 +184,7 @@ rte_eth_dev_pci_generic_remove(struct rte_pci_device 
*pci_dev,

         if (dev_uninit) {
                 ret = dev_uninit(eth_dev);
-               if (ret)
+               if (ret && ret != -EPERM)
                         return ret;
         }

Finally, in the rte_eth_dev_pci_release function, only the fields in the 
shared memory region are reset if called from a primary process.

/**/diff --git a/lib/librte_ether/rte_ethdev_pci.h 
b/lib/librte_ether/rte_ethdev_pci.h
index 722075e09..a79188fbf 100644
--- a/lib/librte_ether/rte_ethdev_pci.h
+++ b/lib/librte_ether/rte_ethdev_pci.h
@@ -125,16 +125,16 @@ rte_eth_dev_pci_release(struct rte_eth_dev *eth_dev)
         /* free ether device */
         rte_eth_dev_release_port(eth_dev);

-       if (rte_eal_process_type() == RTE_PROC_PRIMARY)
+       if (rte_eal_process_type() == RTE_PROC_PRIMARY) {
rte_free(eth_dev->data->dev_private);
+ eth_dev->data->dev_private = NULL;

-       eth_dev->data->dev_private = NULL;
-
-       /*
-        * Secondary process will check the name to attach.
-        * Clear this field to avoid attaching a released ports.
-        */
-       eth_dev->data->name[0] = '\0';
+               /*
+                * Secondary process will check the name to attach.
+                * Clear this field to avoid attaching a released ports.
+                */
+               eth_dev->data->name[0] = '\0';
+        }

         eth_dev->device = NULL;
         eth_dev->intr_handle = NULL;


After applying these changes, it seems like attaching and detaching 
ports multiple times works without problems, at least with the ixgbe and 
virtio drivers.

In order to generate a patch, It would be very helpful if anyone can 
confirm that this approach is correct and that we do not break any other 
parts.

Best regards,
Ricardo


On 12/13/2017 03:31 PM, Ricardo Roldan wrote:
>
>
> Hi,
>
> We have a multi-process application and we need to support 
> attaching/detaching of ports.
>
> At the time we initialize our process they aren't any devices binded 
> with the dpdk drivers, so we initialize in all processes with 0 ports.
>
> We manage to work fine only on the primary processes, on the secondary 
> process we had some problems. Here is the way call the dpdk
> interface to attach/detach from our process
>
>       Attach - At this moment all process (primary and secondaries) 
> has not devices attached
>
>       - Bind the devices we want to attach to the DPDK driver (with 
> the script dpdk-devbind, from external process)
>
>       - Call rte_eth_dev_attach from primary process
>
>       - Configure ports from primary process
>
>       - From secondary process call to rte_eth_dev_attach
>
>
>      Began to send/receive packets from all process
>
>
>
>      Detach
>
>        - From secondary process call to:
>
>               rte_eth_dev_stop(port);
>
>               rte_eth_dev_close(port);
>               rte_eth_dev_detach(port, dev);
>
>
>        - From primary process call to:
>
>               rte_eth_dev_stop(port);
>
>               rte_eth_dev_close(port);
>               rte_eth_dev_detach(port, dev);
>
>
>        - Bind the device to the original Linux driver (with the script 
> dpdk-devbind, from external process)
>
>
>    With this second approach we notice that the detach from the 
> secondary process returns with an error. This is because the function 
> eth_ixgbe_dev_uninit has a check to not uninitialize the driver from a 
> not primary process.
>
>    So that the detach can't finish all its task.
>
>    After this, the attach never works again on the secondary process 
> as the function rte_eal_hotplug_add began to fail due that it cant not 
> find the device.
>
> */        dev = bus->find_device(NULL, cmp_detached_dev_name, 
> devname); /**/
> /**/        if (dev == NULL) { /**/
> /**/                RTE_LOG(ERR, EAL, "Cannot find unplugged device 
> (%s)\n", /**/
> /**/                        devname); /**/
> /**/                ret = -ENODEV; /**/
> /**/                goto err_devarg; /**/
> /**/        } /**/
> /**//*
>    This is due that to check unplugged devices, what the comparation 
> function cmp_detached_dev_name do, is to check if there is a pointer 
> to the driver.
>
> */static int cmp_detached_dev_name(const struct rte_device *dev, /**/
> /**/                   const void *_name) /**/
> /**/{ /**/
> /**/        const char *name = _name; /**/
> /**//**/
> /**/        /* skip attached devices */ /**/
> /**/        if (dev->driver != NULL) /**/
> /**/                return 1; /**/
> /**//**/
> /**/        return strcmp(dev->name, name); /**/
> /**/} /*/
> /
>
> And as detach has not finish correctly on the secondary process, the 
> device continues with the pointer to the driver setted. So the attach 
> fails as the device is not found.
>
>
> To overcome this behavior we had done this changes on the DPDK code:
>
> We have modify the dpdk to clean the pointer to the driver on the 
> detach.We had modify also the function rte_eth_dev_pci_generic_remove 
> so even if the uninit
> of the driver return with -EPERM the function continue executing the 
> rest of the code. We had done this as we had seen that the check on 
> the uninit testing to
> see if the process is not a primary  is donein all drivers, but some 
> drivers return with no error ( 0) and others with (-EPERM). So on 
> rte_eth_dev_pci_generic if the call
> to uninit returns with -EPERM we continue executing calling 
> rte_eth_dev_pci_release. To that last function we had also done some 
> changes, as only primary process
> should be able to uninitialized some common values, that a detach on a 
> secondary process should never do.
>
> These are the changes:
>
> /*diff --git a/lib/librte_eal/common/eal_common_dev.c 
> b/lib/librte_eal/common/eal_common_dev.c*//*
> *//*index dda8f5835..9a363dcf7 100644*//*
> *//*--- a/lib/librte_eal/common/eal_common_dev.c*//*
> *//*+++ b/lib/librte_eal/common/eal_common_dev.c*//*
> *//*@@ -114,6 +114,7 @@ int rte_eal_dev_detach(struct rte_device *dev)*//*
> *//*        if (ret)*//*
> *//*                RTE_LOG(ERR, EAL, "Driver cannot detach the device 
> (%s)\n",*//*
> *//*                        dev->name);*//*
> *//*+       dev->driver = NULL;*//*
> *//*        return ret;*//*
> *//* }*//*
> *//**//*
> *//*diff --git a/lib/librte_ether/rte_ethdev_pci.h 
> b/lib/librte_ether/rte_ethdev_pci.h*//*
> *//*index 722075e09..a79188fbf 100644*//*
> *//*--- a/lib/librte_ether/rte_ethdev_pci.h*//*
> *//*+++ b/lib/librte_ether/rte_ethdev_pci.h*//*
> *//*@@ -125,16 +125,16 @@ rte_eth_dev_pci_release(struct rte_eth_dev 
> *eth_dev)*//*
> *//*        /* free ether device */*//*
> *//*        rte_eth_dev_release_port(eth_dev);*//*
> *//**//*
> *//*-       if (rte_eal_process_type() == RTE_PROC_PRIMARY)*//*
> *//*+       if (rte_eal_process_type() == RTE_PROC_PRIMARY) {*//*
> *//*rte_free(eth_dev->data->dev_private);*//*
> *//*+ eth_dev->data->dev_private = NULL;*//*
> *//**//*
> *//*-       eth_dev->data->dev_private = NULL;*//*
> *//*-*//*
> *//*-       /**//*
> *//*-        * Secondary process will check the name to attach.*//*
> *//*-        * Clear this field to avoid attaching a released ports.*//*
> *//*-        */*//*
> *//*-       eth_dev->data->name[0] = '\0';*//*
> *//*+               /**//*
> *//*+                * Secondary process will check the name to 
> attach.*//*
> *//*+                * Clear this field to avoid attaching a released 
> ports.*//*
> *//*+                */*//*
> *//*+ eth_dev->data->name[0] = '\0';*//*
> *//*+        }*//*
> *//**//*
> *//*        eth_dev->device = NULL;*//*
> *//*        eth_dev->intr_handle = NULL;*//*
> *//*@@ -184,7 +184,7 @@ rte_eth_dev_pci_generic_remove(struct 
> rte_pci_device *pci_dev,*//*
> *//**//*
> *//*        if (dev_uninit) {*//*
> *//*                ret = dev_uninit(eth_dev);*//*
> *//*-               if (ret)*//*
> *//*+               if (ret && ret != -EPERM)*//*
> *//*                        return ret;*//*
> *//*        }*//*
> *//**//*
> */
>
>
> And now seems to work. Is this a correct way to proceed?
>
> Could some one that have work with this functionalities can advise us?
>
> Best regards,
>
> Ricardo
>
>

       reply	other threads:[~2017-12-13 17:00 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <fb117eef-ddad-67f6-8e0f-6febdbc0aeed@bequant.com>
2017-12-13 16:58 ` Ricardo Roldan [this message]
2017-12-13 17:09   ` Stephen Hemminger
2017-12-13 21:00     ` Thomas Monjalon
2017-12-13 21:10       ` Stephen Hemminger
2017-12-13 21:20         ` Thomas Monjalon
2018-01-08  5:33           ` [dpdk-users] How to mirror the live traffic in dpdk jyoti swarup
2018-01-08  7:01             ` Gowda, Sandesh

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=7a3a2174-831f-caa8-ed33-0f06133c96a2@bequant.com \
    --to=rroldan@bequant.com \
    --cc=users@dpdk.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).