From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gaetan.rivet@6wind.com>
Received: from mail-wr1-f67.google.com (mail-wr1-f67.google.com
 [209.85.221.67]) by dpdk.org (Postfix) with ESMTP id 096DF239
 for <dev@dpdk.org>; Wed,  6 Mar 2019 11:46:36 +0100 (CET)
Received: by mail-wr1-f67.google.com with SMTP id d17so12817655wre.10
 for <dev@dpdk.org>; Wed, 06 Mar 2019 02:46:36 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=6wind-com.20150623.gappssmtp.com; s=20150623;
 h=date:from:to:cc:subject:message-id:references:mime-version
 :content-disposition:content-transfer-encoding:in-reply-to
 :user-agent; bh=xbM1lJp6jO9uJQYUNResyPYyfq8GaLhvgeZy8TRSkX4=;
 b=OBdTK8FNF+kGSA2DR5oeRmDePwG17BzuOgGsNfDany2MG3erKzRHXMoCpsPtpFC4RS
 lko79AySHdYjDWqqMJySMx4KATYB116vOiIUyDce4Nr3eQiz+UWH4Z9L3jufUlSknbZz
 PJUh/5bZSHWCo6bJFV8bnQEO9U5H5LaI/dhnkmBIy2XiPcjphBbih/kRrjW10qCnRHDB
 Jdi60GMsIwth6o7lEXkEh3cmChoQtWXjTujUwE6DhBtef/7I4msK/pogFFuSlEkTOmkm
 TtC/FXKmG0WP+jgyJYaht/hvrLVuSOrDXMKtHbw1JrYg+6Xs+OYVqkn2L8ii/C1MrpSK
 mBxQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:date:from:to:cc:subject:message-id:references
 :mime-version:content-disposition:content-transfer-encoding
 :in-reply-to:user-agent;
 bh=xbM1lJp6jO9uJQYUNResyPYyfq8GaLhvgeZy8TRSkX4=;
 b=LuJEXPlWdXGoFmDHVGhwZv8rwVTiVuTtF5PAyte3SPZotw0t7uYB1wLmkxAgxzXdWF
 KBxuqyUJgzMF/GxyhXSra4Sl49qUZ1R1vgQdCVkD3XcjCJ+Hppk3DgvewmtzP8Lrt/kC
 j4UNr61Zhv+Y50xcdBGxrzLHuj/4PU48EnBohpasUgIlCjiPkrxte+O7o8/VrHWkOpsq
 oVCW4dFnW3ZHrK6VZGtjkGKyMeMFWTT1rUDqw7wunvAVwwOeBB8Ajdii0VF5T22TsdUS
 njVB5U6QXH5sTAEJmf6jE9ZVOsvPV689bekhbE1v9MJZGSXb/4nJrlm0Kl7WvubIW6tA
 +J8w==
X-Gm-Message-State: APjAAAVLWrOrqRjwDo87XajhZXZ/Mh/S28uv4oBIv6W4xScbRCQJyUPu
 dAy8oBRyFlSqKRaM07MtPD2XAA==
X-Google-Smtp-Source: APXvYqyb7zZOqIxbGSLtibtFskpboKF+aIdeFe0orI5V3iI8k6J/erCtJ6rnCPnJZ5KzhAnDijWkYw==
X-Received: by 2002:a5d:6542:: with SMTP id z2mr2420891wrv.237.1551869195423; 
 Wed, 06 Mar 2019 02:46:35 -0800 (PST)
Received: from bidouze.vm.6wind.com (host.78.145.23.62.rev.coltfrance.com.
 [62.23.145.78])
 by smtp.gmail.com with ESMTPSA id c1sm3883205wrw.7.2019.03.06.02.46.34
 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256);
 Wed, 06 Mar 2019 02:46:34 -0800 (PST)
Date: Wed, 6 Mar 2019 11:46:32 +0100
From: =?iso-8859-1?Q?Ga=EBtan?= Rivet <gaetan.rivet@6wind.com>
To: Thomas Monjalon <thomas@monjalon.net>
Cc: Raslan Darawsheh <rasland@mellanox.com>, "dev@dpdk.org" <dev@dpdk.org>,
 "stephen@networkplumber.org" <stephen@networkplumber.org>
Message-ID: <20190306104632.fia2r2sz5yajvkne@bidouze.vm.6wind.com>
References: <1551779507-10857-1-git-send-email-rasland@mellanox.com>
 <1551779507-10857-3-git-send-email-rasland@mellanox.com>
 <20190305173825.2ho27gqfx7f72xqb@bidouze.vm.6wind.com>
 <172443910.dRk5910QrU@xps>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <172443910.dRk5910QrU@xps>
User-Agent: NeoMutt/20170113 (1.7.2)
Subject: Re: [dpdk-dev] [PATCH v2 3/4] net/failsafe: replace local
 sub-device with shared data
X-BeenThere: dev@dpdk.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: DPDK patches and discussions <dev.dpdk.org>
List-Unsubscribe: <https://mails.dpdk.org/options/dev>,
 <mailto:dev-request@dpdk.org?subject=unsubscribe>
List-Archive: <http://mails.dpdk.org/archives/dev/>
List-Post: <mailto:dev@dpdk.org>
List-Help: <mailto:dev-request@dpdk.org?subject=help>
List-Subscribe: <https://mails.dpdk.org/listinfo/dev>,
 <mailto:dev-request@dpdk.org?subject=subscribe>
X-List-Received-Date: Wed, 06 Mar 2019 10:46:36 -0000

On Tue, Mar 05, 2019 at 06:58:04PM +0100, Thomas Monjalon wrote:
> Hi,
> 
> 05/03/2019 18:38, Gaëtan Rivet:
> > >  fs_dev_remove(struct sub_device *sdev)
> [...]
> > > -		rte_eth_dev_close(PORT_ID(sdev));
> > > +		rte_eth_dev_close(edev->data->port_id);
> > 
> > Ok I see. I missed that during the first reading, the private_data is
> > zeroed on dev_close(), so ETH(sdev) becomes invalid here.
> 
> I don't follow you. What do you mean with this comment?
> 

PORT_ID(sdev) uses ETH(sdev).

ETH(sdev) will now point to `&rte_eth_devices[(sdev)->data->port_id]`,
so data->port_id will be zeroed on sdev close.

So once the sub-device has been closed, calling
rte_eth_dev_release_port(ETH(sdev)) is not possible anymore, justifying
the change (taking the reference to ETH(sdev) first, then using it after
shared data has been overwritten).

> > What happens when a primary process closes a device before a secondary?
> > Is the secondary unable to stop / close its own then? Isn't there some
> > missing uninit?
> 
> Is the secondary process supposed to do any closing?
> The device management should be done only by the primary process.
> 
> Note: anyway all this hotplug related code should be dropped
> from failsafe to be replaced by EAL hotplug management.
> 

I don't know, I've never used secondary process.
However, cursory reading the code of rte_eth_dev_close(), I don't see
a guard against calling it from a secondary process?

Reading code like

   rte_free(dev->data->rx_queues);
   dev->data->rx_queues = NULL;

within makes me think the issue has been seen at least once, where
shared data is freed multiple times, so I guessed some secondary
processes were calling it. Maybe they are not meant to, but what
prevents them from being badly written?

Also, given rte_dev_remove IPC call to transfer the order to the
primary, it seems that at least secondary processes are expected to call
rte_dev_remove() at some point? So are they only authorized to call
rte_dev_remove() (to manage hotplug), but not rte_eth_dev_close()? Is
there a specific documentation detailing the design of secondary process
and the related responsibilities in the lifetime of a device? How are
they synching their rte_eth_devices list if they are not calling
rte_eth_dev_close(), ever?

> > This seems dangerous to me. Why not instead allocating a per-process
> > slab of memory that would hold the relevant references and outlive the
> > shared data (a per-process rte_eth_dev private data...).
> 
> Which data do you think should be allocated per process?
> 
> 

[-------- SHARED SPACE --------------] [-- PER-PROCESS --------]
+--------------------------------------------------------------+
| +------------------+                +- rte_eth_devices[n] -+ |
| |rte_eth_dev_data  |<---------------+ data                 | | PRIMARY
| |                  |   +dev_priv-+  |                      | |
| |      dev_private +-->|         |  |                      | |
| |              ... |   |         |  |                      | |
| |          port_id |   |         |  |                      | |
| |                  |   |         |  |                      | |
| |                  |   |         |  |                      | |
| |                  |   |         |  +----------------------+ |
| |                  |   |         |  +- rte_eth_devices[n] -+ |
| |                  |   |         |  |                      | | SECONDARY
| |                  |   |         |  |                      | |
| |                  |   |         |  |                      | |
| |                  |   |         |  |                      | |
| |                  |   +---------+  |                      | |
| |                  |<---------------+ data                 | |
| +------------------+                +----------------------+ |
+--------------------------------------------------------------+

Here port_id is used within fail-safe to get back to rte_eth_devices[n].
This disappears once a device is closed, as all shared space is zeroed.

This means that sometimes ETH(sdev) and PORT_ID(sdev) is correct,
and at some point it is not anymore, once a sub-device has been
closed. This seems dangerous.

I was thinking initially that allocating a place where each sdev would
store their rte_eth_devices / port_id back-reference could alleviate the
issue, meaning that the fail-safe would not zero it on sdev_close(), and
it would remain valid for the lifetime of a sub-device, so even when a
sub-device is in DEV_PROBED state.

But now that I think about it, it could probably be simpler: instead of
using (ETH(sdev)->data->port_id) for the port_id of an sdev (meaning
that it is dependent on the lifetime of the sdev, instead of the
lifetime of the failsafe), the port-id itself should be stored in the
sub_device structure. This structure will be available for the lifetime
of the failsafe, and the port_id is correct accross all processes.

So PORT_ID(sdev) would be defined to something like (sdev->port_id), and
ETH(sdev) would be (&rte_eth_devices[PORT_ID(sdev)]). It would remain
correct even once the primary has closed the sub-device.

What do you think? Do you agree that the current state is dangerous, and
do you think the solution would alleviate the issue? Maybe the concern
is unfounded and the only issue is within fs_dev_remove().

-- 
Gaëtan Rivet
6WIND