From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from dpdk.org (dpdk.org [92.243.14.124]) by inbox.dpdk.org (Postfix) with ESMTP id C9E74A053A; Thu, 23 Jan 2020 15:48:18 +0100 (CET) Received: from [92.243.14.124] (localhost [127.0.0.1]) by dpdk.org (Postfix) with ESMTP id 8F6C34C7B; Thu, 23 Jan 2020 15:48:18 +0100 (CET) Received: from mga01.intel.com (mga01.intel.com [192.55.52.88]) by dpdk.org (Postfix) with ESMTP id 3B0A34C7A; Thu, 23 Jan 2020 15:48:17 +0100 (CET) X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga101.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 23 Jan 2020 06:48:14 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.70,354,1574150400"; d="scan'208";a="222359871" Received: from fyigit-mobl.ger.corp.intel.com (HELO [10.237.221.35]) ([10.237.221.35]) by fmsmga008.fm.intel.com with ESMTP; 23 Jan 2020 06:48:14 -0800 To: Matan Azrad , "Yigit, Ferruh" , "dev@dpdk.org" , Bernard Iremonger Cc: Gaetan Rivet , Thomas Monjalon , "stable@dpdk.org" , David Marchand , Jeff Guo , Qi Zhang References: <1573548459-6931-1-git-send-email-matan@mellanox.com> <1573548459-6931-2-git-send-email-matan@mellanox.com> From: Ferruh Yigit Autocrypt: addr=ferruh.yigit@intel.com; prefer-encrypt=mutual; keydata= mQINBFXZCFABEADCujshBOAaqPZpwShdkzkyGpJ15lmxiSr3jVMqOtQS/sB3FYLT0/d3+bvy qbL9YnlbPyRvZfnP3pXiKwkRoR1RJwEo2BOf6hxdzTmLRtGtwWzI9MwrUPj6n/ldiD58VAGQ +iR1I/z9UBUN/ZMksElA2D7Jgg7vZ78iKwNnd+vLBD6I61kVrZ45Vjo3r+pPOByUBXOUlxp9 GWEKKIrJ4eogqkVNSixN16VYK7xR+5OUkBYUO+sE6etSxCr7BahMPKxH+XPlZZjKrxciaWQb +dElz3Ab4Opl+ZT/bK2huX+W+NJBEBVzjTkhjSTjcyRdxvS1gwWRuXqAml/sh+KQjPV1PPHF YK5LcqLkle+OKTCa82OvUb7cr+ALxATIZXQkgmn+zFT8UzSS3aiBBohg3BtbTIWy51jNlYdy ezUZ4UxKSsFuUTPt+JjHQBvF7WKbmNGS3fCid5Iag4tWOfZoqiCNzxApkVugltxoc6rG2TyX CmI2rP0mQ0GOsGXA3+3c1MCdQFzdIn/5tLBZyKy4F54UFo35eOX8/g7OaE+xrgY/4bZjpxC1 1pd66AAtKb3aNXpHvIfkVV6NYloo52H+FUE5ZDPNCGD0/btFGPWmWRmkPybzColTy7fmPaGz cBcEEqHK4T0aY4UJmE7Ylvg255Kz7s6wGZe6IR3N0cKNv++O7QARAQABtCVGZXJydWggWWln aXQgPGZlcnJ1aC55aWdpdEBpbnRlbC5jb20+iQJUBBMBCgA+AhsDAh4BAheABQsJCAcDBRUK CQgLBRYCAwEAFiEE0jZTh0IuwoTjmYHH+TPrQ98TYR8FAl1meboFCQlupOoACgkQ+TPrQ98T YR9ACBAAv2tomhyxY0Tp9Up7mNGLfEdBu/7joB/vIdqMRv63ojkwr9orQq5V16V/25+JEAD0 60cKodBDM6HdUvqLHatS8fooWRueSXHKYwJ3vxyB2tWDyZrLzLI1jxEvunGodoIzUOtum0Ce gPynnfQCelXBja0BwLXJMplM6TY1wXX22ap0ZViC0m714U5U4LQpzjabtFtjT8qOUR6L7hfy YQ72PBuktGb00UR/N5UrR6GqB0x4W41aZBHXfUQnvWIMmmCrRUJX36hOTYBzh+x86ULgg7H2 1499tA4o6rvE13FiGccplBNWCAIroAe/G11rdoN5NBgYVXu++38gTa/MBmIt6zRi6ch15oLA Ln2vHOdqhrgDuxjhMpG2bpNE36DG/V9WWyWdIRlz3NYPCDM/S3anbHlhjStXHOz1uHOnerXM 1jEjcsvmj1vSyYoQMyRcRJmBZLrekvgZeh7nJzbPHxtth8M7AoqiZ/o/BpYU+0xZ+J5/szWZ aYxxmIRu5ejFf+Wn9s5eXNHmyqxBidpCWvcbKYDBnkw2+Y9E5YTpL0mS0dCCOlrO7gca27ux ybtbj84aaW1g0CfIlUnOtHgMCmz6zPXThb+A8H8j3O6qmPoVqT3qnq3Uhy6GOoH8Fdu2Vchh TWiF5yo+pvUagQP6LpslffufSnu+RKAagkj7/RSuZV25Ag0EV9ZMvgEQAKc0Db17xNqtSwEv mfp4tkddwW9XA0tWWKtY4KUdd/jijYqc3fDD54ESYpV8QWj0xK4YM0dLxnDU2IYxjEshSB1T qAatVWz9WtBYvzalsyTqMKP3w34FciuL7orXP4AibPtrHuIXWQOBECcVZTTOdZYGAzaYzxiA ONzF9eTiwIqe9/oaOjTwTLnOarHt16QApTYQSnxDUQljeNvKYt1lZE/gAUUxNLWsYyTT+22/ vU0GDUahsJxs1+f1yEr+OGrFiEAmqrzpF0lCS3f/3HVTU6rS9cK3glVUeaTF4+1SK5ZNO35p iVQCwphmxa+dwTG/DvvHYCtgOZorTJ+OHfvCnSVjsM4kcXGjJPy3JZmUtyL9UxEbYlrffGPQ I3gLXIGD5AN5XdAXFCjjaID/KR1c9RHd7Oaw0Pdcq9UtMLgM1vdX8RlDuMGPrj5sQrRVbgYH fVU/TQCk1C9KhzOwg4Ap2T3tE1umY/DqrXQgsgH71PXFucVjOyHMYXXugLT8YQ0gcBPHy9mZ qw5mgOI5lCl6d4uCcUT0l/OEtPG/rA1lxz8ctdFBVOQOxCvwRG2QCgcJ/UTn5vlivul+cThi 6ERPvjqjblLncQtRg8izj2qgmwQkvfj+h7Ex88bI8iWtu5+I3K3LmNz/UxHBSWEmUnkg4fJl Rr7oItHsZ0ia6wWQ8lQnABEBAAGJAjwEGAEKACYCGwwWIQTSNlOHQi7ChOOZgcf5M+tD3xNh HwUCXWZ5wAUJB3FgggAKCRD5M+tD3xNhH2O+D/9OEz62YuJQLuIuOfL67eFTIB5/1+0j8Tsu o2psca1PUQ61SZJZOMl6VwNxpdvEaolVdrpnSxUF31kPEvR0Igy8HysQ11pj8AcgH0a9FrvU /8k2Roccd2ZIdpNLkirGFZR7LtRw41Kt1Jg+lafI0efkiHKMT/6D/P1EUp1RxOBNtWGV2hrd 0Yg9ds+VMphHHU69fDH02SwgpvXwG8Qm14Zi5WQ66R4CtTkHuYtA63sS17vMl8fDuTCtvfPF HzvdJLIhDYN3Mm1oMjKLlq4PUdYh68Fiwm+boJoBUFGuregJFlO3hM7uHBDhSEnXQr5mqpPM 6R/7Q5BjAxrwVBisH0yQGjsWlnysRWNfExAE2sRePSl0or9q19ddkRYltl6X4FDUXy2DTXa9 a+Fw4e1EvmcF3PjmTYs9IE3Vc64CRQXkhujcN4ZZh5lvOpU8WgyDxFq7bavFnSS6kx7Tk29/ wNJBp+cf9qsQxLbqhW5kfORuZGecus0TLcmpZEFKKjTJBK9gELRBB/zoN3j41hlEl7uTUXTI JQFLhpsFlEdKLujyvT/aCwP3XWT+B2uZDKrMAElF6ltpTxI53JYi22WO7NH7MR16Fhi4R6vh FHNBOkiAhUpoXRZXaCR6+X4qwA8CwHGqHRBfYFSU/Ulq1ZLR+S3hNj2mbnSx0lBs1eEqe2vh cA== Message-ID: <19a86d69-9bcc-42c9-b000-98b3860de42f@intel.com> Date: Thu, 23 Jan 2020 14:48:10 +0000 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Subject: Re: [dpdk-dev] [dpdk-stable] [PATCH 2/2] app/testpmd: fix invalid port detaching X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On 1/23/2020 2:05 PM, Matan Azrad wrote: > Hi > > From: Yigit, Ferruh >> On 11/12/2019 8:47 AM, Matan Azrad wrote: >>> The port was not validated before detaching. >>> >>> Ignore port detach operation when the port is not valid. >>> >>> Fixes: f8e5baa2662d ("app/testpmd: check not detaching device twice") >>> Cc: thomas@monjalon.net >>> Cc: stable@dpdk.org >>> >>> Signed-off-by: Matan Azrad >>> --- >>> app/test-pmd/testpmd.c | 3 +++ >>> 1 file changed, 3 insertions(+) >>> >>> diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c index >>> 4444346..370eefe 100644 >>> --- a/app/test-pmd/testpmd.c >>> +++ b/app/test-pmd/testpmd.c >>> @@ -2545,6 +2545,9 @@ struct extmem_param { >>> >>> printf("Removing a device...\n"); >>> >>> + if (port_id_is_invalid(port_id, ENABLED_WARN)) >>> + return; >>> + >>> dev = rte_eth_devices[port_id].device; >>> if (dev == NULL) { >>> printf("Device already removed\n"); >>> >> >> The patch is already in 19.11 [1] but it is breaking the testpmd hotplug >> support. >> Before 'detach_port_device()' called, the port has been stopped and closed >> [2], which will make port fail from 'port_id_is_invalid()' check and the device >> removal path never fully called. >> The implication is, since device not detached, vfio request interrupt keeps >> triggered continuously and re-starts the detach path, but because of the half >> cleaned device it fails and app gets stuck with a continuous log [3]. >> >> I wonder if the actual hotplug has been tested with this patch, the commit >> log is not clear about the motivation and implication of the patch, I am not >> clear why this check is added but I am sending a patch soon to remove it >> back. > > The motivation of this patch was to prevent double detach on same port, so the user cannot call detach of invalid port. What is the definition of the 'invalid port', if you mean device already detached case, in the second call of the function "if (dev == NULL)" check should prevent it going forward. But according the 'port_id_is_invalid()' API, a closed port is an invalid port, I think that is wrong in this context. > > I agree this patch is not good and we need a fix but I think the bug is conceptual. > > Testpmd tries to do detach by port_id which is derived by ethdev port id while detach work with rte_device. > > For example: > you can see in the line above after +++: dev = rte_eth_devices[port_id].device, > Testpmd may access invalid or reallocated ethdev structure to get the device name and may even detach unwanted rte_device. I thinks whichever function calling 'detach_port_device()' should check the port validity. 'detach_port_device()' doesn't know if port reallocated or not, it will free the given port_id, and when freeing done 'rte_eth_devices[port_id].device' will be NULL, this looks to me a valid check. The caller of the 'detach_port_device()' should ensure correct port_id passed to the function. > > So, detach is broken with and without this patch. I can't see how it is broken without the check, how the problem you mentioned can be reproduced? Or is it a theoretical issue? But with this check hotplug support is %100 reproducible broken. > > > I think Testpmd should change the concept of rte_device mapping and put attention to next: > 1. Don't detach by ethdev port ID. > 2. Multiple ethdev port IDs may related to the same rte_device. > > The Testpmd user should be sure that all the port IDs of the rte_device are released before the detach call and Testpmd maybe need to validate it. > And like attach, detach should be triggered by PCI address \ rte_device name. > We need to know about port_id too to be able to stop/close it. And sure no objection to improve the hotplug support but it is broken now, lets fix it first. > > > > > > > > > > > > > > > > > > > > > > > > > >> Regards, >> ferruh >> >> >> [1] >> https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.dp >> dk.org%2Fdpdk%2Fcommit%2F%3Fid%3D43d0e304980a1527bcac92dc679057 >> b189e2545a&data=02%7C01%7Cmatan%40mellanox.com%7Cc3f40356d >> d124e20faf708d7a006e68c%7Ca652971c7d2e4d9ba6a4d149256f461b%7C0%7 >> C0%7C637153823809699996&sdata=dBy9m%2BxCA%2Bme1IpX2LqPARa >> 62giznKi8Xbtu220GA%2Bg%3D&reserved=0 >> >> [2] >> rmv_port_callback >> stop_port(port_id); >> close_port(port_id); >> detach_port_device(port_id); >> >> [3] >> EAL: can not get port by device 0000:00:05.0! >> EAL: can not get port by device 0000:00:05.0! >> EAL: can not get port by device 0000:00:05.0! >> EAL: can not get port by device 0000:00:05.0! >> EAL: can not get port by device 0000:00:05.0! >> EAL: can not get port by device 0000:00:05.0! >> ...