From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mga04.intel.com (mga04.intel.com [192.55.52.120]) by dpdk.org (Postfix) with ESMTP id EC4EE2BC9 for ; Tue, 8 Aug 2017 19:24:46 +0200 (CEST) Received: from orsmga003.jf.intel.com ([10.7.209.27]) by fmsmga104.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 08 Aug 2017 10:24:31 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.41,344,1498546800"; d="scan'208,217";a="1001408118" Received: from fmsmsx107.amr.corp.intel.com ([10.18.124.205]) by orsmga003.jf.intel.com with ESMTP; 08 Aug 2017 10:24:28 -0700 Received: from fmsmsx125.amr.corp.intel.com (10.18.125.40) by fmsmsx107.amr.corp.intel.com (10.18.124.205) with Microsoft SMTP Server (TLS) id 14.3.319.2; Tue, 8 Aug 2017 10:23:57 -0700 Received: from fmsmsx113.amr.corp.intel.com ([169.254.13.214]) by FMSMSX125.amr.corp.intel.com ([169.254.2.32]) with mapi id 14.03.0319.002; Tue, 8 Aug 2017 10:23:57 -0700 From: "Wiles, Keith" To: "Yigit, Ferruh" CC: Thomas Monjalon , "dev@dpdk.org" , Stephen Hemminger , "Richardson, Bruce" , "Chilikin, Andrey" , "Ananyev, Konstantin" , "Wu, Jingjing" Thread-Topic: [dpdk-dev] [RFC] ethdev: add ioctl-like API to control device specific features Thread-Index: AQHTDRj4kO/62365mUS+R0MtbMBkT6J7MVuA Date: Tue, 8 Aug 2017 17:23:56 +0000 Message-ID: <8EE79C6D-7778-4041-88DF-9329A008B374@intel.com> References: <20170803132138.GA8732@bricha3-MOBL3.ger.corp.intel.com> <20170803091531.5902f86c@xeon-e3> <1581480.IJArXVfUmc@xps> <12c296b7-0696-6c12-7fb2-e43bcd49d784@intel.com> In-Reply-To: <12c296b7-0696-6c12-7fb2-e43bcd49d784@intel.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.255.230.19] MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.15 Subject: Re: [dpdk-dev] [RFC] ethdev: add ioctl-like API to control device specific features X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 08 Aug 2017 17:24:52 -0000 On Aug 4, 2017, at 6:58 AM, Ferruh Yigit > wrote: On 8/3/2017 8:53 PM, Thomas Monjalon wrote: 03/08/2017 18:15, Stephen Hemminger: On Thu, 3 Aug 2017 14:21:38 +0100 Bruce Richardson > wrote: On Thu, Aug 03, 2017 at 01:21:35PM +0100, Chilikin, Andrey wrote: To control some device-specific features public device-specific functions rte_pmd_*.h are used. But this solution requires applications to distinguish devices at runtime and, depending on the device type, call corresponding device-specific functions even if functions' parameters are the same. IOCTL-like API can be added to ethdev instead of public device-specific functions to address the following: * allow more usable support of features across a range of NIC from one vendor, but not others * allow features to be implemented by multiple NIC drivers without relying on a critical mass to get the functionality in ethdev * there are a large number of possible device specific functions, and creating individual APIs for each one is not a good solution * IOCTLs are a proven method for solving this problem in other areas, i.e. OS kernels. Control requests for this API will be globally defined at ethdev level, so an application will use single API call to control different devices from one/multiple vendors. API call may look like as a classic ioctl with an extra parameter for argument length for better sanity checks: int rte_eth_dev_ioctl(uint16_t port, uint64_t ctl, void *argp, unsigned arg_length); Regards, Andrey I think we need to start putting in IOCTLs for ethdevs, much as I hate to admit it, since I dislike IOCTLs and other functions with opaque arguments! Having driver specific functions I don't think will scale well as each vendor tries to expose as much of their driver specific functionality as possible. One other additional example: I discovered just this week another issue with driver specific functions and testpmd, when I was working on the meson build rework. * With shared libraries, when we do "ninja install" we want our DPDK libs moved to e.g. /usr/local/lib, but the drivers moved to a separate driver folder, so that they can be automatically loaded from that single location by DPDK apps [=3D=3D CONFIG_RTE_EAL_PMD_PATH]. * However, testpmd, as well as using the drivers as plugins, uses driver-specific functions, which means that it explicitly links against the pmd .so files. * Those driver .so files are not in with the other libraries, so ld.so does not find the pmd, and the installed testpmd fails to run due to missing library dependencies. * The workaround is to add the drivers path to the ld load path, but we should not require ld library path changes just to get DPDK apps to work. Using ioctls instead of driver-specific functions would solve this. My 2c. My 2c. No. Short answer: Ioctl's were a bad idea in Unix (per Dennis Ritchie et al) and are now despised by Linux kernel developers. They provide an unstructured, unsecure= d, back door for device driver abuse. Try to get a new driver in Linux with a unique ioctl, and it will be hard to get accepted. Long answer: So far every device specific feature has fit into ethdev model. Doing ioctl is admitting "it is too hard to be general, we need need an out". For somet= hing that is a flag, it should fit into existing config model; ignoring silly AB= I constraints. For a real feature (think flow direction), we want a first class API for th= at. For a wart, then devargs will do. Give a good example of something that should be an ioctl. Don't build the API first and then let it get cluttered. I agree with Stephen. And please do not forget that ioctl still requires an API: the argument that you put in ioctl is the API of the feature. So it is the same thing as defining a new function. I am also not fan of the ioctl usage. I believe it hides APIs behind ids and prevent argument check by compiler. BUT, the number of the increasing PMD specific APIs are also worrying, it is becoming harder to maintain, and I believe this is something NOT sustainable in long run. What about having *eth_dev_extended_ops* ? We had talk about adding something like device specific APIs to DPDK in the= past, which to me are just IOCTL like APIs. The big problem with IOCTLs is= trying to cram a bunch of specific requests into a very generic API and I = do not like ioctl as defined in Linux/Unix today. The old IOCTLs calls are = too opaque and difficult for compilers to test args and many other issues. We talked about having a single API in rte_eth_dev that would allow a user = to ask for and possible get a list of function pointers in a given structur= e for the requested type. If a user calls this API to get some feature from= a given NIC he would get NULL or a pointer to a set of functions. The gene= ric API in rte_eth would allow the user to request what structures or types= of APIs it supports. Using a specific API to get the list of APIs or supported features in a NIC= , will allow the developer to request the set of APIs (in an array or some = method). Then we have real APIs for specific control or requests and not a = generic API like ioctl. Cristian had suggested an API like this to make it easy to add any IOCTL li= ke needs to a driver. We can define a set of structures that seem generic f= or some IOCTL like needs or just allow the NIC to define his own structures= and APIs. Allowing the developer to define his own structures and APIs is = not very generic or usable by the users, so I would lean toward defining st= ructures set need today and expand those structures in the future or add mo= re structures. int rte_eth_dev_something(uint16_t port_id, const char *feature, void **obj= ); Using strings we can define or the NIC vendor can define to ask for a point= er to a structure he knows via a driver header. Strings are good, because w= e can read them via the debug or print them out quickly instead of trying t= o use some lookup table. Plus we can have any length or characters for defi= ning the structure request. Just off the top of my head, but it can be changed if needed. As a part of the rte_eth_dev. This can be in the librte_ether library but in a separated file. And the APIs for these ops can be less strict on compatibility, and easier to add. Benefits of having this new dev_ops: * Having an abstraction layer for common checks. * Even feature is not generic for all NICs, still a few NICs can share the ops. * All APIs are in the same file makes it easy to see PMD specific APIs comparing to scattered into various PMDs. * This is very like ioctl approach, but APIs are more clear and arguments can be verified. Thanks, ferruh The real debate is to decide if we want to continue adding more control path features in DPDK or focus on Rx/Tx. But this discussion would be better lead with some examples/requests. Regards, Keith