From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from dispatch1-us1.ppe-hosted.com (dispatch1-us1.ppe-hosted.com [67.231.154.164]) by dpdk.org (Postfix) with ESMTP id 7227E2082 for ; Tue, 6 Nov 2018 08:41:09 +0100 (CET) X-Virus-Scanned: Proofpoint Essentials engine Received: from webmail.solarflare.com (uk.solarflare.com [193.34.186.16]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by mx1-us1.ppe-hosted.com (Proofpoint Essentials ESMTP Server) with ESMTPS id C8800780059; Tue, 6 Nov 2018 07:41:07 +0000 (UTC) Received: from [192.168.38.17] (91.220.146.112) by ukex01.SolarFlarecom.com (10.17.10.4) with Microsoft SMTP Server (TLS) id 15.0.1395.4; Tue, 6 Nov 2018 07:41:02 +0000 To: Thomas Monjalon , CC: Ferruh Yigit , "Lu, Wenzhuo" References: <1531373220-42150-1-git-send-email-wenzhuo.lu@intel.com> <4821031.obJuSW6AGg@xps> From: Andrew Rybchenko Message-ID: <1d2777f1-87b9-79f5-2e29-db6e53690f55@solarflare.com> Date: Tue, 6 Nov 2018 10:40:51 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.2.1 MIME-Version: 1.0 In-Reply-To: <4821031.obJuSW6AGg@xps> Content-Language: en-GB X-Originating-IP: [91.220.146.112] X-ClientProxiedBy: ocex03.SolarFlarecom.com (10.20.40.36) To ukex01.SolarFlarecom.com (10.17.10.4) X-TM-AS-Product-Ver: SMEX-12.5.0.1300-8.5.1010-24204.003 X-TM-AS-Result: No-23.962700-8.000000-10 X-TMASE-MatchedRID: yebcs53SkkAOwH4pD14DsPHkpkyUphL9Ud7Bjfo+5jSYt7LeY+Mn+Voq G7y2r1Fi5qduK012SP7ih/b1HlnM9aN0uxYINvQ12os3ueKAsFT4h+uI7dxXxE5j0OsMj9E6Tba f0ad0nbbYHOC/v3JS19U1netJunBfUlzBXm/6ZUVmVHNo7XGknV7OZ6hrwwnzc8FZmOUzKzbmxl PBgd8EzuysRepJ78jJUVdAWv33ZDzK4Jk0iIjDZ6OuVibdZNTvLAnNohUyMa3ceXQ6q2ggSsAfI MzUPnhOjRMwRxWMc3DAX32r0ss3R4UJf3YQjB6CWZbr7hxHnYRUENBIMyKD0doAm89jnq3+166X b3/Hw4O7KT8e5NWWCj+Lkb7uTiTjiqppOxi9o1aQgDXBbWe8E9tb21l1J0jcaKyfWaPPzZ9LyiF Ck+j72fbYZPZ20YJlqS2Ud07XxYVb8Ol7R+ysiriMC5wdwKqdkos2tunL8DTDTXM3VzSaIvGeRO xbU0T2Th5zZ/l8EAJNBs14IbDJJMKf5jPUBmqtsyw+ZJnFumQRWG8HyxfLI/Ypl5XkgVFjlmhcv 9mmQ6T5Dyt2Bypj4XXgs3lqLV2+cZI30b0PGWdjoaO27r+3fTpA2zZYJjv1o1zutTr+ALiuWag1 zapCiQQox+W9miHX+u80QHPJwZq1JENQO/Nrk+7KTDtx8CggY09QyLJhpOmdCgDKzwyxaw1U4SK H6t6vSScJaBmlxjDhhLoXIby/KPJRA2JqAZKz2x/FmlC/aoyb/LTS0T1K1r/DPenuktW9LFlXZd 2HA7Dt80qCYQSwTWqWebqPCncPOHOT3z1mU7aeAiCmPx4NwFkMvWAuahr8ooPRqITj5zhjYGxdK EQ2dE1a4qz+ju4/VdMV8LYj4dzk57ZTP8gZb41pVAT4zg3wgjAjCNqaR8kYUJZLVxIJ5cwzMGqh bx1rOS/FKQhdpYlGZVW2FrbLgdy89tVOrDil X-TM-AS-User-Approved-Sender: Yes X-TM-AS-User-Blocked-Sender: No X-TMASE-Result: 10--23.962700-8.000000 X-TMASE-Version: SMEX-12.5.0.1300-8.5.1010-24204.003 X-MDID: 1541490068-bGxrWfxePkQb Content-Type: text/plain; charset="utf-8"; format=flowed Content-Transfer-Encoding: 8bit X-Content-Filtered-By: Mailman/MimeDel 2.1.15 Subject: Re: [dpdk-dev] [PATCH v2] ethdev: fix device info getting X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 06 Nov 2018 07:41:09 -0000 On 10/22/18 3:13 PM, Thomas Monjalon wrote: > 22/10/2018 14:01, Ferruh Yigit: >> On 8/23/2018 9:58 AM, Andrew Rybchenko wrote: >>> On 22.08.2018 19:55, Ferruh Yigit wrote: >>>> On 8/14/2018 1:57 AM, Lu, Wenzhuo wrote: >>>>> Hi Andrew, >>>>> >>>>>> -----Original Message----- >>>>>> From: Andrew Rybchenko [mailto:arybchenko@solarflare.com] >>>>>> Sent: Monday, August 13, 2018 4:39 PM >>>>>> To: Lu, Wenzhuo ; Thomas Monjalon >>>>>> ; Yigit, Ferruh >>>>>> Cc: dev@dpdk.org >>>>>> Subject: Re: [dpdk-dev] [PATCH v2] ethdev: fix device info getting >>>>>> >>>>>> On 13.08.2018 05:50, Lu, Wenzhuo wrote: >>>>>>> Hi Thomas, >>>>>>> >>>>>>> >>>>>>>> -----Original Message----- >>>>>>>> From: Thomas Monjalon [mailto:thomas@monjalon.net] >>>>>>>> Sent: Wednesday, August 1, 2018 11:37 PM >>>>>>>> To: Lu, Wenzhuo ; Andrew Rybchenko >>>>>>>> ; Yigit, Ferruh >>>>>>>> Cc: dev@dpdk.org >>>>>>>> Subject: Re: [dpdk-dev] [PATCH v2] ethdev: fix device info getting >>>>>>>> >>>>>>>> 16/07/2018 03:58, Lu, Wenzhuo: >>>>>>>>> Hi Andrew, >>>>>>>>> >>>>>>>>>> -----Original Message----- >>>>>>>>>> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Lu, Wenzhuo >>>>>>>>>> Sent: Monday, July 16, 2018 9:08 AM >>>>>>>>>> To: Andrew Rybchenko ; dev@dpdk.org >>>>>>>>>> Cc: Yigit, Ferruh ; Thomas Monjalon >>>>>>>>>> >>>>>>>>>> Subject: Re: [dpdk-dev] [PATCH v2] ethdev: fix device info getting >>>>>>>>>> >>>>>>>>>> Hi Andrew, >>>>>>>>>> >>>>>>>>>>> -----Original Message----- >>>>>>>>>>> From: Andrew Rybchenko [mailto:arybchenko@solarflare.com] >>>>>>>>>>> Sent: Friday, July 13, 2018 4:03 PM >>>>>>>>>>> To: Lu, Wenzhuo ; dev@dpdk.org >>>>>>>>>>> Cc: Yigit, Ferruh ; Thomas Monjalon >>>>>>>>>>> >>>>>>>>>>> Subject: Re: [dpdk-dev] [PATCH v2] ethdev: fix device info getting >>>>>>>>>>> >>>>>>>>>>> Hi, Wenzhuo, >>>>>>>>>>> >>>>>>>>>>> I'm sorry, but I have more even harder questions than the previous >>>>>> one. >>>>>>>>>>> This questions are rather generic and mainly to ethdev maintainers. >>>>>>>>>>> >>>>>>>>>>> On 13.07.2018 05:42, Wenzhuo Lu wrote: >>>>>>>>>>>> The device information cannot be gotten correctly before the >>>>>>>>>>>> configuration is set. Because on some NICs the information has >>>>>>>>>>>> dependence on the configuration. >>>>>>>>>>> Thinking about it I have the following question. Is it valid >>>>>>>>>>> behaviour of the dev_info if it changes after configuration? >>>>>>>>>>> I always thought that the primary goal of the dev_info is to >>>>>>>>>>> provide information to app about device capabilities to allow app >>>>>>>>>>> configure device and queues correctly. Now we see the case when >>>>>>>>>>> dev_info changes on configure. May be it is acceptable, but it is >>>>>>>>>>> really suspicious. If we accept it, it should be documented. >>>>>>>>>>> May be dev_info should be split into parts: part which is >>>>>>>>>>> persistent and part which may depend on device configuration. >>>>>>>>>> As I remember, the similar discussion has happened :) I've raised >>>>>>>>>> the similar suggestion like this. But we don’t make it happen. >>>>>>>>>> The reason is, you see, this is the rte layer's behavior. So the >>>>>>>>>> user doesn't have to know it. From APP's PoV, it inputs the >>>>>>>>>> configuration, it calls this API "rte_eth_dev_configure". It >>>>>>>>>> doesn't know the configuration is copied before getting the info or not. >>>>>>>>>> So, to my opinion, we can still keep the behavior. We only need to >>>>>>>>>> split it into parts when we do see the case that cannot make it. >>>>>>>>> Maybe I talked too much about the patch. Think about it again. Your >>>>>>>>> comments is about how to use the APIs, rte_eth_dev_info_get, >>>>>>>> rte_eth_dev_configure. To my opinion, rte_eth_dev_info_get is just to >>>>>>>> get the info. It can be called anywhere, before configuration or >>>>>>>> after. It's reasonable the info changes with the configuration changing. >>>>>>>>> But we do have something missing, like, rte_eth_dev_capability_get >>>>>>>>> which >>>>>>>> should be stable. APP can use this API to get the necessary info >>>>>>>> before configuration. >>>>>>>>> A question, maybe a little divergent thinking, that APP should have >>>>>>>>> some >>>>>>>> intelligence to handle the capability automatically. So getting the >>>>>>>> capability is not so good and effective, looks like we still need the human >>>>>> involvement. >>>>>>>> Maybe that the reason currently we suppose APP know the capability >>>>>>>> from the paper copies, examples... >>>>>>>> >>>>>>>> I am not sure to understand all the sentences. >>>>>>>> But I agree that we should take a decision about the stability of these >>>>>> infos. >>>>>>>> Either infos cannot change after probing, or we must document that >>>>>>>> the app must request infos regularly (when?). >>>>>>> Sorry, I missed this mail. >>>>>>> >>>>>>> I have the concern that different NICs have different behavior. One info >>>>>> can be stable on a NIC but dynamic on another. Considering this, we may >>>>>> better not splitting the rte_eth_dev_info_get to 2 APIs. And comparing with >>>>>> handling this in rte layer, maybe we can let every NIC has its own decision. >>>>>>> I have an idea. Maybe we can add a parameter for potential dynamic >>>>>>> fields. Like, Changing uint16_t nb_rx_queues; to struct nb_rx_queues { >>>>>>> uint16_t value; bool stable; } >>>>>> May be it is just very bad example, but as I understand nb_rx_queues is >>>>>> mainly required to configure the device properly. Or should app configure, >>>>>> get new value, reconfigure again, get new value and so on and stop when >>>>>> previous is equal to the new one. Yes, I dramatise and it sounds really bad. >>>>>> In any case it would over-complicate interface and no single app will do it >>>>>> correctly. >>>>> I think you're talking about max_rx_queues. APP can get that info before configuration. Then configure rx queue number which is not larger than it. That's enough. >>>>> nb_rx_queues should be the number which is configured by APP and how many queues are actually used. To my opinion, it's mainly used by the GUI to show the value to human being. >>>>> >>>>> BTW, max_rx_queues could be an good example that shows that some parameters are stable on some NICs but not on other NICs. >>>>> Take Intel NICs for example (I don’t familiar with others.), normally max_rx_queues is stable on PF. But on VF, as the max number is decided by PF, it could be dynamic. When VF starts, it can get an default value from PF. If it not enough, it can request a larger one from PF. If the number works, VF can get a new number. >>>> "struct rte_eth_dev_info" is a little overloaded, it has: >>>> - static info, like *device >>>> - device limitations, max_*, *_lim >>>> - device capabilities, *_capa >>>> - suggested configurations, default_*conf >>>> - device configuration, nb_[r/t]x_queues >>>> - other, switch_info >>>> >>>> There is a concern that some values are dynamic, but this is not new, for >>>> example nb_rx/tx_queues can be changed by rte_eth_dev_rx/tx_queue_config() API >>>> and rte_eth_dev_info() output will be changed. >>> The example looks different to me. It is explicit changes directly >>> requested by the application. So, it is not a surprise that it changes. >>> >>>> For this patch suggested configuration changes based on some other config values >>>> looks ok as concept. >>>> So I think we can say after every configuration related API dev info can be >>>> changed. >>> I think that saying that any configuration changes may result in any >>> changes in dev_info is hardly helpful. I'd suggest to be more specific. >>> Yes, it is harder and will have bugs, but at least it is helpful. >> Hi Andrew, Wenzhuo, >> >> Back to this patch, which fixes an actual defect, >> >> What do you think about: >> 1- Keep existing patch but extend it as, save the original "dev->data" and >> revert it back to this original data on all error path. I guess you mean dev->data->dev_conf here. If so, OK. >> 2- Update rte_eth_dev_info() API document and say default configuration can be >> changed based on other config fields. So this reduces the scope of things can >> change in dev_info. Yes, it looks like many items in rte_eth_conf may change defaults (Rx/Tx modes, offloads, loopback mode, intr_conf). However, it should be highlighted that it is device configuration (struct rte_eth_conf), not per-queue configuration. > I think we are doing too much juggling with data in ethdev layer. > All these things should be the responsibility of the PMD. > My radical proposal would be to remove rte_eth_dev_info and integrate > all the data into rte_eth_dev_data. I think it is a bad idea. It is too error-prone since it will require from PMD to spread the logic and update rte_eth_dev_data in each place which should change it. I think existing functional interface is the right approach here. However, we cannot say that dev_info may change after any configuration action - it could make application impossible or very hard to configure the device properly (get dev_info, find the best settings, try to configure starting from the most important, rollback in the case of failure and retry other ways).