From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mga02.intel.com (mga02.intel.com [134.134.136.20]) by dpdk.org (Postfix) with ESMTP id B8F9B4A6E for ; Wed, 12 Nov 2014 15:29:45 +0100 (CET) Received: from orsmga002.jf.intel.com ([10.7.209.21]) by orsmga101.jf.intel.com with ESMTP; 12 Nov 2014 06:39:39 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.07,369,1413270000"; d="scan'208";a="635729651" Received: from irsmsx102.ger.corp.intel.com ([163.33.3.155]) by orsmga002.jf.intel.com with ESMTP; 12 Nov 2014 06:39:37 -0800 Received: from irsmsx105.ger.corp.intel.com ([169.254.7.56]) by IRSMSX102.ger.corp.intel.com ([169.254.2.25]) with mapi id 14.03.0195.001; Wed, 12 Nov 2014 14:39:28 +0000 From: "Ananyev, Konstantin" To: Olivier MATZ , Yong Wang , "Liu, Jijiang" Thread-Topic: [dpdk-dev] [PATCH v8 10/10] app/testpmd:test VxLAN Tx checksum offload Thread-Index: AQHP+igwSkdCwquBBkeqJNewvNm7F5xVaLcAgARPPrCAAFHjgIACB2EwgADtNoCAABGiEA== Date: Wed, 12 Nov 2014 14:39:28 +0000 Message-ID: <2601191342CEEE43887BDE71AB977258213A40EB@IRSMSX105.ger.corp.intel.com> References: <1414376006-31402-1-git-send-email-jijiang.liu@intel.com> <1414376006-31402-11-git-send-email-jijiang.liu@intel.com> <54588BF7.309@6wind.com> <1ED644BD7E0A5F4091CF203DAFB8E4CC01D8510E@SHSMSX101.ccr.corp.intel.com>, <5459FBB2.1040408@6wind.com> <0c654d2c0d304b45a40af6ca38b70adf@EX13-MBX-026.vmware.com> <545CFE56.60605@6wind.com> <2601191342CEEE43887BDE71AB977258213A38D2@IRSMSX105.ger.corp.intel.com> <5460E07F.6060303@6wind.com> <2601191342CEEE43887BDE71AB977258213A3F5F@IRSMSX105.ger.corp.intel.com> <54635B2B.5040603@6wind.com> In-Reply-To: <54635B2B.5040603@6wind.com> Accept-Language: en-IE, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [163.33.239.181] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Cc: "dev@dpdk.org" Subject: Re: [dpdk-dev] [PATCH v8 10/10] app/testpmd:test VxLAN Tx checksum offload X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 12 Nov 2014 14:29:47 -0000 > -----Original Message----- > From: Olivier MATZ [mailto:olivier.matz@6wind.com] > Sent: Wednesday, November 12, 2014 1:06 PM > To: Ananyev, Konstantin; Yong Wang; Liu, Jijiang > Cc: dev@dpdk.org; Thomas Monjalon > Subject: Re: [dpdk-dev] [PATCH v8 10/10] app/testpmd:test VxLAN Tx checks= um offload >=20 > Hi Konstantin, >=20 > On 11/12/2014 10:55 AM, Ananyev, Konstantin wrote: > >> From an API perspective, it looks a bit more complex to have to call > >> dev_prep_tx() before sending the packets if they have been flagged > >> for offload processing. But I admit I have no other argument. I'll be > >> happy to have more comments from other people on the list. > >> > >> I'm sending a first version of the patchset now as it's ready, it does > >> not take in account this comment, but I'm open to add it in a v2 if > >> there is a consensus on this. > >> > >> Now, knowing that: > >> - adding dev_prep_tx() will also concern hw checksum (TCP L4 checksum > >> already requires to set the TCP pseudo header checksum), so adding > >> this will change the API of an existing feature > >> - TSO is a new feature expected for 1.8 (which should be out soon) > >> > >> Do you think we need to include this for 1.8 or can we postpone your > >> proposition for after the 1.8 release? > > > > I'd say it would be good to have it done together with TSO feature. > > About changing API: I think existing applications shouldn't be affected= . > > For existing PMDs/TX offloads we don't change any rules what need to b= e filled by the app. > > We just add a new function that can do that for user. > > If the app fills required manually (as all apps have to do now) it woul= d keep working as expected. >=20 > I agree, this proposition could work without changing the current > applications. >=20 > > If you feel like it is too much work for 1.8 timeframe - > > can we at least move fix_tcp_phdr_cksum() out of TX PMD as a temporary = measure? > > Let say create a function get_ipv4_udptcp_checksum(struct rte_mbuf *m)= (in librte_net ?). > > It will calculate PSD checksum for both TSO and non-TSO case based on g= iven mbuf flags/fields. > > Then we can update testpmd/csumonly.c to use it. >=20 > I'm not sure having get_ipv4_udptcp_checksum() in librte_net would > help. The value we have to set in the TCP checksum field depends on the > PMD (altought only ixgbe is supported now). So, it would require > another parameter and a new PMD eth_ops... which looks very > similar to dev_prep_tx() (except that dev_prep_tx() can be bulked). > I think a stack will not be able to call get_udptcp_checksum(m ,port) > because it does not know the physical port at the time the packet is > built. Moreover, calling a function through a pointer is more efficient > when bulked. So I think the dev_prep_tx() you initially describe is > a better answer to the problem. Yes I understand that it might not be applicable for non-Intel NICs. Though I thought it is ok as a temporary measure as right now we support these offloads for Intel NICs only. Basically my thought was what you proposed as option 3 below. Why common function in librte_net? So people don't need to write their own each time. Plus as I remember all 3 Intel NIC types (ixgbe/igb/i40e) we support have s= imilar=20 requirements for what need to be set/calculated for these TX offloads. So my thought was that having a common function might help to avoid code du= plication in future, If/when will implement dev_prep_tx().=20 >=20 > I don't know what is the exact timeframe for 1.8, maybe Thomas can help > on this? Depending on it, we have several options: >=20 > - implement dev_prep_tx() for 1.8 in the TSO series: this implies that > the community agrees on this new API. We need to check that it will > be faster in a pipeline model (I think this is obvious) but also that > it does not penalize the run-to-completion model: introducing another > function dev_prep_tx() can result in duplicated tests in the driver > (ex: test the offload flag values). >=20 > - postpone dev_prep_tx() or similar to next version and push the current > TSO patchset (including the comments done on the list). It does not > modify the current offload API, it provides the TSO feature on ixgbe > based on a similar API concept (set the TCP phdr cksum). The drawback > is a potential performance loss when using a pipeline model. >=20 > - another option that you may prefer is to bind the API behavior to > ixgbe (for 1.8): we can ask the application to set the pseudo-header > checksum without the IP len when doing TSO, as required by the ixgbe > driver. Then, for next release, we can think about dev_prep_tx(). The > drawback of this solution is that we may go back on this choice if the > dev_prep_tx() approach is not validated by the community. My vote would be for option 3 then. Thanks Konstantin >=20 >=20 > Regards, > Olivier