From: "Zhang, Qi Z"
To: "Richardson, Bruce", "Xing, Beilei"
CC: "dev@dpdk.org", "Zhang, Helin", "Yigit, Ferruh"
Date: Wed, 10 Jan 2018 07:14:56 +0000
Message-ID: <039ED4275CED7440929022BC67E706115312A5AA@SHSMSX103.ccr.corp.intel.com>
References: <20171123165314.168786-1-bruce.richardson@intel.com> <20180109143254.234428-1-bruce.richardson@intel.com>
In-Reply-To: <20180109143254.234428-1-bruce.richardson@intel.com>
Subject: Re: [dpdk-dev] [PATCH v2 0/2] AVX2 Vectorized Rx/Tx functions for i40e

> -----Original Message-----
> From: Richardson, Bruce
> Sent: Tuesday, January 9, 2018 10:33 PM
> To: Zhang, Qi Z; Xing, Beilei
> Cc: dev@dpdk.org; Zhang, Helin; Yigit, Ferruh; Richardson, Bruce
> Subject: [PATCH v2 0/2] AVX2 Vectorized Rx/Tx functions for i40e
>
> This patch adds an AVX2 vectorized path to the i40e driver, based on the
> existing SSE4.2 version. Using AVX2 instructions gives better performance
> than the SSE version, though the percentage increase depends on the exact
> settings used. For example:
>
> * Using 16B rather than 32B descriptors gives the biggest benefit since
>   2 descriptors at a time can be read, rather than just 1 when 32B ones
>   are used.
> * Bigger burst sizes for RX gives improved performance - while we see an
>   improvement with testpmd with the default burst size of 32, burst sizes
>   of up to 128 give further improvements
> * In my testing, most of the improvement comes from faster processing on
>   the RX path, though the improved TX also gives benefit.
>
> This has been tested on a system with CPU: "Intel(R) Xeon(R) Gold 6154 CPU
> @ 3.00GHz", and I've focused on testing with Rx ring sizes of approx 1k -
> generally --rxd=1024 and --txd=512, rather than the defaults which tend to
> give poorer zero-loss performance due to the smaller amount of buffering.
>
> V2:
> * Fixed incorrect config variable reference in makefile
> * Added missing stub function for when vector drivers are disabled
> * Added missing references to the new functions when checking for vector
>   code paths, e.g. for ring tear-down
>
> Bruce Richardson (2):
>   net/i40e: add AVX2 Tx function
>   net/i40e: add AVX2 Rx function
>
>  drivers/net/i40e/Makefile             |  19 +
>  drivers/net/i40e/i40e_rxtx.c          |  66 ++-
>  drivers/net/i40e/i40e_rxtx.h          |   6 +
>  drivers/net/i40e/i40e_rxtx_vec_avx2.c | 792 ++++++++++++++++++++++++++++++++++
>  4 files changed, 880 insertions(+), 3 deletions(-)
>  create mode 100644 drivers/net/i40e/i40e_rxtx_vec_avx2.c
>
> --
> 2.14.3

Acked-by: Qi Zhang