From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from dpdk.org (dpdk.org [92.243.14.124]) by inbox.dpdk.org (Postfix) with ESMTP id 27ED8A04DD; Thu, 22 Oct 2020 17:01:53 +0200 (CEST) Received: from [92.243.14.124] (localhost [127.0.0.1]) by dpdk.org (Postfix) with ESMTP id 0DDACAA21; Thu, 22 Oct 2020 17:01:52 +0200 (CEST) Received: from hqnvemgate25.nvidia.com (hqnvemgate25.nvidia.com [216.228.121.64]) by dpdk.org (Postfix) with ESMTP id 733EAAA15 for ; Thu, 22 Oct 2020 17:01:49 +0200 (CEST) Received: from hqmail.nvidia.com (Not Verified[216.228.121.13]) by hqnvemgate25.nvidia.com (using TLS: TLSv1.2, AES256-SHA) id ; Thu, 22 Oct 2020 08:01:00 -0700 Received: from HQMAIL107.nvidia.com (172.20.187.13) by HQMAIL101.nvidia.com (172.20.187.10) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Thu, 22 Oct 2020 15:01:48 +0000 Received: from NAM02-CY1-obe.outbound.protection.outlook.com (104.47.37.51) by HQMAIL107.nvidia.com (172.20.187.13) with Microsoft SMTP Server (TLS) id 15.0.1473.3 via Frontend Transport; Thu, 22 Oct 2020 15:01:48 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=mIl3Ic5CfZWMYNegEmXCf6Hg9JtPL+jU5mIzTcAYtOJSE2b7nwIBWb37Jsc5HZ8ZmxJliL9Mjb91zJEP1jrU+Vs1NyIwjtQNDsYCgZNfCwYns0ykFSfHYaz6yUep4X66PQzHcj8bJPzwHGgOCij0o6EbdF8gWTi6y4WVEDpY97tWnhUok/YFrBWZipsSM5L9jXN/riZhc/ngukbvyc7AUVT8L2fLEHcTJt9jtppRB49alVvHkv6LS7zSINX6XBKRDOpbLH0EsjBJNbqoxhRWgKcwhJmRz+1qpIhhEBgkTpJZ05AZ5RGSX7JDVescYnzeNHtXOKXynk/yy+ON2GYL/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=BH/4JgqgMRKLySIArHY5aWZgiZRWqAKm7YjeQrab7Nc=; b=MdHVM+ryimGYCbx7y79Qx8o4K+WTg23Eyy01Oi194KbTEuXUtbUBTn+66fXkczkcMYXMFwUy6iQgFFVwzMed2C6utll7taf1MXF6vdWTNpus0BfJ7s8rSNTQX6O0ObC4kA399D6x0ep6JWpMlSeNNzKPLYt6aYunSuUwFuyq993mtQsnBBHy2eXrEqupdznCetko0vcJc2HbDq7NzPK34makFKNeSjknii+TgtQ/n9QHWn65cMC+AcUuMaL2xCpfOSV/wPJxNA9DibkaBMmRTPxJ6hzWxkk8XwH1DkswxmhnozC8J5TtddEaOVGX1PKJYAezeFY4aDQ96ewApBddkA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nvidia.com; dmarc=pass action=none header.from=nvidia.com; dkim=pass header.d=nvidia.com; arc=none Received: from DM6PR12MB2748.namprd12.prod.outlook.com (2603:10b6:5:43::28) by DM5PR12MB1595.namprd12.prod.outlook.com (2603:10b6:4:3::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3499.18; Thu, 22 Oct 2020 15:01:47 +0000 Received: from DM6PR12MB2748.namprd12.prod.outlook.com ([fe80::f506:4af8:281b:cdeb]) by DM6PR12MB2748.namprd12.prod.outlook.com ([fe80::f506:4af8:281b:cdeb%7]) with mapi id 15.20.3477.028; Thu, 22 Oct 2020 15:01:47 +0000 From: Raslan Darawsheh To: Alexander Kozyrev , "dev@dpdk.org" CC: Matan Azrad , Slava Ovsiienko Thread-Topic: [PATCH v2 0/2] net/mlx5: add vectorized mprq Thread-Index: AQHWp+kdkWrEH209eEiAc6njpf3+1qmjuJPg Date: Thu, 22 Oct 2020 15:01:46 +0000 Message-ID: References: <20200719041142.14485-1-akozyrev@mellanox.com> <20201021203030.19042-1-akozyrev@nvidia.com> In-Reply-To: <20201021203030.19042-1-akozyrev@nvidia.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: nvidia.com; dkim=none (message not signed) header.d=none;nvidia.com; dmarc=none action=none header.from=nvidia.com; x-originating-ip: [212.29.221.74] x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 2694f6b8-6029-49c5-e72e-08d8769b65a1 x-ms-traffictypediagnostic: DM5PR12MB1595: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:9508; x-ms-exchange-senderadcheck: 1 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: 9wrAWP5irNpu/ZLnk2VR6bQ839TBVDowkA/Q6nBrnUErFzYwLLcPVVKLvOBL/IJw1OY43OL+MQrrSwnTvKTILL4FtddeoaVNvfMIHiTIUTQm/I56hIXO0Bn23WagHWhTVGFKUNGw7iTQb6dpzb8wXVEoq7bncpW9lNK42HePs70o7KwuNawVny2JBs2HPfbRFai/+wG9FVb0u6dbz3iMQ79MMMS0/IXh3XG4N7UFWrVRVfgcGQqhz6VVLpVoa3KLBW1CLBspDWg+fSlIar3xmooDLruJtIN3LLC9zrMibcAijeQC0zDfWO4Q7QcdjliL x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:DM6PR12MB2748.namprd12.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(346002)(366004)(396003)(376002)(136003)(39860400002)(54906003)(8676002)(186003)(110136005)(26005)(6506007)(316002)(478600001)(8936002)(7696005)(53546011)(86362001)(33656002)(4326008)(2906002)(107886003)(76116006)(52536014)(9686003)(71200400001)(66476007)(55016002)(66446008)(5660300002)(64756008)(66946007)(66556008)(83380400001); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata: PwqloZLkDYIbJ9KRBw06FP59nVZj5fpG8xKvptkNTA9XMo+q7y1bmbPyrZ8iPVhNQShdp5lXERz/bYnWcHg3wxGMcAZ9320gEpB5DhbrIZLs4NHY/n5B61x1aZYV5vIEzcO9wkffhGXFkzpRRV9mYOcyFUMfxaiapBdbQXlzG/BobCO/eejM/CitK9j/VjpqvFczcIhrID2jmuuRQ5w5kDiR1WCP4s1b8ZfmTvc0Ugqn9STAwCebq4a0oAtgUKqIwlgl7iwJk2/a7iHUFg1v6rfghCnqvlruQw4nOGJPD2unXrVRl4EWvQolgDsjE231v6vhWJ75fGzQM4X4TTOMzNz/vApcXVqS60rwB2YmUe3wHHu1DVHlPwls8bx8l+8U6v6Uiw2zGDE8N013S23i4N18rCwsOYn47XgKWIcmT7ujiEtIr5ETG5eLgxE64J2zvyRaBvJJPDidtraK3gslIj7mDAenl+sbwW7HS+eWbSh67hHXmockxpIagoCKO9Pg/xqoQbFq1+ZHLEoZyd7KyJHjCu/Pb7GAmQUYJ6rvfIO74/Mdbd+Xtmt149QTBZw9KZozoxKNAD5xhV3rHau63Cy3Y/L09yenNBskR2ndG9yfCXz/Xm5YibuutpYluRX9ydW4+vJqLTClMrRNbM4TGQ== Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: DM6PR12MB2748.namprd12.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: 2694f6b8-6029-49c5-e72e-08d8769b65a1 X-MS-Exchange-CrossTenant-originalarrivaltime: 22 Oct 2020 15:01:46.8743 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 43083d15-7273-40c1-b7db-39efd9ccc17a X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: kmeMox8Gpr3DpuAyCJHO1O52Hlzx6WG56YQZUZrm+3q59G+qpctMjECJmsnR/XSfaf82J6RuVx/uE0jrpk8iiA== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM5PR12MB1595 X-OriginatorOrg: Nvidia.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nvidia.com; s=n1; t=1603378860; bh=BH/4JgqgMRKLySIArHY5aWZgiZRWqAKm7YjeQrab7Nc=; h=ARC-Seal:ARC-Message-Signature:ARC-Authentication-Results:From:To: CC:Subject:Thread-Topic:Thread-Index:Date:Message-ID:References: In-Reply-To:Accept-Language:Content-Language:X-MS-Has-Attach: X-MS-TNEF-Correlator:authentication-results:x-originating-ip: x-ms-publictraffictype:x-ms-office365-filtering-correlation-id: x-ms-traffictypediagnostic:x-ms-exchange-transport-forked: x-microsoft-antispam-prvs:x-ms-oob-tlc-oobclassifiers: x-ms-exchange-senderadcheck:x-microsoft-antispam: x-microsoft-antispam-message-info:x-forefront-antispam-report: x-ms-exchange-antispam-messagedata:Content-Type: Content-Transfer-Encoding:MIME-Version: X-MS-Exchange-CrossTenant-AuthAs: X-MS-Exchange-CrossTenant-AuthSource: X-MS-Exchange-CrossTenant-Network-Message-Id: X-MS-Exchange-CrossTenant-originalarrivaltime: X-MS-Exchange-CrossTenant-fromentityheader: X-MS-Exchange-CrossTenant-id:X-MS-Exchange-CrossTenant-mailboxtype: X-MS-Exchange-CrossTenant-userprincipalname: X-MS-Exchange-Transport-CrossTenantHeadersStamped:X-OriginatorOrg; b=USSPaZJBWZCilip+RKslJ+uM1hWfRVV9nPPQHUZUmapSYR4YcmvlQPRhz8j4RfgNE ha4MgT40iAhGcqDZZ29tgPJb4abOPEq0YFimo3fZY4FCNbk8crU6CsHA+ROrc4X4t6 2fLkqKtlWsikGXVk6ina+L+3UGOMJlpkjWcig/67q2CakTTYFaA9ysLwGQyqQo2XOP +YNEKHnqVnHikhV+kqiZ6+3rhLYDltGNygVIjWERWHI17Kc74FTfVwHDf4DltPkWzA XAmvr5Fc/2veo6gr3J8xGBSKe4526YNb3nlQCKfgFwRNC3PvkeTiTHUrOSF0chcI0v G8DcLHKslVk8Q== Subject: Re: [dpdk-dev] [PATCH v2 0/2] net/mlx5: add vectorized mprq X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" Hi, > -----Original Message----- > From: Alexander Kozyrev > Sent: Wednesday, October 21, 2020 11:30 PM > To: dev@dpdk.org > Cc: Raslan Darawsheh ; Matan Azrad > ; Slava Ovsiienko > Subject: [PATCH v2 0/2] net/mlx5: add vectorized mprq >=20 > The vectorized Rx burst function helps to accelerate the Rx processing > by using SIMD (single instruction, multiple data) extensions for the > multi-buffer packet processing. Pre-allocating multiple mbufs and > filling them in batches of four greatly improves the throughput of the > Rx burst routine. >=20 > MPRQ (Multi-Packet Rx Queue) lacks the vectorized version currently. > It works by posting a single large buffer (consisted of multiple > fixed-size strides) in order to receive multiple packets at once on this > buffer. A Rx packet is then copied to a user-provided mbuf or PMD > attaches the Rx packet to the mbuf by the pointer to an external buffer. >=20 > It is proposed to add a vectorized MPRQ Rx routine to speed up the MPRQ > buffer handling as well. It would require pre-allocation of multiple > mbufs every time we exhaust all the strides from the current MPRQ buffer > and switch to a new one. The new mlx5_rx_burst_mprq_vec() routine will > take care of this as well as of decision on whether should we copy or > attach an external buffer for a packet. The batch processing logic won't > be different from the simple vectorized Rx routine. >=20 > The new vectorized MPRQ burst function is going to be selected > automatically whenever the mprq_en devarg is specified. If SIMD is not > available on the platform we fall back to the simple MPRQ Rx burst > function. LRO is not supported by the vectorized MPRQ version and fall > back to the regular MPRQ will be performed. >=20 >=20 > Alexander Kozyrev (2): > net/mlx5: refactor vectorized Rx routine > net/mlx5: implement vectorized MPRQ burst >=20 > drivers/net/mlx5/mlx5_devx.c | 15 +- > drivers/net/mlx5/mlx5_ethdev.c | 20 +- > drivers/net/mlx5/mlx5_rxq.c | 96 +++--- > drivers/net/mlx5/mlx5_rxtx.c | 237 ++++--------- > drivers/net/mlx5/mlx5_rxtx.h | 200 ++++++++++- > drivers/net/mlx5/mlx5_rxtx_vec.c | 416 ++++++++++++++++++++++- > drivers/net/mlx5/mlx5_rxtx_vec.h | 55 --- > drivers/net/mlx5/mlx5_rxtx_vec_altivec.h | 106 ++---- > drivers/net/mlx5/mlx5_rxtx_vec_neon.h | 103 ++---- > drivers/net/mlx5/mlx5_rxtx_vec_sse.h | 121 ++----- > 10 files changed, 813 insertions(+), 556 deletions(-) >=20 > -- > 2.24.1 Series applied to next-net-mlx, Kindest regards, Raslan Darawsheh