From: Ferruh Yigit
Date: Tue, 23 Jul 2024 17:45:57 +0100
Subject: Re: [PATCH] app/testpmd: improve sse based macswap
To: Vipin Varghese, dev@dpdk.org, bruce.richardson@intel.com, "Mcnamara, John", "Xu, HailinX"
Cc: konstantin.v.ananyev@yandex.ru
Message-ID: <2487809b-ab5e-450b-8d9a-11d3fa32af96@amd.com>
In-Reply-To: <20240716063724.850-1-vipin.varghese@amd.com>
References: <20240716063724.850-1-vipin.varghese@amd.com>

On 7/16/2024 7:37 AM, Vipin Varghese wrote:
> Goal of the patch is to improve SSE macswap on x86_64 by reducing
> the stalls in backend engine. Original implementation of the SSE
> macswap makes loop call to multiple load, shuffle & store.
> Using SIMD ISA interleaving we can reduce the stalls for
>  - load SSE token exhaustion
>  - Shuffle and Load dependency
>
> Also other changes which improves packet per second are
>  - Filling access to MBUF for offload flags which is separate cacheline,
>  - using register keyword
>
> Build test using meson script:
> ``````````````````````````````
>
> build-gcc-static
> buildtools
> build-gcc-shared
> build-mini
> build-clang-static
> build-clang-shared
> build-x86-generic
>
> Test Results:
> `````````````
>
> Platform-1: AMD EPYC SIENA 8594P @2.3GHz, no boost
>
> ------------------------------------------------
> TEST IO 64B: baseline
>  - mellanox CX-7 2*200Gbps : 42.0
>  - intel E810 1*100Gbps : 82.0
>  - intel E810 2*200Gbps (2CQ-DA2): 82.45
> ------------------------------------------------
> TEST MACSWAP 64B:
>  - mellanox CX-7 2*200Gbps : 31.533 : 31.90
>  - intel E810 1*100Gbps : 50.380 : 47.0
>  - intel E810 2*200Gbps (2CQ-DA2): 48.840 : 49.827
> ------------------------------------------------
> TEST MACSWAP 128B:
>  - mellanox CX-7 2*200Gbps: 30.946 : 31.770
>  - intel E810 1*100Gbps: 49.386 : 46.366
>  - intel E810 2*200Gbps (2CQ-DA2): 47.979 : 49.503
> ------------------------------------------------
> TEST MACSWAP 256B:
>  - mellanox CX-7 2*200Gbps: 32.480 : 33.150
>  - intel E810 1 * 100Gbps: 45.29 : 44.571
>  - intel E810 2 * 200Gbps (2CQ-DA2): 45.033 : 45.117
> ------------------------------------------------
>
> Platform-2: AMD EPYC 9554 @3.1GHz, no boost
>
> ------------------------------------------------
> TEST IO 64B: baseline
>  - intel E810 2*200Gbps (2CQ-DA2): 82.49
> ------------------------------------------------
>
> TEST MACSWAP: 1Q 1C1T
>  64B: : 45.0 : 45.54
>  128B: : 44.48 : 44.43
>  256B: : 42.0 : 41.99
> +++++++++++++++++++++++++
> TEST MACSWAP: 2Q 2C2T
>  64B: : 59.5 : 60.55
>  128B: : 56.78 : 58.1
>  256B: : 41.85 : 41.99
> ------------------------------------------------
>
> Signed-off-by: Vipin Varghese
>

Hi Bruce, John,

Can you please help test macswap performance with this patch on Intel
platforms, to be sure it is not causing a regression?

Another option is to get this patch in for -rc3 and test it there, on the
condition that we remove it if any regression shows up. Would that make
testing the patch easier?

Thanks,
ferruh
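
For readers following the thread, here is a rough sketch of what "interleaving" load, shuffle and store can look like. It is not the code from the patch: it works on plain byte buffers instead of rte_mbuf, the names macswap_burst and macswap_scalar are made up for illustration, and the batch size of four is an assumption. The idea is to issue the loads for a group of frames first, then the shuffles, then the stores, so that no shuffle has to wait directly on the load in front of it. The SIMD path is only built when the compiler defines __SSSE3__ (for example with -mssse3); otherwise the scalar loop handles every frame.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#ifdef __SSSE3__
#include <tmmintrin.h> /* SSSE3: _mm_shuffle_epi8 */
#endif

/* Scalar reference: swap destination MAC (bytes 0-5) with source MAC (bytes 6-11). */
static void macswap_scalar(uint8_t *hdr)
{
    uint8_t tmp[6];

    memcpy(tmp, hdr, 6);
    memcpy(hdr, hdr + 6, 6);
    memcpy(hdr + 6, tmp, 6);
}

/*
 * Swap the MAC addresses of n frames (hypothetical helper, not a DPDK API).
 * Each hdr[i] must point to at least 16 readable and writable bytes, because
 * the SIMD path loads and stores a full 16-byte vector per frame.
 */
static void macswap_burst(uint8_t **hdr, size_t n)
{
    size_t i = 0;

    assert(hdr != NULL || n == 0);

#ifdef __SSSE3__
    /* Result byte k is taken from input byte mask[k]; bytes 12-15 stay put. */
    const __m128i mask = _mm_setr_epi8(6, 7, 8, 9, 10, 11,
                                       0, 1, 2, 3, 4, 5,
                                       12, 13, 14, 15);

    for (; i + 4 <= n; i += 4) {
        /* Stage 1: four independent loads. */
        __m128i a0 = _mm_loadu_si128((const __m128i *)hdr[i + 0]);
        __m128i a1 = _mm_loadu_si128((const __m128i *)hdr[i + 1]);
        __m128i a2 = _mm_loadu_si128((const __m128i *)hdr[i + 2]);
        __m128i a3 = _mm_loadu_si128((const __m128i *)hdr[i + 3]);

        /* Stage 2: four shuffles, each depending on a load issued earlier. */
        a0 = _mm_shuffle_epi8(a0, mask);
        a1 = _mm_shuffle_epi8(a1, mask);
        a2 = _mm_shuffle_epi8(a2, mask);
        a3 = _mm_shuffle_epi8(a3, mask);

        /* Stage 3: four stores. */
        _mm_storeu_si128((__m128i *)hdr[i + 0], a0);
        _mm_storeu_si128((__m128i *)hdr[i + 1], a1);
        _mm_storeu_si128((__m128i *)hdr[i + 2], a2);
        _mm_storeu_si128((__m128i *)hdr[i + 3], a3);
    }
#endif

    /* Leftover frames, or every frame when SSSE3 is not enabled. */
    for (; i < n; i++)
        macswap_scalar(hdr[i]);
}
```

Whether this ordering actually helps depends on the CPU and on what the compiler already does with the simpler loop, which is why the numbers need to be checked per platform.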