From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mails.dpdk.org (xvm-189-124.dc0.ghst.net [217.70.189.124]) by inbox.dpdk.org (Postfix) with ESMTP id 622CCA0524; Fri, 8 Jan 2021 11:29:30 +0100 (CET) Received: from [217.70.189.124] (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id 442CC140E95; Fri, 8 Jan 2021 11:29:30 +0100 (CET) Received: from mx0b-0016f401.pphosted.com (mx0b-0016f401.pphosted.com [67.231.156.173]) by mails.dpdk.org (Postfix) with ESMTP id 24DD7140DF2; Fri, 8 Jan 2021 11:29:29 +0100 (CET) Received: from pps.filterd (m0045851.ppops.net [127.0.0.1]) by mx0b-0016f401.pphosted.com (8.16.0.43/8.16.0.43) with SMTP id 108AOtWj023332; Fri, 8 Jan 2021 02:29:26 -0800 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=marvell.com; h=from : to : cc : subject : date : message-id : references : in-reply-to : content-type : content-transfer-encoding : mime-version; s=pfpt0220; bh=khWiaPG8jwIGAPScF56VQYGdkxQaGGWFT3IoanQzTGg=; b=GFNvvEXDWNDQxGpBwQAhpO7kF586nLSObPHblsmP2uwd4zLcY48lcP2Jij3OII6Xjw8q nio1fcpAgb5AsTFjcKohy9uMK8wbv6HgzU3q+YJSyRIMXmag5hmVhtIJ+0T3Zl6g1nYQ jlUNFsyINcJYW5T+HA+2zV1rL9/oK37bNRydtwivGVr7DsmKCqmpk10k0yWosGLkmyCT SXEZxaoTFd3oCmOr11K9/KfpALgS4dFpZRgImJ047jZriI4ZSOwgAnlH+BOQYn0uWexp Ux8VVQXnHwsKQFGwq4do7Zvf6lhG6GNQlURas7va1aCERs0I34DaE5lKn9UFn/aUCCRb 2w== Received: from dc5-exch01.marvell.com ([199.233.59.181]) by mx0b-0016f401.pphosted.com with ESMTP id 35wy5a3m8a-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=NOT); Fri, 08 Jan 2021 02:29:26 -0800 Received: from SC-EXCH04.marvell.com (10.93.176.84) by DC5-EXCH01.marvell.com (10.69.176.38) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Fri, 8 Jan 2021 02:29:24 -0800 Received: from DC5-EXCH02.marvell.com (10.69.176.39) by SC-EXCH04.marvell.com (10.93.176.84) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Fri, 8 Jan 2021 02:29:23 -0800 Received: from NAM10-BN7-obe.outbound.protection.outlook.com (104.47.70.100) by DC5-EXCH02.marvell.com (10.69.176.39) with Microsoft SMTP Server (TLS) id 15.0.1497.2 via Frontend Transport; Fri, 8 Jan 2021 02:29:23 -0800 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=CEC4FqlI25/CmCZYa3//Cq7JL5FMuHdamuBF0xqO0kPlbXORp/vN+cK4L/14drMjZ/w22/5c4Q6ugadCuyeopzRbHxeeJjKbKbAjmbpf1NaWDVf1HuHpjhERRViciMF9t0AtMKgiHbg2rCWQxMr39xdi+U3O+HPfsDT7JMxovlu3qFt4Zcw+G5jJNETeJTpipDzZgNomOtRWusfKdXY73m2h+lncEuj6LKg3t4cQC6QmNU8Uo8cCo6WIEL2H3Hw1MCw/Z1FoPVj3pPau+Y/0GzNV+RmpnpckyhHwm4AyNpjMBjEamH7Dlts7ZSgZ+8grjbq3Dcri7773UPUo2pdpnA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=khWiaPG8jwIGAPScF56VQYGdkxQaGGWFT3IoanQzTGg=; b=Zgwu9+GlFmJhOT3KVju20SRTIE9Y3I8vnfcC/RdlYvwrLOZdnW658lYu/bgVJ7TqkqpMRDa1OTIQqhyHrjCyFXGc7gldk8cawCtToCnHP8rs7KsEXF2XjV8VuwsMAMxJ89VwJbfvbQ/3mvbp6e3LdpMrLkcDnDlKS6XDNsL/B5ExPaBb6+d+wmDy17d6C9mfe6SBgjrfEgze+JobiA9/mxEiiPL4wiDbCYkw5dWnljawWYt6WMH0zEYGpQ48WPbY0r7cH8a4hk0lpqZuqF+HuT0o5Dptx3OJVEMezGk+YtQgBBSFdUtr6BrUzsRyK14bOZECOVr2SnO+3W7mDgQQyA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=marvell.com; dmarc=pass action=none header.from=marvell.com; dkim=pass header.d=marvell.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=marvell.onmicrosoft.com; s=selector1-marvell-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=khWiaPG8jwIGAPScF56VQYGdkxQaGGWFT3IoanQzTGg=; b=CoNf1THkZbvFF0/qN2GqJqDkoSpmu1Igwaa1c/1dNlbgCY3WTlzR7CD8C1MS/uiqTNehFspKzcFe7A6Hwv95L0bS34uiC54ldwV17mReLahrbDZ9Ixfv/OvOMWfph9LUgu6XA+hSGrv2mhdwK54rg5RVW7IPUJgAlAPqSnSgONs= Received: from CO6PR18MB3828.namprd18.prod.outlook.com (2603:10b6:5:346::20) by MWHPR18MB1213.namprd18.prod.outlook.com (2603:10b6:320:2a::11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3742.6; Fri, 8 Jan 2021 10:29:20 +0000 Received: from CO6PR18MB3828.namprd18.prod.outlook.com ([fe80::5c2c:a13a:4f3d:94df]) by CO6PR18MB3828.namprd18.prod.outlook.com ([fe80::5c2c:a13a:4f3d:94df%5]) with mapi id 15.20.3742.006; Fri, 8 Jan 2021 10:29:20 +0000 From: Pavan Nikhilesh Bhagavatula To: Ruifeng Wang , Jerin Jacob Kollanukkaran , Nithin Kumar Dabilpuram CC: "dev@dpdk.org" , "vladimir.medvedkin@intel.com" , "hemant.agrawal@nxp.com" , "honnappa.nagarahalli@arm.com" , "nd@arm.com" , "stable@dpdk.org" Thread-Topic: [EXT] [PATCH v2 4/5] common/octeontx2: fix build with sve enabled Thread-Index: AQHW5ZgKG1rY3bje1kSZyQvsNrABo6odf9vQ Date: Fri, 8 Jan 2021 10:29:20 +0000 Message-ID: References: <20201218101210.356836-1-ruifeng.wang@arm.com> <20210108082523.1062058-1-ruifeng.wang@arm.com> <20210108082523.1062058-5-ruifeng.wang@arm.com> In-Reply-To: <20210108082523.1062058-5-ruifeng.wang@arm.com> Accept-Language: en-IN, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: arm.com; dkim=none (message not signed) header.d=none;arm.com; dmarc=none action=none header.from=marvell.com; x-originating-ip: [2405:201:d009:380f:7025:a7e9:ea:4c26] x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 8c7866d9-54b2-483d-b6b6-08d8b3c042a2 x-ms-traffictypediagnostic: MWHPR18MB1213: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:6430; x-ms-exchange-senderadcheck: 1 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: AkPFzFyHt/7bsvuFwsidKVqD88yRI+9azT8toL6CHhl6rr6NaF/6pInSIsAJ7nc/p0BjADBxXomuz/1y+c5ZG4j0nZiJi5IkRPT5ACY7BEl94hhS9pW/ohB6c6hnTHEUmyo51pIVRbmqe7QJUbbPcel1lWQgfL/vRSWHbdUkESnVHhJCh/oIb7wDyxzXP5ZUROSR7D+MtDVXRcJUrHO8EB6aBN8W8iyDbpQ1CW0aV0Ojzhyk6Ld41Uli1A3HYhe8EeuVMjHFP80VVHD4nhlltiSas2Ontq3etT7EETJ8AIOzUb2OobbA06nlML1/PJ/XoiI7w1jaFU6jlDhRdFjBt2FdqJ/LtBv2D77pgkamlsQBGRbjU+6/QHmUlfUWpwSLcYGLD3xew/O9Al9ShJ9bJ25bE51JzX/ZpYW4U3k8A49nGpZOPCiMKCRllSgL5znP x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:CO6PR18MB3828.namprd18.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(376002)(39860400002)(366004)(396003)(346002)(136003)(5660300002)(8936002)(4326008)(66476007)(66556008)(8676002)(64756008)(66446008)(6636002)(52536014)(66946007)(76116006)(316002)(55016002)(33656002)(9686003)(54906003)(86362001)(7696005)(186003)(110136005)(478600001)(6506007)(83380400001)(2906002)(71200400001)(357404004); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata: =?us-ascii?Q?Hgx/CWdSulyZgtq0nxKAF6ogDC0EqX87uAZyH1N/q8NxjQ9wZKUU5YMxxYw2?= =?us-ascii?Q?6CTIOPfzlTr6joDbADouFhTkIYpjyc4TgFVXTtSiymuvL7lVaAH05UAf0qtq?= =?us-ascii?Q?2OM1Mgj/7pTQvbgwkRjmO9v88+hw9MxgXG/glRMUIY1uZ53vUWuoRFWzFbWl?= =?us-ascii?Q?fYux8TEaFFw/Mu8Jn6ZSVYw+fI2WkZOt5MC81Gj3YTgiSF50IFMra2NDfybn?= =?us-ascii?Q?lUcnWQFkyzUTSkI6L/HCExXCbMkVbBr7wkzcisE3VeTg5EJni6d+FjH8g2P0?= =?us-ascii?Q?7Mt5VctL0rmcZ80TpoN2Cct/dU3Phk1fwREKNFGlrG/0K7ZMwIYsenBkGc5k?= =?us-ascii?Q?TjuNUhDcA8gnKsSqHcR2g2phCbhYuLeqp/ExfRrebPe+xBoUKiyOEeABo9zG?= =?us-ascii?Q?0VutvvcTbvqR29eEnDFHj7ydlwR2M0RB47SNUCWfhFERoTlftiqxD7ItoHkH?= =?us-ascii?Q?PnizZ3BjwmvWcxhYgkquRr/oHLwkGbHO0QywZrLF0htsTzLucoTRMEjEozu7?= =?us-ascii?Q?80gtLBqzHnZ1bt6lKpxOS1mVDFcu2AYUdOKCvM74F67xu+WBNe3rV6r+wDmQ?= =?us-ascii?Q?iASc6CwXlkT827rQxctUKM2Yudz22tzkThilabJLzVJWNeuBK++RQxodhQVk?= =?us-ascii?Q?LxAGFuj3/33kZ9JVgYO2kGfXZxdU5IkGN0xoxwGzaKMUMEm5t41fHdzJy1Mq?= =?us-ascii?Q?li879o91zEsSFqrL2VD7GZCRim9qDle3/hV2HuHYgRtvstmwEO23aEtntd/Q?= =?us-ascii?Q?KMaUHcVH9r8FpyHi7YraODo+C63M5l7qi/22T3KFT3D10wTW5SsFD0YzEZvX?= =?us-ascii?Q?3rIHUw8GREnIMJzfVWqFOqR7+JElxXs0whFekjZz/4/1rZBezX/lAvljHidA?= =?us-ascii?Q?Slf9cwhuynu/FmZVppoaG0N48D8INy319Tdx5YSeJd29PG1pYo6EUU1EVowD?= =?us-ascii?Q?ZekAPpCujBqPchEgnSWcjIBibdU4MlRqdDzz+tJHzZLB/03VjROTDqxBShpo?= =?us-ascii?Q?6B1OvhNRSY2amrqt3TlmkEbs8As6Ce0qfBcNWcTsdQSNMNI=3D?= Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: CO6PR18MB3828.namprd18.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: 8c7866d9-54b2-483d-b6b6-08d8b3c042a2 X-MS-Exchange-CrossTenant-originalarrivaltime: 08 Jan 2021 10:29:20.4344 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 70e1fb47-1155-421d-87fc-2e58f638b6e0 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: VqatFSJlR7QL2/leNcE3PEBBBPR2rVnanfK+C3VaXHIzDqncfu5EteQntU8JnkjV4y1mA0nehAz/yVzKmwJH6WCvrmU7nkX7KjjO5o/uZcg= X-MS-Exchange-Transport-CrossTenantHeadersStamped: MWHPR18MB1213 X-OriginatorOrg: marvell.com X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.343, 18.0.737 definitions=2021-01-08_06:2021-01-07, 2021-01-08 signatures=0 Subject: Re: [dpdk-dev] [EXT] [PATCH v2 4/5] common/octeontx2: fix build with sve enabled X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" Hi Ruifeng, >Building with gcc 10.2 with SVE extension enabled got error: > >{standard input}: Assembler messages: >{standard input}:4002: Error: selected processor does not support `mov >z3.b,#0' >{standard input}:4003: Error: selected processor does not support >`whilelo p1.b,xzr,x7' >{standard input}:4005: Error: selected processor does not support `ld1b >z0.b,p1/z,[x8]' >{standard input}:4006: Error: selected processor does not support >`whilelo p4.s,wzr,w7' > >This is because inline assembly code explicitly resets cpu model to >not have SVE support. Thus SVE instructions generated by compiler >auto vectorization got rejected by assembler. > >Fixed the issue by replacing inline assembly with equivalent atomic >built-ins. Compiler will generate LSE instructions for cpu that has >the extension. > >Fixes: 8a4f835971f5 ("common/octeontx2: add IO handling APIs") >Cc: jerinj@marvell.com >Cc: stable@dpdk.org > >Signed-off-by: Ruifeng Wang >--- > drivers/common/octeontx2/otx2_io_arm64.h | 37 +++-------------------- >- > 1 file changed, 4 insertions(+), 33 deletions(-) > >diff --git a/drivers/common/octeontx2/otx2_io_arm64.h >b/drivers/common/octeontx2/otx2_io_arm64.h >index b5c85d9a6..8843a79b5 100644 >--- a/drivers/common/octeontx2/otx2_io_arm64.h >+++ b/drivers/common/octeontx2/otx2_io_arm64.h >@@ -24,55 +24,26 @@ > static __rte_always_inline uint64_t > otx2_atomic64_add_nosync(int64_t incr, int64_t *ptr) > { >- uint64_t result; >- > /* Atomic add with no ordering */ >- asm volatile ( >- ".cpu generic+lse\n" >- "ldadd %x[i], %x[r], [%[b]]" >- : [r] "=3Dr" (result), "+m" (*ptr) >- : [i] "r" (incr), [b] "r" (ptr) >- : "memory"); >- return result; >+ return (uint64_t)__atomic_fetch_add(ptr, incr, >__ATOMIC_RELAXED); > } > Here LDADD acts as a way to interface to co-processors i.e.=20 LDADD instruction opcode + specific io address are recognized by=20 HW interceptor and dispatched to the specific coprocessor. Leaving it to the compiler to use the correct instruction is a bad idea. This breaks the arm64_armv8_linux_gcc build as it doesn't have the +lse enabled. __atomic_fetch_add will generate a different instruction with SVE=20 enabled. Instead can we add +sve to the first line to prevent outer loop from optimi= zing out=20 the trap? I tested with 10.2 and n2 config below change works fine. -" .cpu generic+lse\n" +" .cpu generic+lse+sve\n" Regards, Pavan. > static __rte_always_inline uint64_t > otx2_atomic64_add_sync(int64_t incr, int64_t *ptr) > { >- uint64_t result; >- >- /* Atomic add with ordering */ >- asm volatile ( >- ".cpu generic+lse\n" >- "ldadda %x[i], %x[r], [%[b]]" >- : [r] "=3Dr" (result), "+m" (*ptr) >- : [i] "r" (incr), [b] "r" (ptr) >- : "memory"); >- return result; >+ return (uint64_t)__atomic_fetch_add(ptr, incr, >__ATOMIC_ACQUIRE); > } > > static __rte_always_inline uint64_t > otx2_lmt_submit(rte_iova_t io_address) > { >- uint64_t result; >- >- asm volatile ( >- ".cpu generic+lse\n" >- "ldeor xzr,%x[rf],[%[rs]]" : >- [rf] "=3Dr"(result): [rs] "r"(io_address)); >- return result; >+ return __atomic_fetch_xor((uint64_t *)io_address, 0, >__ATOMIC_RELAXED); > } > > static __rte_always_inline uint64_t > otx2_lmt_submit_release(rte_iova_t io_address) > { >- uint64_t result; >- >- asm volatile ( >- ".cpu generic+lse\n" >- "ldeorl xzr,%x[rf],[%[rs]]" : >- [rf] "=3Dr"(result) : [rs] "r"(io_address)); >- return result; >+ return __atomic_fetch_xor((uint64_t *)io_address, 0, >__ATOMIC_RELEASE); > } > > static __rte_always_inline void >-- >2.25.1