From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from dpdk.org (dpdk.org [92.243.14.124]) by inbox.dpdk.org (Postfix) with ESMTP id 0502BA04FA; Mon, 3 Feb 2020 12:51:08 +0100 (CET) Received: from [92.243.14.124] (localhost [127.0.0.1]) by dpdk.org (Postfix) with ESMTP id 3A3EA1BFC1; Mon, 3 Feb 2020 12:51:08 +0100 (CET) Received: from smtp.tuxdriver.com (charlotte.tuxdriver.com [70.61.120.58]) by dpdk.org (Postfix) with ESMTP id 6BE991BFB6 for ; Mon, 3 Feb 2020 12:51:06 +0100 (CET) Received: from [107.15.85.130] (helo=localhost) by smtp.tuxdriver.com with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1iyaFU-0007cK-0a; Mon, 03 Feb 2020 06:50:47 -0500 Date: Mon, 3 Feb 2020 06:50:34 -0500 From: Neil Horman To: Ferruh Yigit Cc: "Ananyev, Konstantin" , Thomas Monjalon , Akhil Goyal , "Trahe, Fiona" , David Marchand , Anoob Joseph , "Kusztal, ArkadiuszX" , "dev@dpdk.org" , "Richardson, Bruce" , "Mcnamara, John" , "dodji@seketeli.net" , Andrew Rybchenko , "aconole@redhat.com" Message-ID: <20200203115034.GA25978@hmswarspite.think-freely.org> References: <20191220152058.10739-1-david.marchand@redhat.com> <6121442.K2JlShyGXD@xps> <1779027.taCxCBeP46@xps> <666f2cc7-0906-7a07-a582-87800f321a00@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <666f2cc7-0906-7a07-a582-87800f321a00@intel.com> X-Spam-Score: -2.9 (--) X-Spam-Status: No Subject: Re: [dpdk-dev] [PATCH v2 4/4] add ABI checks X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On Mon, Feb 03, 2020 at 09:30:06AM +0000, Ferruh Yigit wrote: > On 2/2/2020 2:41 PM, Ananyev, Konstantin wrote: > > > >> 31/01/2020 15:16, Trahe, Fiona: > >>> On 1/30/2020 8:18 PM, Thomas Monjalon wrote: > >>>> 30/01/2020 17:09, Ferruh Yigit: > >>>>> On 1/29/2020 8:13 PM, Akhil Goyal wrote: > >>>>>> > >>>>>> I believe these enums will be used only in case of ASYM case which is experimental. > >>>>> > >>>>> Independent from being experiment and not, this shouldn't be a problem, I think > >>>>> this is a false positive. > >>>>> > >>>>> The ABI break can happen when a struct has been shared between the application > >>>>> and the library (DPDK) and the layout of that memory know differently by > >>>>> application and the library. > >>>>> > >>>>> Here in all cases, there is no layout/size change. > >>>>> > >>>>> As to the value changes of the enums, since application compiled with old DPDK, > >>>>> it will know only up to '6', 7 and more means invalid to the application. So it > >>>>> won't send these values also it should ignore these values from library. Only > >>>>> consequence is old application won't able to use new features those new enums > >>>>> provide but that is expected/normal. > >>>> > >>>> If library give higher value than expected by the application, > >>>> if the application uses this value as array index, > >>>> there can be an access out of bounds. > >>> > >>> [Fiona] All asymmetric APIs are experimental so above shouldn't be a problem. > >>> But for the same issue with sym crypto below, I believe Ferruh's explanation makes > >>> sense and I don't see how there can be an API breakage. > >>> So if an application hasn't compiled against the new lib it will be still using the old value > >>> which will be within bounds. If it's picking up the higher new value from the lib it must > >>> have been compiled against the lib so shouldn't have problems. > >> > >> You say there is no ABI issue because the application will be re-compiled > >> for the updated library. Indeed, compilation fixes compatibility issues. > >> But this is not relevant for ABI compatibility. > >> ABI compatibility means we can upgrade the library without recompiling > >> the application and it must work. > >> You think it is a false positive because you assume the application > >> "picks" the new value. I think you miss the case where the new value > >> is returned by a function in the upgraded library. > >> > >>> There are also no structs on the API which contain arrays using this > >>> for sizing, so I don't see an opportunity for an appl to have a > >>> mismatch in memory addresses. > >> > >> Let me demonstrate where the API may "use" the new value > >> RTE_CRYPTO_AEAD_CHACHA20_POLY1305 and how it impacts the application. > >> > >> Once upon a time a DPDK application counting the number of devices > >> supporting each AEAD algo (in order to find the best supported algo). > >> It is done in an array indexed by algo id: > >> int aead_dev_count[RTE_CRYPTO_AEAD_LIST_END]; > >> The application is compiled with DPDK 19.11, > >> where RTE_CRYPTO_AEAD_LIST_END = 3. > >> So the size of the application array aead_dev_count is 3. > >> This binary is run with DPDK 20.02, > >> where RTE_CRYPTO_AEAD_CHACHA20_POLY1305 = 3. > >> When calling rte_cryptodev_info_get() on a device QAT_GEN3, > >> rte_cryptodev_info.capabilities.sym.aead.algo is set to > >> RTE_CRYPTO_AEAD_CHACHA20_POLY1305 (= 3). > >> The application uses this value: > >> ++ aead_dev_count[info.capabilities.sym.aead.algo]; > >> The application is crashing because of out of bound access. > > > > I'd say this is an example of bad written app. > > It probably should check that returned by library value doesn't > > exceed its internal array size. > > +1 > > Application should ignore values >= MAX. > The example is still somewhat valid in it general principle though. While extending an ennumeration may be flagged by libabigail as an ABI breakage, its not necessecarily a false positive. By extending the ennumeration, all the previous entries in an array defined by said ennumeration remain constant in their offsets, so you can 'get away with such a change' in terms of preserving backwards compatibility in the above example, but you cannot, for example, shuffle the values in the ennumeration, as doing so would cause a functional breakage (i.e. requesting an instance of RTE_CRYPTO_AEAD_CHACHA20_POLY1305 might instead give you an instance of RTE_CRYPTO_AEAD_AES_GCM. These sorts of changes are the type that we could collectively waive in terms of ABI checking, as they should be ok, but the errors from libabigail should be taken as an indicator that this API could be rewritten (for example by removing the abi entirely, and adding an API call that returns an array of instance name and ids), so that changes of the above sort arent required. > Do you suggest we don't extend any enum or define between ABI breakage releases > to be sure bad written applications not affected? > As noted above, we could waive such corner cases, and probably be fine, but the error from the ABI check still serves a valid purpose in that its an indicator that your library API is ABI sensitive to code changes that re-architecture may address Neil