From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp.tuxdriver.com (charlotte.tuxdriver.com [70.61.120.58]) by dpdk.org (Postfix) with ESMTP id 66D155ABA for ; Mon, 23 Mar 2015 19:45:53 +0100 (CET) Received: from cpe-098-026-070-093.nc.res.rr.com ([98.26.70.93] helo=localhost) by smtp.tuxdriver.com with esmtpsa (TLSv1:AES128-SHA:128) (Exim 4.63) (envelope-from ) id 1Ya7MB-0006pb-Q8; Mon, 23 Mar 2015 14:45:50 -0400 Date: Mon, 23 Mar 2015 14:45:42 -0400 From: Neil Horman To: Olivier MATZ Message-ID: <20150323184542.GD5661@hmsreliant.think-freely.org> References: <1426710078-11230-1-git-send-email-vadim.suraev@gmail.com> <20150318205856.GA5151@neilslaptop.think-freely.org> <550A8BB5.9040104@6wind.com> <20150319131639.GB1992@hmsreliant.think-freely.org> <551042D2.5040300@6wind.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <551042D2.5040300@6wind.com> User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -2.9 (--) X-Spam-Status: No Cc: dev@dpdk.org Subject: Re: [dpdk-dev] [PATCH v2] rte_mbuf: mbuf bulk alloc/free functions added + unittest X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Mar 2015 18:45:53 -0000 On Mon, Mar 23, 2015 at 05:44:02PM +0100, Olivier MATZ wrote: > Hi Neil, > > On 03/19/2015 02:16 PM, Neil Horman wrote: > >> On 03/18/2015 09:58 PM, Neil Horman wrote: > >>>> +/** > >>>> + * Free a bulk of mbufs into its original mempool. > >>>> + * This function assumes: > >>>> + * - refcnt equals 1 > >>>> + * - mbufs are direct > >>>> + * - all mbufs must belong to the same mempool > >>>> + * > >>>> + * @param mbufs > >>>> + * Array of pointers to mbuf > >>>> + * @param count > >>>> + * Array size > >>>> + */ > >>>> +static inline void rte_pktmbuf_bulk_free(struct rte_mbuf **mbufs, > >>>> + unsigned count) > >>>> +{ > >>>> + unsigned idx; > >>>> + > >>>> + RTE_MBUF_ASSERT(count > 0); > >>>> + > >>>> + for (idx = 0; idx < count; idx++) { > >>>> + RTE_MBUF_ASSERT(mbufs[idx]->pool == mbufs[0]->pool); > >>>> + RTE_MBUF_ASSERT(rte_mbuf_refcnt_read(mbufs[idx]) == 1); > >>>> + rte_mbuf_refcnt_set(mbufs[idx], 0); > >>> This is really a misuse of the API. The entire point of reference counting is > >>> to know when an mbuf has no more references and can be freed. By forcing all > >>> the reference counts to zero here, you allow the refcnt infrastructure to be > >>> circumvented, causing memory leaks. > >>> > >>> I think what you need to do here is enhance the underlying pktmbuf interface > >>> such that an rte_mbuf structure has a destructor method association with it > >>> which is called when its refcnt reaches zero. That way the > >>> rte_pktmbuf_bulk_free function can just decrement the refcnt on each > >>> mbuf_structure, and the pool as a whole can be returned when the destructor > >>> function discovers that all mbufs in that bulk pool are freed. > >> > >> I don't really understand what's the problem here. The API explicitly > >> describes the conditions for calling this functions: the segments are > >> directs, they belong to the same mempool, and their refcnt is 1. > >> > >> This function could be useful in a driver which knows that the mbuf > >> it allocated matches this conditions. I think an application that > >> only uses direct mbufs and one mempool could also use this function. > > > > > > That last condition is my issue with this patch, that the user has to know what > > refcnts are. It makes this api useful for little more than the test case that > > is provided with it. Its irritating enough that for singly allocated mbufs the > > user has to know what the refcount of a buffer is before freeing, but at least > > they can macrotize a {rte_pktmbuf_refcnt_update; if(rte_pktmbuf_refct_read) then > > free} operation. > > > > With this, you've placed the user in charge of not only doing that, but also of > > managing the relationship between pktmbufs and the pool they came from. while > > that makes sense for the test case, it really doesn't in any general use case in > > which packet processing is ever deferred or queued, because it means that the > > application is now responsible for holding a pointer to every packet it > > allocates and checking its refcount periodically until it completes. > > > > There is never any reason that an application won't need to do this management, > > so making it the purview of the application to handle rather than properly > > integrating that functionality in the library is really a false savings. > > There are some places where you know that the prerequisites are met, > so you can save cycles by using this function. > > From what I imagine, if in a driver you allocate mbufs, chain them and > for some reason you realize you have to free them, you can use this > function instead of freeing them one by one. > > Also, as it's up to the application to decide how many mbuf pools are > created, and whether indirect mbufs are used or not, the application > can take the short path of using this function in some conditions. > > Vadim, maybe you have another reason or use case for adding this > function? Could you detail why you need it and how it improves your > use case? > > Regards, > Olivier > So, I think we're making different points here. You seem to be justifying the API as it exists by finding use cases that fit into its documented restrictions (direct buffers, refcounts at 1, etc), which severely limit that use case set. My assertion is that those restrictions were created because it was inconvienient to code using the reference count as intended. I'm saying lets augment the reference counting mechanism so that we can use these specially allocated mbufs in a wider variety of use cases beyond the limited set they are currently good for Neil