From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from out1-smtp.messagingengine.com (out1-smtp.messagingengine.com [66.111.4.25]) by dpdk.org (Postfix) with ESMTP id B31FA7CE5 for ; Wed, 20 Sep 2017 17:35:40 +0200 (CEST) Received: from compute1.internal (compute1.nyi.internal [10.202.2.41]) by mailout.nyi.internal (Postfix) with ESMTP id 5FE3121004; Wed, 20 Sep 2017 11:35:40 -0400 (EDT) Received: from frontend2 ([10.202.2.161]) by compute1.internal (MEProxy); Wed, 20 Sep 2017 11:35:40 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=monjalon.net; h= cc:content-transfer-encoding:content-type:date:from:in-reply-to :message-id:mime-version:references:subject:to:x-me-sender :x-me-sender:x-sasl-enc:x-sasl-enc; s=mesmtp; bh=20E2qz5CeMmev9Z qXWIDasOX2XFeCwW+Jsztthl+NTI=; b=eY/yDdMjhGwd/wrz6+lZ1yUA8iSgT+j 29EUrMG8fSkAeeNyt79xkYxS2x/VKpr9TCXpMIZbjrCVIhbaT4tya5Stc8f+tt1/ TJP3ggq8M8Yfj/iL03cg5QJDmpMXGJDo2WfEb/UlL1TXVRSKX4qNaZDraLeCJ4s/ YOJv4Qq/AR88= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-sender:x-me-sender:x-sasl-enc:x-sasl-enc; s= fm1; bh=20E2qz5CeMmev9ZqXWIDasOX2XFeCwW+Jsztthl+NTI=; b=NVY+SNdu VfnyRNuuKmPGkQP6P0r5yKkbrw4h+cqTp//9AyRockr0ldO8RWM44GsW6poncDcO qTaYdmAHoLnlb7jHiU+b7MJVxrZYOsWrRHWY90UkkwvCMaygGbM93fZSZAJDncgu +Wjo9x5m0n21CwmYhyVi39OOANI3DaLEvqp1n9mUh7aCmqpmwi0DFl8KPJIJd3xK yoAuyogdkp83reRDhOJKoKXRmRdxcwib2UC2pYCmEGk5TaiQalUH2fWLZfYs2SOU p3CWQBdFlOrkQWCz04mhMkepPQNByJG194HDijCftxw6DPghWQncv28GsGqXJmEX q7WciEZ/WGd07g== X-ME-Sender: X-Sasl-enc: aORCRdzFSZqjJI1y3+2KV75gEaO8gKsaHdlbJTSkpUmW 1505921740 Received: from xps.localnet (184.203.134.77.rev.sfr.net [77.134.203.184]) by mail.messagingengine.com (Postfix) with ESMTPA id 06D052444E; Wed, 20 Sep 2017 11:35:40 -0400 (EDT) From: Thomas Monjalon To: Jasvinder Singh , cristian.dumitrescu@intel.com Cc: dev@dpdk.org, ferruh.yigit@intel.com Date: Wed, 20 Sep 2017 17:35:39 +0200 Message-ID: <9843308.sSVeHNjL0n@xps> In-Reply-To: <20170918091015.82824-1-jasvinder.singh@intel.com> References: <20170811124929.118564-2-jasvinder.singh@intel.com> <20170918091015.82824-1-jasvinder.singh@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="us-ascii" Subject: Re: [dpdk-dev] [PATCH v4 0/4] net/softnic: sw fall-back pmd for traffic mgmt and others X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 20 Sep 2017 15:35:41 -0000 Hi, 18/09/2017 11:10, Jasvinder Singh: > The SoftNIC PMD is intended to provide SW fall-back options for specific > ethdev APIs in a generic way to the NICs not supporting those features. I agree it is important to have a solution in DPDK to better manage SW fallbacks. One question is to know whether we can implement and maintain many solutions. We probably must choose only one solution. I have not read the code. I am just interested in the design for now. I think it is a smart idea but maybe less convenient than calling fallback from ethdev API glue code. My opinion has not changed since v1. Thanks for the detailed explanations. Let's discuss below. [...] > * RX/TX: The app reads packets from/writes packets to the "soft" port > instead of the "hard" port. The RX and TX queues of the "soft" port are > thread safe, as any ethdev. "thread safe as any ethdev"? I would say the ethdev queues are not thread safe. [...] > * Meets the NFV vision: The app should be (almost) agnostic about the NIC > implementation (different vendors/models, HW-SW mix), the app should not > require changes to use different NICs, the app should use the same API > for all NICs. If a NIC does not implement a specific feature, the HW > should be augmented with SW to meet the functionality while still > preserving the same API. This goal could also be achieved by adding the SW capability to the API. After getting capabilities of a hardware, the app could set the capability of some driver features to "SW fallback". So the capability would become a tristate: - not supported - HW supported - SW supported The unique API goal is failed if we must manage two ports, the HW port for some features and the softnic port for other features. You explain it in A5 below. [...] > Example: Create "soft" port for "hard" port "0000:04:00.1", enable the TM > feature with default settings: > --vdev 'net_softnic0,hard_name=0000:04:00.1,soft_tm=on' So the app will use only the vdev net_softnic0 which will forward packets to 0000:04:00.1? Can we say in this example that net_softnic0 owns 0000:04:00.1? Probably not, because the config of the HW must be done separately (cf. Q5). See my "ownership proposal": http://dpdk.org/ml/archives/dev/2017-September/074656.html The issue I see in this example is that we must define how to enable every features. It should be equivalent to defining the ethdev capabilities. In this example, the option soft_tm=on is probably not enough fine-grain. We could support some parts of TM API in HW and other parts in SW. [...] > Q3: Why not change the "hard" device (and keep a single device) instead of > creating a new "soft" device (and thus having two devices)? > A3: This is not possible with the current librte_ether ethdev > implementation. The ethdev->dev_ops are defined as constant structure, > so it cannot be changed per device (nor per PMD). The new ops also > need memory space to store their context data structures, which > requires updating the ethdev->data->dev_private of the existing > device; at best, maybe a resize of ethdev->data->dev_private could be > done, assuming that librte_ether will introduce a way to find out its > size, but this cannot be done while device is running. Other side > effects might exist, as the changes are very intrusive, plus it likely > needs more changes in librte_ether. Q3 is about calling SW fallback from the driver code, right? We must not implement fallbacks in drivers because it would hide it to the application. If a feature is not available in hardware, the application can choose to bypass this feature or integrate the fallback in its own workflow. > Q4: Why not call the SW fall-back dev_ops directly in librte_ether for > devices which do not support the specific feature? If the device > supports the capability, let's call its dev_ops, otherwise call the > SW fall-back dev_ops. > A4: First, similar reasons to Q&A3. This fixes the need to change > ethdev->dev_ops of the device, but it does not do anything to fix the > other significant issue of where to store the context data structures > needed by the SW fall-back functions (which, in this approach, are > called implicitly by librte_ether). > Second, the SW fall-back options should not be restricted arbitrarily > by the librte_ether library, the decision should belong to the app. > For example, the TM SW fall-back should not be limited to only > librte_sched, which (like any SW fall-back) is limited to a specific > hierarchy and feature set, it cannot do any possible hierarchy. If > alternatives exist, the one to use should be picked by the app, not by > the ethdev layer. Q4 is about calling SW callback from the API glue code, right? We could summarize Q3/Q4 as "it could be done but we propose another way". I think we must consider the pros and cons of both approaches from a user perspective. I agree the application must decide which fallback to use. We could propose one fallback in ethdev which can be enabled explicitly (see my tristate capabilities proposal above). > Q5: Why is the app required to continue to configure both the "hard" and > the "soft" devices even after the "soft" device has been created? Why > not hiding the "hard" device under the "soft" device and have the > "soft" device configure the "hard" device under the hood? > A5: This was the approach tried in the V2 of this patch set (overlay > "soft" device taking over the configuration of the underlay "hard" > device) and eventually dropped due to increased complexity of having > to keep the configuration of two distinct devices in sync with > librte_ether implementation that is not friendly towards such > approach. Basically, each ethdev API call for the overlay device > needs to configure the overlay device, invoke the same configuration > with possibly modified parameters for the underlay device, then resume > the configuration of overlay device, turning this into a device > emulation project. > V2 minuses: increased complexity (deal with two devices at same time); > need to implement every ethdev API, even those not needed for the scope > of SW fall-back; intrusive; sometimes have to silently take decisions > that should be left to the app. > V3 pluses: lower complexity (only one device); only need to implement > those APIs that are in scope of the SW fall-back; non-intrusive (deal > with "hard" device through ethdev API); app decisions taken by the app > in an explicit way. I think it is breaking what you call the NFV vision in several places. [...] > 9. [rte_ring proliferation] Thread safety requirements for ethdev > RX/TXqueues require an rte_ring to be used for every RX/TX queue > of each "soft" ethdev. This rte_ring proliferation unnecessarily > increases the memory footprint and lowers performance, especially > when each "soft" ethdev ends up on a different CPU core (ping-pong > of cache lines). I am curious to understand why you consider thread safety as a requirement for queues. No need to reply here, the question is already asked at the beginning of this email ;)