From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-ot0-f170.google.com (mail-ot0-f170.google.com [74.125.82.170]) by dpdk.org (Postfix) with ESMTP id 70BF12A5B for ; Mon, 23 Jan 2017 14:49:02 +0100 (CET) Received: by mail-ot0-f170.google.com with SMTP id 104so102492978otd.3 for ; Mon, 23 Jan 2017 05:49:02 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=Kkoft0wNrm96Byeq+rM0CN+rfAvK8khVNEUqoUqe4rw=; b=D/fQYpTnW8bP4a+XBz0aTj4OduaGLztn/a/XlvadJRqA61D9su1ay0QQrQA5km7Xa7 q4bEYOlDmNBiBUmkzeRmrGJY1L83MA0HOOTX7HQxeFWGNqSbeScOrBmnX83HlDYYYACi CI6GtfqYp3qL03QPoUbh+PaD+FiFg3eq/pIQE4DmdgIG/PauyeKyyKHJsN6oq1LPJmI3 jWxoujzl/m9DkUovEhy7XGa0MLr3Y9LctfFjPJmCIXSb9zXgZoZ/9oha7ThQTAwNMpzn 7u5inoydf2zJtKVAyR0PnTn2Xm2jQoMARhlULtkcsA5RvgWgMZbtJblR/X9K4A8cKTFc cCbw== X-Gm-Message-State: AIkVDXJQ0dZDNaLBN2XgudCY1GR1eApWEaaPXYyKHeISJUTAHEl2J9LTsQRMxGEABqcH4YOw1xt0Xhwf8T2XVnSH X-Received: by 10.157.46.67 with SMTP id c3mr14648115otd.163.1485179341609; Mon, 23 Jan 2017 05:49:01 -0800 (PST) MIME-Version: 1.0 Received: by 10.202.198.211 with HTTP; Mon, 23 Jan 2017 05:49:01 -0800 (PST) In-Reply-To: <6A0DE07E22DDAD4C9103DF62FEBC09093B55CADD@shsmsx102.ccr.corp.intel.com> References: <20170120160109.1088-1-bmcfall@redhat.com> <20170120160109.1088-3-bmcfall@redhat.com> <6A0DE07E22DDAD4C9103DF62FEBC09093B55CADD@shsmsx102.ccr.corp.intel.com> From: Billy McFall Date: Mon, 23 Jan 2017 08:49:01 -0500 Message-ID: To: "Lu, Wenzhuo" Cc: "thomas.monjalon@6wind.com" , "dev@dpdk.org" Content-Type: text/plain; charset=UTF-8 X-Content-Filtered-By: Mailman/MimeDel 2.1.15 Subject: Re: [dpdk-dev] [PATCH v3 2/3] net/e1000: e1000 igb support to free consumed buffers X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Jan 2017 13:49:02 -0000 There are two scenarios the APP calls this API (there may be others, but the API was designed with these two in mind): 1) APP receives a packet that needs to be flooded. Instead of making a copy of the packet (whether it is the whole packet, just the header or a clone buffer is used, as done today), the APP increments the reference count, adjusts the header and then sends the packet to one of the destination ports. The APP then polls this API until the reference count is decremented, and forwards to the next destination port with any header modifications needed. 2) An APP like a packet generation completes a run and then wants to reset for another run. It can call this API to free and mbufs that have been cache by the driver so it can start another run with the full set of mbufs. Maybe it is switching interfaces that are on another core and want to reclaim any mbufs that have been cached. So this API is intended to be called on an as need basis, not on every packet. When either a particular mbuf is desired or the mbufs for a particular interface need to be released. Hope this helps, Billy McFall On Sat, Jan 21, 2017 at 10:47 PM, Lu, Wenzhuo wrote: > Hi Billy, > > > -----Original Message----- > > From: Billy McFall [mailto:bmcfall@redhat.com] > > Sent: Saturday, January 21, 2017 12:01 AM > > To: thomas.monjalon@6wind.com; Lu, Wenzhuo > > Cc: dev@dpdk.org; Billy McFall > > Subject: [PATCH v3 2/3] net/e1000: e1000 igb support to free consumed > > buffers > > > > Add support to the e1000 igb driver for the new API to force free > consumed > > buffers on Tx ring. e1000 igb driver does not implement a tx_rs_thresh > to free > > mbufs, it frees a slot in the ring as needed, so a new function needed > to be > > written. > > > > Signed-off-by: Billy McFall > > --- > > drivers/net/e1000/e1000_ethdev.h | 2 + > > drivers/net/e1000/igb_ethdev.c | 1 + > > drivers/net/e1000/igb_rxtx.c | 126 > > +++++++++++++++++++++++++++++++++++++++ > > 3 files changed, 129 insertions(+) > > > > diff --git a/drivers/net/e1000/e1000_ethdev.h > > b/drivers/net/e1000/e1000_ethdev.h > > index 81a6dbb..39b2f43 100644 > > --- a/drivers/net/e1000/e1000_ethdev.h > > +++ b/drivers/net/e1000/e1000_ethdev.h > > @@ -315,6 +315,8 @@ int eth_igb_tx_queue_setup(struct rte_eth_dev *dev, > > uint16_t tx_queue_id, > > uint16_t nb_tx_desc, unsigned int socket_id, > > const struct rte_eth_txconf *tx_conf); > > > > +int eth_igb_tx_done_cleanup(void *txq, uint32_t free_cnt); > > + > > int eth_igb_rx_init(struct rte_eth_dev *dev); > > > > void eth_igb_tx_init(struct rte_eth_dev *dev); diff --git > > a/drivers/net/e1000/igb_ethdev.c b/drivers/net/e1000/igb_ethdev.c index > > 8843dd1..12a1a30 100644 > > --- a/drivers/net/e1000/igb_ethdev.c > > +++ b/drivers/net/e1000/igb_ethdev.c > > @@ -408,6 +408,7 @@ static const struct eth_dev_ops eth_igb_ops = { > > .rx_descriptor_done = eth_igb_rx_descriptor_done, > > .tx_queue_setup = eth_igb_tx_queue_setup, > > .tx_queue_release = eth_igb_tx_queue_release, > > + .tx_done_cleanup = eth_igb_tx_done_cleanup, > > .dev_led_on = eth_igb_led_on, > > .dev_led_off = eth_igb_led_off, > > .flow_ctrl_get = eth_igb_flow_ctrl_get, > > diff --git a/drivers/net/e1000/igb_rxtx.c b/drivers/net/e1000/igb_rxtx.c > index > > 45f3f24..00cd2aa 100644 > > --- a/drivers/net/e1000/igb_rxtx.c > > +++ b/drivers/net/e1000/igb_rxtx.c > > @@ -1277,6 +1277,132 @@ eth_igb_tx_queue_release(void *txq) > > igb_tx_queue_release(txq); > > } > > > > +static int > > +igb_tx_done_cleanup(struct igb_tx_queue *txq, uint32_t free_cnt) { > > + struct igb_tx_entry *sw_ring; > > + volatile union e1000_adv_tx_desc *txr; > > + uint16_t tx_first; /* First segment analyzed. */ > > + uint16_t tx_id; /* Current segment being processed. */ > > + uint16_t tx_last; /* Last segment in the current packet. */ > > + uint16_t tx_next; /* First segment of the next packet. */ > > + int count; > > + > > + if (txq != NULL) { > > + count = 0; > > + sw_ring = txq->sw_ring; > > + txr = txq->tx_ring; > > + > > + /* > > + * tx_tail is the last sent packet on the sw_ring. Goto > the end > > + * of that packet (the last segment in the packet chain) > and > > + * then the next segment will be the start of the oldest > > segment > > + * in the sw_ring. This is the first packet that will be > > + * attempted to be freed. > > + */ > > + > > + /* Get last segment in most recently added packet. */ > > + tx_first = sw_ring[txq->tx_tail].last_id; > > + > > + /* Get the next segment, which is the oldest segment in > ring. > > */ > > + tx_first = sw_ring[tx_first].next_id; > > + > > + /* Set the current index to the first. */ > > + tx_id = tx_first; > > + > > + /* > > + * Loop through each packet. For each packet, verify that > an > > + * mbuf exists and that the last segment is free. If so, > free > > + * it and move on. > > + */ > > + while (1) { > > + tx_last = sw_ring[tx_id].last_id; > > + > > + if (sw_ring[tx_last].mbuf) { > > + if (txr[tx_last].wb.status & > > + E1000_TXD_STAT_DD) { > > + /* > > + * Increment the number of packets > > + * freed. > > + */ > > + count++; > > + > > + /* Get the start of the next > packet. */ > > + tx_next = sw_ring[tx_last].next_id; > > + > > + /* > > + * Loop through all segments in a > > + * packet. > > + */ > > + do { > > + > > rte_pktmbuf_free_seg(sw_ring[tx_id].mbuf); > > + sw_ring[tx_id].mbuf = NULL; > > + sw_ring[tx_id].last_id = > tx_id; > > + > > + /* Move to next segemnt. */ > > + tx_id = > sw_ring[tx_id].next_id; > > + > > + } while (tx_id != tx_next); > > + > > + if (unlikely(count == > (int)free_cnt)) > > + break; > > + } else > > + /* > > + * mbuf still in use, nothing left > to > > + * free. > > + */ > > + break; > > + } else { > > + /* > > + * There are multiple reasons to be here: > > + * 1) All the packets on the ring have been > > + * freed - tx_id is equal to tx_first > > + * and some packets have been freed. > > + * - Done, exit > > + * 2) Interfaces has not sent a rings > worth of > > + * packets yet, so the segment after > tail is > > + * still empty. Or a previous call to > this > > + * function freed some of the segments > but > > + * not all so there is a hole in the > list. > > + * Hopefully this is a rare case. > > + * - Walk the list and find the next > mbuf. If > > + * there isn't one, then done. > > + */ > > + if (likely((tx_id == tx_first) && (count > != 0))) > > + break; > > + > > + /* > > + * Walk the list and find the next mbuf, > if any. > > + */ > > + do { > > + /* Move to next segemnt. */ > > + tx_id = sw_ring[tx_id].next_id; > > + > > + if (sw_ring[tx_id].mbuf) > > + break; > > + > > + } while (tx_id != tx_first); > > + > > + /* > > + * Determine why previous loop bailed. If > > there > > + * is not an mbuf, done. > > + */ > > + if (sw_ring[tx_id].mbuf == NULL) > > + break; > > + } > > + } > > + } else > > + count = -ENODEV; > > + > > + return count; > > +} > > + > > +int > > +eth_igb_tx_done_cleanup(void *txq, uint32_t free_cnt) { > > + return igb_tx_done_cleanup(txq, free_cnt); } > > + > > static void > > igb_reset_tx_queue_stat(struct igb_tx_queue *txq) { > > -- > > 2.9.3 > I think the code is good. But I don't understand the whole idea. An API > means APP need call it. When should APP call This API? Does APP need to > check after sending every packet? Or just call it periodically? > >