From: "Ananyev, Konstantin" <konstantin.ananyev@intel.com>
To: "Pei, Yulong" <yulong.pei@intel.com>,
Vladyslav Buslov <vladyslav.buslov@harmonicinc.com>,
"Zhang, Helin" <helin.zhang@intel.com>,
"Wu, Jingjing" <jingjing.wu@intel.com>,
"Yigit, Ferruh" <ferruh.yigit@intel.com>
Cc: "dev@dpdk.org" <dev@dpdk.org>
Subject: Re: [dpdk-dev] [PATCH] net/i40e: add packet prefetch
Date: Mon, 3 Apr 2017 10:47:20 +0000 [thread overview]
Message-ID: <2601191342CEEE43887BDE71AB9772583FAE4246@IRSMSX109.ger.corp.intel.com> (raw)
In-Reply-To: <188971FCDA171749BED5DA74ABF3E6F03B6ACF0D@shsmsx102.ccr.corp.intel.com>
> -----Original Message-----
> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Pei, Yulong
> Sent: Saturday, April 1, 2017 3:02 AM
> To: Vladyslav Buslov <vladyslav.buslov@harmonicinc.com>; Zhang, Helin <helin.zhang@intel.com>; Wu, Jingjing <jingjing.wu@intel.com>;
> Yigit, Ferruh <ferruh.yigit@intel.com>
> Cc: dev@dpdk.org
> Subject: Re: [dpdk-dev] [PATCH] net/i40e: add packet prefetch
>
> Hi All
>
> In Non-vector mode, without this patch, single core performance can reach 37.576Mpps with 64Byte packet,
> But after applied this patch , single core performance downgrade to 34.343Mpps with 64Byte packet.
>
> Best Regards
> Yulong Pei
>
> -----Original Message-----
> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Vladyslav Buslov
> Sent: Wednesday, March 1, 2017 6:57 PM
> To: Zhang, Helin <helin.zhang@intel.com>; Wu, Jingjing <jingjing.wu@intel.com>; Yigit, Ferruh <ferruh.yigit@intel.com>
> Cc: dev@dpdk.org
> Subject: [dpdk-dev] [PATCH] net/i40e: add packet prefetch
>
> Prefetch both cache lines of mbuf and first cache line of payload if CONFIG_RTE_PMD_PACKET_PREFETCH is set.
>
> Signed-off-by: Vladyslav Buslov <vladyslav.buslov@harmonicinc.com>
> ---
> drivers/net/i40e/i40e_rxtx.c | 20 ++++++++++++++++----
> 1 file changed, 16 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/net/i40e/i40e_rxtx.c b/drivers/net/i40e/i40e_rxtx.c index 48429cc..2b4e5c9 100644
> --- a/drivers/net/i40e/i40e_rxtx.c
> +++ b/drivers/net/i40e/i40e_rxtx.c
> @@ -100,6 +100,12 @@
> #define I40E_TX_OFFLOAD_NOTSUP_MASK \
> (PKT_TX_OFFLOAD_MASK ^ I40E_TX_OFFLOAD_MASK)
>
> +#ifdef RTE_PMD_PACKET_PREFETCH
> +#define rte_packet_prefetch(p) rte_prefetch0(p)
> +#else
> +#define rte_packet_prefetch(p) do {} while (0)
> +#endif
> +
> static uint16_t i40e_xmit_pkts_simple(void *tx_queue,
> struct rte_mbuf **tx_pkts,
> uint16_t nb_pkts);
> @@ -495,6 +501,9 @@ i40e_rx_scan_hw_ring(struct i40e_rx_queue *rxq)
> /* Translate descriptor info to mbuf parameters */
> for (j = 0; j < nb_dd; j++) {
> mb = rxep[j].mbuf;
> + rte_packet_prefetch(
> + RTE_PTR_ADD(mb->buf_addr,
> + RTE_PKTMBUF_HEADROOM));
> qword1 = rte_le_to_cpu_64(\
> rxdp[j].wb.qword1.status_error_len);
> pkt_len = ((qword1 & I40E_RXD_QW1_LENGTH_PBUF_MASK) >> @@ -578,9 +587,11 @@
> i40e_rx_alloc_bufs(struct i40e_rx_queue *rxq)
>
> rxdp = &rxq->rx_ring[alloc_idx];
> for (i = 0; i < rxq->rx_free_thresh; i++) {
> - if (likely(i < (rxq->rx_free_thresh - 1)))
> + if (likely(i < (rxq->rx_free_thresh - 1))) {
> /* Prefetch next mbuf */
> - rte_prefetch0(rxep[i + 1].mbuf);
> + rte_packet_prefetch(rxep[i + 1].mbuf->cacheline0);
> + rte_packet_prefetch(rxep[i + 1].mbuf->cacheline1);
As I can see the line aove is the only real difference in that patch.
If that so, might be worth to re-run perf tests witout that line?
Konstantin
> + }
>
> mb = rxep[i].mbuf;
> rte_mbuf_refcnt_set(mb, 1);
> @@ -752,7 +763,8 @@ i40e_recv_pkts(void *rx_queue, struct rte_mbuf **rx_pkts, uint16_t nb_pkts)
> I40E_RXD_QW1_LENGTH_PBUF_SHIFT) - rxq->crc_len;
>
> rxm->data_off = RTE_PKTMBUF_HEADROOM;
> - rte_prefetch0(RTE_PTR_ADD(rxm->buf_addr, RTE_PKTMBUF_HEADROOM));
> + rte_packet_prefetch(RTE_PTR_ADD(rxm->buf_addr,
> + RTE_PKTMBUF_HEADROOM));
> rxm->nb_segs = 1;
> rxm->next = NULL;
> rxm->pkt_len = rx_packet_len;
> @@ -939,7 +951,7 @@ i40e_recv_scattered_pkts(void *rx_queue,
> first_seg->ol_flags |= pkt_flags;
>
> /* Prefetch data of first segment, if configured to do so. */
> - rte_prefetch0(RTE_PTR_ADD(first_seg->buf_addr,
> + rte_packet_prefetch(RTE_PTR_ADD(first_seg->buf_addr,
> first_seg->data_off));
> rx_pkts[nb_rx++] = first_seg;
> first_seg = NULL;
> --
> 2.1.4
next prev parent reply other threads:[~2017-04-03 10:47 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-03-01 10:56 Vladyslav Buslov
2017-03-07 10:37 ` Ferruh Yigit
2017-04-01 2:01 ` Pei, Yulong
2017-04-03 10:31 ` Ferruh Yigit
2017-04-06 9:29 ` Vladyslav Buslov
2017-04-06 9:54 ` Bruce Richardson
2017-04-03 10:47 ` Ananyev, Konstantin [this message]
2017-04-06 9:54 ` Bruce Richardson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2601191342CEEE43887BDE71AB9772583FAE4246@IRSMSX109.ger.corp.intel.com \
--to=konstantin.ananyev@intel.com \
--cc=dev@dpdk.org \
--cc=ferruh.yigit@intel.com \
--cc=helin.zhang@intel.com \
--cc=jingjing.wu@intel.com \
--cc=vladyslav.buslov@harmonicinc.com \
--cc=yulong.pei@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).