DPDK patches and discussions
 help / color / mirror / Atom feed
From: Sagi Grimberg <sagi@grimberg.me>
To: Yongseok Koh <yskoh@mellanox.com>
Cc: adrien.mazarguil@6wind.com, nelio.laranjeiro@6wind.com, dev@dpdk.org
Subject: Re: [dpdk-dev] [PATCH] net/mlx5: poll completion queue once per a call
Date: Sun, 23 Jul 2017 12:49:36 +0300	[thread overview]
Message-ID: <918eb17f-269c-7102-41ab-69ceb95fdacf@grimberg.me> (raw)
In-Reply-To: <20170721151006.GA38779@yongseok-MBP.local>

>>> mlx5_tx_complete() polls completion queue multiple times until it
>>> encounters an invalid entry. As Tx completions are suppressed by
>>> MLX5_TX_COMP_THRESH, it is waste of cycles to expect multiple completions
>>> in a poll. And freeing too many buffers in a call can cause high jitter.
>>> This patch improves throughput a little.
>>
>> What if the device generates burst of completions?
> mlx5 PMD suppresses completions anyway. It requests a completion per every
> MLX5_TX_COMP_THRESH Tx mbufs, not every single mbuf. So, the size of completion
> queue is even much small.

Yes I realize that, but can't the device still complete in a burst (of
unsuppressed completions)? I mean its not guaranteed that for every
txq_complete a signaled completion is pending right? What happens if
the device has inconsistent completion pacing? Can't the sw grow a
batch of completions if txq_complete will process a single completion
unconditionally?

>> Holding these completions un-reaped can theoretically cause resource stress on
>> the corresponding mempool(s).
> Can you make your point clearer? Do you think the "stress" can impact
> performance? I think stress doesn't matter unless it is depleted. And app is
> responsible for supplying enough mbufs considering the depth of all queues (max
> # of outstanding mbufs).

I might be missing something, but # of outstanding mbufs should be
relatively small as the pmd reaps every MLX5_TX_COMP_THRESH mbufs right?
Why should the pool account for the entire TX queue depth (which can
be very large)?

Is there a hard requirement documented somewhere that the application
needs to account for the entire TX queue depths for sizing its mbuf
pool?

My question is with the proposed change, doesn't this mean that the 
application might need to allocate a bigger TX mbuf pool? Because the
pmd can theoretically consume completions slower (as in multiple TX
burst calls)?

>> I totally get the need for a stopping condition, but is "loop once"
>> the best stop condition?
> Best for what?

Best condition to stop consuming TX completions. As I said, I think that
leaving TX completions un-reaped can (at least in theory) slow down the
mbuf reclamation, which impacts the application. (unless I'm not
understanding something fundamental)

>> Perhaps an adaptive budget (based on online stats) perform better?
> Please bring up any suggestion or submit a patch if any.

I was simply providing a review for the patch. I don't have the time
to come up with a better patch unfortunately, but I still think its
fair to raise a point.

> Does "budget" mean the
> threshold? If so, calculation of stats for adaptive threshold can impact single
> core performance. With multiple cores, adjusting threshold doesn't affect much.

If you look at mlx5e driver in the kernel, it maintains online stats on
its RX and TX queues. It maintain these stats mostly for adaptive
interrupt moderation control (but not only).

I was suggesting maintaining per TX queue stats on average completions
consumed for each TX burst call, and adjust the stopping condition
according to a calculated stat.

  reply	other threads:[~2017-07-23  9:49 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-07-20 15:48 Yongseok Koh
2017-07-20 16:34 ` Sagi Grimberg
2017-07-21 15:10   ` Yongseok Koh
2017-07-23  9:49     ` Sagi Grimberg [this message]
2017-07-25  7:43       ` Yongseok Koh
2017-07-27 11:12         ` Sagi Grimberg
2017-07-28  0:26           ` Yongseok Koh
2017-07-31 16:12 ` Ferruh Yigit

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=918eb17f-269c-7102-41ab-69ceb95fdacf@grimberg.me \
    --to=sagi@grimberg.me \
    --cc=adrien.mazarguil@6wind.com \
    --cc=dev@dpdk.org \
    --cc=nelio.laranjeiro@6wind.com \
    --cc=yskoh@mellanox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).