From: "Morten Brørup" <mb@smartsharesystems.com>
To: "Bruce Richardson" <bruce.richardson@intel.com>
Cc: "Stephen Hemminger" <stephen@networkplumber.org>, <dev@dpdk.org>,
"Harry van Haaren" <harry.van.haaren@intel.com>
Subject: RE: Thread priority for background tasks
Date: Thu, 1 May 2025 10:35:20 +0200 [thread overview]
Message-ID: <98CBD80474FA8B44BF855DF32C47DC35E9FC1C@smartserver.smartshare.dk> (raw)
In-Reply-To: <aBMnKlCGNu6fp7hu@bricha3-mobl1.ger.corp.intel.com>
> From: Bruce Richardson [mailto:bruce.richardson@intel.com]
> Sent: Thursday, 1 May 2025 09.48
>
> On Thu, May 01, 2025 at 09:05:32AM +0200, Morten Brørup wrote:
> > > From: Stephen Hemminger [mailto:stephen@networkplumber.org]
> > > Sent: Wednesday, 30 April 2025 21.45
> > >
> > > On Wed, 30 Apr 2025 20:40:52 +0200
> > > Morten Brørup <mb@smartsharesystems.com> wrote:
> > >
> > > > There are only two thread priorities in the enum
> rte_thread_priority:
> > > Normal and Real-time Critical.
> > > >
> > > > I would like to poll ethdev counters, collect garbage and perform
> > > other jitter non-sensitive tasks in a control thread with lower
> > > priority than my ordinary control threads, so it will be preempted
> by
> > > any work ready for my ordinary control threads.
> > > >
> > > > Which DPDK API am I supposed to use to assign this below-normal
> > > priority to my "background" control thread?
> > > >
> > > > Or: Aren't we missing a priority like Linux' SCHED_BATCH?
> > >
> > > Short answer: if your application is running on Linux, only ever
> use
> > > Normal.
> > > DPDK applications usually never sleep and this will starve the OS
> and
> > > cause instability.
> >
> > I was asking for the opposite of Critical priority.
> >
> > For the sake of discussion, imagine a (registered or unregistered)
> non-EAL thread doing something like this:
> > loop {
> > poll_counters(); // 1 ms execution time
> > sleep(99 ms);
> > }
> >
> > With normal scheduling priority, it will rack up a lot of scheduling
> credits during sleep(), so it might not be preempted by other threads
> while executing poll_counters().
> >
> > But if some other thread (on the same CPU core) changes state from
> Sleeping to Runnable, I want it to preempt the counter polling thread.
> > This other thread could be a control plane application, e.g. a DNS
> Server, which shouldn't suffer up to 1 ms scheduling lag if it becomes
> Runnable the instant the counter polling thread started executing
> poll_counters().
> >
> > So I'm looking for a DPDK API to apply a "low priority" scheduling
> policy, like SCHED_BATCH, to the counter polling thread.
> >
>
> Does this need to be done in DPDK?
No, not really.
> Unless you need to target Windows, would
> using the standard Unix/Posix scheduling/pthread APIs directly not be
> best,
> rather than having us try to wrap all such things inside DPDK APIs?
It probably would. That's how we do it today, anyway. :-)
> I worry
> about scope creep for such things, with us ending up wrapping a whole
> bunch
> of scheduling stuff into DPDK that we should not need to do.
I'm mainly asking for academic reasons.
I think the scope of my question was included into DPDK when it introduced the rte_thread_priority with RTE_THREAD_PRIORITY_NORMAL and RTE_THREAD_PRIORITY_REALTIME_CRITICAL.
I know this is mainly related to the control plane, and thus not the most relevant thing for DPDK.
But I think we need to offer something. Not only for applications, but drivers might want to run separate low-priority threads for background tasks, such as garbage collection, counter polling, or a link state machine.
IMHO, the kernel scheduler is a much better choice than DPDK's non-preemptive "Service Cores" scheduler for many purposes.
prev parent reply other threads:[~2025-05-01 8:35 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-04-30 18:40 Morten Brørup
2025-04-30 19:45 ` Stephen Hemminger
2025-05-01 7:05 ` Morten Brørup
2025-05-01 7:47 ` Bruce Richardson
2025-05-01 8:35 ` Morten Brørup [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=98CBD80474FA8B44BF855DF32C47DC35E9FC1C@smartserver.smartshare.dk \
--to=mb@smartsharesystems.com \
--cc=bruce.richardson@intel.com \
--cc=dev@dpdk.org \
--cc=harry.van.haaren@intel.com \
--cc=stephen@networkplumber.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).