From: David Marchand <david.marchand@redhat.com>
To: Owen Hilyard <ohilyard@iol.unh.edu>
Cc: dev <dev@dpdk.org>, dpdklab <dpdklab@iol.unh.edu>,
Gaetan Rivet <grive@u256.net>,
Jerin Jacob Kollanukkaran <jerinj@marvell.com>,
Aaron Conole <aconole@redhat.com>,
"Burakov, Anatoly" <anatoly.burakov@intel.com>
Subject: Re: [dpdk-dev] Memory leak in rte_pci_scan
Date: Wed, 16 Jun 2021 19:40:58 +0200
Message-ID: <CAJFAV8zVRj1rbh=iwk881qJL2muvhj+oM50J4cmvS+bFon3fAw@mail.gmail.com>
In-Reply-To: <CAHx6DYAADd4b9U8m7hNk0d_KkxgYuLF7_ntf+q57tE39W0H2bQ@mail.gmail.com>
On Wed, Jun 16, 2021 at 6:27 PM Owen Hilyard <ohilyard@iol.unh.edu> wrote:
>> - For the fast-tests testsuite, the default timeout should be 10s, not 600s.
>> See timeout_seconds_fast,
>> https://git.dpdk.org/dpdk/tree/app/test/meson.build#n446
>> Odd that a 600s timeout has been applied to fast-tests in your run.
>> How do you invoke meson?
>
>
> # meson test -t 600
>
> I copied the invocation from the production scripts for the community lab and removed the --suite argument.
600?
-t is the timeout multiplier, not an absolute timeout.
The default timeout for fast tests is 10s and the logs in a previous
mail show a 600s timeout, so I would expect a multiplier of 60.
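To make the arithmetic explicit, here is a minimal sketch. It assumes the 10s default quoted above for fast tests and that -t simply multiplies each test's own timeout; the function names are made up for illustration and are not part of meson or DPDK.

```python
# Assumed default per-test timeout for the fast-tests suite, in seconds
# (the value of timeout_seconds_fast as quoted in this thread).
DEFAULT_FAST_TIMEOUT_S = 10


def effective_timeout(base_s, multiplier):
    """Timeout applied to a test when 'meson test -t <multiplier>' is used."""
    return base_s * multiplier


def implied_multiplier(observed_s, base_s):
    """Multiplier that would explain an observed timeout for a given base."""
    return observed_s // base_s


# What '-t 600' would give for a fast test with a 10s base timeout.
print(effective_timeout(DEFAULT_FAST_TIMEOUT_S, 600))

# Multiplier implied by the 600s timeout seen in the earlier logs.
print(implied_multiplier(600, DEFAULT_FAST_TIMEOUT_S))
```

If the base really is 10s, '-t 600' would yield 6000s per test, while the 600s seen in the logs matches a multiplier of 60.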
>
>> It seems like there are multiple dpdk processes running in parallel in this
>> environment.
>> Any idea of what is happening on your system at the moment you tried
>> to run this test?
>
>
> I ran this on a VM that we keep in the same state as the production container runners. It is not attached to our Jenkins instance, and I was the only logged-in user. I re-ran the test suite with and without ASAN, and it seems like this type of failure only happens when ASAN is active. The failing tests are: eal_flags_a_opt_autotest, eal_flags_b_opt_autotest, eal_flags_c_opt_autotest, eal_flags_main_opt_autotest, eal_flags_misc_autotest. I've attached the log.
ASAN seems to break some assumption about the default virtual base
address used by the multi-process (mp) code.
That might be the reason for the secondary process init failure.
Still, we probably have a deadlock here, since the test should fail in
a reasonable amount of time.
My guess is that some secondary process is not releasing a lock and
the primary ends up waiting on it.
Here, a secondary process did not initialise correctly, but it tried
to clean up afterwards... by any chance, do you have a crash reported in
syslog?
--
David Marchand