* [dpdk-dev] [PATCH] test/interrupt: account for race with callback
@ 2019-08-08 17:38 Aaron Conole
2019-08-09 9:18 ` David Marchand
2019-09-13 14:42 ` Aaron Conole
0 siblings, 2 replies; 4+ messages in thread
From: Aaron Conole @ 2019-08-08 17:38 UTC (permalink / raw)
To: dev; +Cc: Jeff Guo, Thomas Monjalon
Because the eal interrupt framework can race when invoking the callback
and a separate unregister call, the test needs to accommodate the chance
that the two collide. Do this by checking the return value of unregister
against the race-condition flag (EAGAIN).
Fixes: f1a6c22424ce ("app/test: update interrupts test")
Signed-off-by: Aaron Conole <aconole@redhat.com>
---
NOTE: it's difficult to reproduce this race. I tried a bit, but have
only seen it sporadically. In Travis environment, the CPU
resource can be very limited and this test is quite racy.
app/test/test_interrupts.c | 10 +++++++---
1 file changed, 7 insertions(+), 3 deletions(-)
diff --git a/app/test/test_interrupts.c b/app/test/test_interrupts.c
index d8c2d8124..233b14a70 100644
--- a/app/test/test_interrupts.c
+++ b/app/test/test_interrupts.c
@@ -370,9 +370,13 @@ test_interrupt_full_path_check(enum test_interrupt_handle_type intr_type)
rte_delay_ms(TEST_INTERRUPT_CHECK_INTERVAL);
rte_delay_ms(TEST_INTERRUPT_CHECK_INTERVAL);
- if (rte_intr_callback_unregister(&test_intr_handle,
- test_interrupt_callback, &test_intr_handle) < 0)
- return -1;
+ while ((count =
+ rte_intr_callback_unregister(&test_intr_handle,
+ test_interrupt_callback,
+ &test_intr_handle)) < 0) {
+ if (count != -EAGAIN)
+ return -1;
+ }
if (flag == 0) {
printf("callback has not been called\n");
--
2.21.0
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] test/interrupt: account for race with callback
2019-08-08 17:38 [dpdk-dev] [PATCH] test/interrupt: account for race with callback Aaron Conole
@ 2019-08-09 9:18 ` David Marchand
2019-09-13 15:01 ` Thomas Monjalon
2019-09-13 14:42 ` Aaron Conole
1 sibling, 1 reply; 4+ messages in thread
From: David Marchand @ 2019-08-09 9:18 UTC (permalink / raw)
To: Aaron Conole; +Cc: dev, Jeff Guo, Thomas Monjalon
On Thu, Aug 8, 2019 at 7:38 PM Aaron Conole <aconole@redhat.com> wrote:
>
> Because the eal interrupt framework can race when invoking the callback
> and a separate unregister call, the test needs to accommodate the chance
> that the two collide. Do this by checking the return value of unregister
> against the race-condition flag (EAGAIN).
>
> Fixes: f1a6c22424ce ("app/test: update interrupts test")
Not too sure about this tag, but anyway, this is old enough to apply
to every stable releases we have.
Cc: stable@dpdk.org ?
> Signed-off-by: Aaron Conole <aconole@redhat.com>
> ---
> NOTE: it's difficult to reproduce this race. I tried a bit, but have
> only seen it sporadically. In Travis environment, the CPU
> resource can be very limited and this test is quite racy.
Managed to reproduce it:
# time (log=/tmp/$$.log; while true; do echo interrupt_autotest
|taskset -c 0-1 ./build-gcc-static/app/test/dpdk-test -l 0-1 >$log
2>&1; grep -q 'Test OK' $log || break; done; cat $log; rm -f $log)
EAL: Detected 8 lcore(s)
EAL: Detected 1 NUMA nodes
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: No available hugepages reported in hugepages-1048576kB
EAL: Probing VFIO support...
EAL: PCI device 0000:00:1f.6 on NUMA socket -1
EAL: Invalid NUMA socket, default to 0
EAL: probe driver: 8086:15d7 net_e1000_em
APP: HPET is not enabled, using TSC as default timer
RTE>>interrupt_autotest
Check unknown valid interrupt full path
Check valid UIO interrupt full path
Check valid device event interrupt full path
count=-11 Resource temporarily unavailable
failure occurred during checking valid device event interrupt full path
Clearing for interrupt tests
Test Failed
RTE>>
real 0m38.081s
user 0m35.836s
sys 0m2.171s
>
> app/test/test_interrupts.c | 10 +++++++---
> 1 file changed, 7 insertions(+), 3 deletions(-)
>
> diff --git a/app/test/test_interrupts.c b/app/test/test_interrupts.c
> index d8c2d8124..233b14a70 100644
> --- a/app/test/test_interrupts.c
> +++ b/app/test/test_interrupts.c
> @@ -370,9 +370,13 @@ test_interrupt_full_path_check(enum test_interrupt_handle_type intr_type)
> rte_delay_ms(TEST_INTERRUPT_CHECK_INTERVAL);
>
> rte_delay_ms(TEST_INTERRUPT_CHECK_INTERVAL);
> - if (rte_intr_callback_unregister(&test_intr_handle,
> - test_interrupt_callback, &test_intr_handle) < 0)
> - return -1;
> + while ((count =
> + rte_intr_callback_unregister(&test_intr_handle,
> + test_interrupt_callback,
> + &test_intr_handle)) < 0) {
> + if (count != -EAGAIN)
> + return -1;
> + }
>
> if (flag == 0) {
> printf("callback has not been called\n");
> --
> 2.21.0
>
With this patch, my loop has been running for more than 10 minutes now.
Reviewed-by: David Marchand <david.marchand@redhat.com>
--
David Marchand
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] test/interrupt: account for race with callback
2019-08-08 17:38 [dpdk-dev] [PATCH] test/interrupt: account for race with callback Aaron Conole
2019-08-09 9:18 ` David Marchand
@ 2019-09-13 14:42 ` Aaron Conole
1 sibling, 0 replies; 4+ messages in thread
From: Aaron Conole @ 2019-09-13 14:42 UTC (permalink / raw)
To: dev; +Cc: Jeff Guo, Thomas Monjalon
Aaron Conole <aconole@redhat.com> writes:
> Because the eal interrupt framework can race when invoking the callback
> and a separate unregister call, the test needs to accommodate the chance
> that the two collide. Do this by checking the return value of unregister
> against the race-condition flag (EAGAIN).
>
> Fixes: f1a6c22424ce ("app/test: update interrupts test")
> Signed-off-by: Aaron Conole <aconole@redhat.com>
> ---
Ping. Still see these failures.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [dpdk-dev] [PATCH] test/interrupt: account for race with callback
2019-08-09 9:18 ` David Marchand
@ 2019-09-13 15:01 ` Thomas Monjalon
0 siblings, 0 replies; 4+ messages in thread
From: Thomas Monjalon @ 2019-09-13 15:01 UTC (permalink / raw)
To: Aaron Conole; +Cc: dev, David Marchand, Jeff Guo
09/08/2019 11:18, David Marchand:
> On Thu, Aug 8, 2019 at 7:38 PM Aaron Conole <aconole@redhat.com> wrote:
> >
> > Because the eal interrupt framework can race when invoking the callback
> > and a separate unregister call, the test needs to accommodate the chance
> > that the two collide. Do this by checking the return value of unregister
> > against the race-condition flag (EAGAIN).
> >
> > Fixes: f1a6c22424ce ("app/test: update interrupts test")
>
> Not too sure about this tag, but anyway, this is old enough to apply
> to every stable releases we have.
> Cc: stable@dpdk.org ?
>
> > Signed-off-by: Aaron Conole <aconole@redhat.com>
> > ---
> > NOTE: it's difficult to reproduce this race. I tried a bit, but have
> > only seen it sporadically. In Travis environment, the CPU
> > resource can be very limited and this test is quite racy.
>
> Managed to reproduce it:
>
> # time (log=/tmp/$$.log; while true; do echo interrupt_autotest
> |taskset -c 0-1 ./build-gcc-static/app/test/dpdk-test -l 0-1 >$log
> 2>&1; grep -q 'Test OK' $log || break; done; cat $log; rm -f $log)
> EAL: Detected 8 lcore(s)
> EAL: Detected 1 NUMA nodes
> EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
> EAL: Selected IOVA mode 'PA'
> EAL: No available hugepages reported in hugepages-1048576kB
> EAL: Probing VFIO support...
> EAL: PCI device 0000:00:1f.6 on NUMA socket -1
> EAL: Invalid NUMA socket, default to 0
> EAL: probe driver: 8086:15d7 net_e1000_em
> APP: HPET is not enabled, using TSC as default timer
> RTE>>interrupt_autotest
> Check unknown valid interrupt full path
> Check valid UIO interrupt full path
> Check valid device event interrupt full path
> count=-11 Resource temporarily unavailable
> failure occurred during checking valid device event interrupt full path
> Clearing for interrupt tests
> Test Failed
> RTE>>
> real 0m38.081s
> user 0m35.836s
> sys 0m2.171s
>
> With this patch, my loop has been running for more than 10 minutes now.
> Reviewed-by: David Marchand <david.marchand@redhat.com>
Applied, thanks
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2019-09-13 15:01 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-08-08 17:38 [dpdk-dev] [PATCH] test/interrupt: account for race with callback Aaron Conole
2019-08-09 9:18 ` David Marchand
2019-09-13 15:01 ` Thomas Monjalon
2019-09-13 14:42 ` Aaron Conole
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).