* [dpdk-dev] [Bug 826] red_autotest random failures @ 2021-10-08 7:23 bugzilla 2021-11-12 13:51 ` David Marchand 0 siblings, 1 reply; 16+ messages in thread From: bugzilla @ 2021-10-08 7:23 UTC (permalink / raw) To: dev https://bugs.dpdk.org/show_bug.cgi?id=826 Bug ID: 826 Summary: red_autotest random failures Product: DPDK Version: unspecified Hardware: All OS: All Status: UNCONFIRMED Severity: normal Priority: Normal Component: other Assignee: cristian.dumitrescu@intel.com Reporter: david.marchand@redhat.com CC: dev@dpdk.org, jasvinder.singh@intel.com Target Milestone: --- A recent failure can be found at: https://lab.dpdk.org/results/dashboard/patchsets/19223/ 50/94 DPDK:fast-tests / red_autotest FAIL 0.86s exit status 1 23:08:32 MALLOC_PERTURB_=116 DPDK_TEST=red_autotest /home-local/jenkins-local/jenkins-agent/workspace/Generic-Unit-Test-DPDK/dpdk/build/app/test/dpdk-test '-l 0-15' --file-prefix=red_autotest ----------------------------------- output ----------------------------------- stdout: RTE>>red_autotest -------------------------------------------------------------------------------- functional test 1 : use one rte_red configuration, increase average queue size to various levels, compare drop rate to drop probability avg queue size enqueued dropped drop prob % drop rate % diff % tolerance % 6 10000 0 0.0000 0.0000 0.0000 50.0000 12 10000 0 0.0000 0.0000 0.0000 50.0000 18 10000 0 0.0000 0.0000 0.0000 50.0000 24 10000 0 0.0000 0.0000 0.0000 50.0000 30 10000 0 0.0000 0.0000 0.0000 50.0000 36 9954 46 0.4167 0.4600 0.0000 50.0000 42 9901 99 1.0417 0.9900 0.0000 50.0000 48 9831 169 1.6667 1.6900 0.0000 50.0000 54 9761 239 2.2917 2.3900 0.0000 50.0000 60 9701 299 2.9167 2.9900 0.0000 50.0000 66 9618 382 3.5417 3.8200 0.0000 50.0000 72 9581 419 4.1667 4.1900 0.0000 50.0000 78 9490 510 4.7917 5.1000 0.0000 50.0000 84 9460 540 5.4167 5.4000 0.0000 50.0000 90 9398 602 6.0417 6.0200 0.0000 50.0000 96 9371 629 6.6667 6.2900 0.0000 50.0000 102 9259 741 7.2917 7.4100 0.0000 50.0000 108 9211 789 7.9167 7.8900 0.0000 50.0000 114 9133 867 8.5417 8.6700 0.0000 50.0000 120 9058 942 9.1667 9.4200 0.0000 50.0000 126 8987 1013 9.7917 10.1300 0.0000 50.0000 132 0 10000 100.0000 100.0000 0.0000 50.0000 138 0 10000 100.0000 100.0000 0.0000 50.0000 144 0 10000 100.0000 100.0000 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 2 : use several RED configurations, increase average queue size to just below maximum threshold, compare drop rate to drop probability RED config avg queue size min threshold max threshold drop prob % drop rate % diff % tolerance % 0 127 32 128 9.8958 9.9400 0.0000 50.0000 1 127 32 128 4.9479 4.8100 0.0000 50.0000 2 127 32 128 3.2986 2.6800 0.0000 50.0000 3 127 32 128 2.4740 1.6300 0.0000 50.0000 4 127 32 128 1.9792 1.2700 0.0000 50.0000 5 127 32 128 1.6493 1.0300 0.0000 50.0000 6 127 32 128 1.4137 0.8300 0.0000 50.0000 7 127 32 128 1.2370 0.7300 0.0000 50.0000 8 127 32 128 1.0995 0.6200 0.0000 50.0000 9 127 32 128 0.9896 0.5500 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 3 : use one RED configuration, increase average queue size to target level, dequeue all packets until queue is empty, confirm that average queue size is computed correctly while queue is empty q avg before q avg after expected difference % tolerance % result 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 5 : use several queues (each with its own run-time data), use several RED configurations (such that each configuration is shared by multiple queues), increase average queue size to just below maximum threshold, compare drop rate to drop probability, (this is a larger scale version of functional test 2) queue config avg queue size min threshold max threshold drop prob % drop rate % diff % tolerance % 0 0 127 32 128 9.8958 9.9700 0.0000 50.0000 1 0 127 32 128 9.8958 9.7200 0.0000 50.0000 2 1 127 32 128 4.9479 4.8900 0.0000 50.0000 3 1 127 32 128 4.9479 4.8200 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 6 : use several queues (each with its own run-time data), use several RED configurations (such that each configuration is sharte_red by multiple queues), increase average queue size to target level, dequeue all packets until queue is empty, confirm that average queue size is computed correctly while queue is empty (this is a larger scale version of functional test 3) queue config q avg before q avg after expected difference % tolerance % result 0 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 2 1 1022.0000 1022.0000 1010.1483 1.1733 5.0000 pass 3 1 1022.0000 937.1660 1010.1483 7.2249 5.0000 fail -------------------------------------<fail>------------------------------------- -------------------------------------------------------------------------------- overflow test 1 : use one RED configuration, increase average queue size to target level, check maximum number of bits requirte_red to represent avg_s avg queue size wq_log2 fraction bits max queue avg num bits enqueued dropped drop prob % drop rate % 1023 12 10 0xffc00000 32 0 941366 100.00 100.00 -------------------------------------<pass>------------------------------------- [total: 6, pass: 5, fail: 1] Test Failed -- You are receiving this mail because: You are on the CC list for the bug. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-10-08 7:23 [dpdk-dev] [Bug 826] red_autotest random failures bugzilla @ 2021-11-12 13:51 ` David Marchand 2021-11-12 14:10 ` Lincoln Lavoie 0 siblings, 1 reply; 16+ messages in thread From: David Marchand @ 2021-11-12 13:51 UTC (permalink / raw) To: Cristian Dumitrescu; +Cc: dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci On Fri, Oct 8, 2021 at 9:24 AM <bugzilla@dpdk.org> wrote: > > https://bugs.dpdk.org/show_bug.cgi?id=826 > > Bug ID: 826 > Summary: red_autotest random failures > Product: DPDK > Version: unspecified > Hardware: All > OS: All > Status: UNCONFIRMED > Severity: normal > Priority: Normal > Component: other > Assignee: cristian.dumitrescu@intel.com > Reporter: david.marchand@redhat.com > CC: dev@dpdk.org, jasvinder.singh@intel.com > Target Milestone: --- > > A recent failure can be found at: > https://lab.dpdk.org/results/dashboard/patchsets/19223/ > > 50/94 DPDK:fast-tests / red_autotest FAIL 0.86s > exit status 1 functional test 6 : use several queues (each with its own run-time data), use several RED configurations (such that each configuration is sharte_red by multiple queues), increase average queue size to target level, dequeue all packets until queue is empty, confirm that average queue size is computed correctly while queue is empty (this is a larger scale version of functional test 3) queue config q avg before q avg after expected difference % tolerance % result 0 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 2 1 1022.0000 1022.0000 1010.1483 1.1733 5.0000 pass 3 1 1022.0000 937.1660 1010.1483 7.2249 5.0000 fail -------------------------------------<fail>------------------------------------- This failure keeps on popping in the CI. The bug report is one month old, with no reply. I sent a proposal of removing red_autotest from the list executed by the CI. https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2-david.marchand@redhat.com/ It might be the best solution waiting for an analysis. -- David Marchand ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-12 13:51 ` David Marchand @ 2021-11-12 14:10 ` Lincoln Lavoie 2021-11-12 14:15 ` David Marchand 0 siblings, 1 reply; 16+ messages in thread From: Lincoln Lavoie @ 2021-11-12 14:10 UTC (permalink / raw) To: David Marchand Cc: Cristian Dumitrescu, dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci [-- Attachment #1: Type: text/plain, Size: 3293 bytes --] On Fri, Nov 12, 2021 at 8:52 AM David Marchand <david.marchand@redhat.com> wrote: > On Fri, Oct 8, 2021 at 9:24 AM <bugzilla@dpdk.org> wrote: > > > > https://bugs.dpdk.org/show_bug.cgi?id=826 > > > > Bug ID: 826 > > Summary: red_autotest random failures > > Product: DPDK > > Version: unspecified > > Hardware: All > > OS: All > > Status: UNCONFIRMED > > Severity: normal > > Priority: Normal > > Component: other > > Assignee: cristian.dumitrescu@intel.com > > Reporter: david.marchand@redhat.com > > CC: dev@dpdk.org, jasvinder.singh@intel.com > > Target Milestone: --- > > > > A recent failure can be found at: > > https://lab.dpdk.org/results/dashboard/patchsets/19223/ > > > > 50/94 DPDK:fast-tests / red_autotest FAIL > 0.86s > > exit status 1 > > > functional test 6 : use several queues (each with its own run-time data), > use several RED configurations (such that each > configuration is sharte_red by multiple queues), > increase average queue size to target level, > dequeue all packets until queue is empty, > confirm that average queue size is computed correctly > while queue is empty > (this is a larger scale version of functional test 3) > > queue config q avg before q avg after expected > difference % tolerance % result > 0 0 1022.0000 1022.0000 1016.0627 > 0.5843 5.0000 pass > 1 0 1022.0000 1022.0000 1016.0627 > 0.5843 5.0000 pass > 2 1 1022.0000 1022.0000 1010.1483 > 1.1733 5.0000 pass > 3 1 1022.0000 937.1660 1010.1483 > 7.2249 5.0000 fail > > -------------------------------------<fail>------------------------------------- > > > > This failure keeps on popping in the CI. > The bug report is one month old, with no reply. > > > I sent a proposal of removing red_autotest from the list executed by the > CI. > > https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2-david.marchand@redhat.com/ > > It might be the best solution waiting for an analysis. > > > -- > David Marchand > > Hi David, My understanding is, removing the test would require removing it from the DPDK unit tests, we are just running the fast-tests suite for the unit tests. DPDK's unit test structure / framework does not allow removing or customizing the suite of tests beyond the suites. In the lab, Brandon has been looking into and trying different configurations for running the tests within the containers along the lines of the CPU pinning requirements that might be assumed by the unit tests. So far, everything he has tried has still had the similar failures / issues. We are still looking into it, so the bug is not sitting without action, just no final resolution. Cheers, Lincoln -- *Lincoln Lavoie* Principal Engineer, Broadband Technologies 21 Madbury Rd., Ste. 100, Durham, NH 03824 lylavoie@iol.unh.edu https://www.iol.unh.edu +1-603-674-2755 (m) <https://www.iol.unh.edu> [-- Attachment #2: Type: text/html, Size: 5888 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-12 14:10 ` Lincoln Lavoie @ 2021-11-12 14:15 ` David Marchand 2021-11-15 11:51 ` Dumitrescu, Cristian 0 siblings, 1 reply; 16+ messages in thread From: David Marchand @ 2021-11-12 14:15 UTC (permalink / raw) To: Lincoln Lavoie Cc: Cristian Dumitrescu, dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci On Fri, Nov 12, 2021 at 3:11 PM Lincoln Lavoie <lylavoie@iol.unh.edu> wrote: >> This failure keeps on popping in the CI. >> The bug report is one month old, with no reply. >> >> >> I sent a proposal of removing red_autotest from the list executed by the CI. >> https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2-david.marchand@redhat.com/ >> >> It might be the best solution waiting for an analysis. >> >> >> -- >> David Marchand >> > > Hi David, > > My understanding is, removing the test would require removing it from the DPDK unit tests, we are just running the fast-tests suite for the unit tests. DPDK's unit test structure / framework does not allow removing or customizing the suite of tests beyond the suites. https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2-david.marchand@redhat.com/ > > In the lab, Brandon has been looking into and trying different configurations for running the tests within the containers along the lines of the CPU pinning requirements that might be assumed by the unit tests. So far, everything he has tried has still had the similar failures / issues. We are still looking into it, so the bug is not sitting without action, just no final resolution. The mail I sent was not a comment for the investigation on UNH side. The ask is for Cristian to have a look too. -- David Marchand ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-12 14:15 ` David Marchand @ 2021-11-15 11:51 ` Dumitrescu, Cristian 2021-11-15 17:26 ` Liguzinski, WojciechX 0 siblings, 1 reply; 16+ messages in thread From: Dumitrescu, Cristian @ 2021-11-15 11:51 UTC (permalink / raw) To: David Marchand, Lincoln Lavoie, Liguzinski, WojciechX, Ajmera, Megha, Singh, Jasvinder Cc: dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci > -----Original Message----- > From: David Marchand <david.marchand@redhat.com> > Sent: Friday, November 12, 2021 2:16 PM > To: Lincoln Lavoie <lylavoie@iol.unh.edu> > Cc: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; dev > <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon > <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; > ci@dpdk.org > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > On Fri, Nov 12, 2021 at 3:11 PM Lincoln Lavoie <lylavoie@iol.unh.edu> wrote: > >> This failure keeps on popping in the CI. > >> The bug report is one month old, with no reply. > >> > >> > >> I sent a proposal of removing red_autotest from the list executed by the > CI. > >> https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502- > 2-david.marchand@redhat.com/ > >> > >> It might be the best solution waiting for an analysis. > >> > >> > >> -- > >> David Marchand > >> > > > > Hi David, > > > > My understanding is, removing the test would require removing it from the > DPDK unit tests, we are just running the fast-tests suite for the unit tests. > DPDK's unit test structure / framework does not allow removing or > customizing the suite of tests beyond the suites. > > https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2- > david.marchand@redhat.com/ > > > > > > In the lab, Brandon has been looking into and trying different > configurations for running the tests within the containers along the lines of > the CPU pinning requirements that might be assumed by the unit tests. So > far, everything he has tried has still had the similar failures / issues. We are > still looking into it, so the bug is not sitting without action, just no final > resolution. > > The mail I sent was not a comment for the investigation on UNH side. > The ask is for Cristian to have a look too. > > > -- > David Marchand Wojciech, Megha, Are you able to take a look at why is the RED autotest failing, please? Thanks, Cristian ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-15 11:51 ` Dumitrescu, Cristian @ 2021-11-15 17:26 ` Liguzinski, WojciechX 2021-11-18 22:10 ` Liguzinski, WojciechX 0 siblings, 1 reply; 16+ messages in thread From: Liguzinski, WojciechX @ 2021-11-15 17:26 UTC (permalink / raw) To: Dumitrescu, Cristian, David Marchand, Lincoln Lavoie, Ajmera, Megha, Singh, Jasvinder Cc: dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci Hi, Sure, I will have a look. Best Regards, Wojciech -----Original Message----- From: Dumitrescu, Cristian <cristian.dumitrescu@intel.com> Sent: Monday, November 15, 2021 12:51 PM To: David Marchand <david.marchand@redhat.com>; Lincoln Lavoie <lylavoie@iol.unh.edu>; Liguzinski, WojciechX <wojciechx.liguzinski@intel.com>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com> Cc: dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org Subject: RE: [dpdk-dev] [Bug 826] red_autotest random failures > -----Original Message----- > From: David Marchand <david.marchand@redhat.com> > Sent: Friday, November 12, 2021 2:16 PM > To: Lincoln Lavoie <lylavoie@iol.unh.edu> > Cc: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; dev > <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon > <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; > ci@dpdk.org > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > On Fri, Nov 12, 2021 at 3:11 PM Lincoln Lavoie <lylavoie@iol.unh.edu> wrote: > >> This failure keeps on popping in the CI. > >> The bug report is one month old, with no reply. > >> > >> > >> I sent a proposal of removing red_autotest from the list executed > >> by the > CI. > >> https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502- > 2-david.marchand@redhat.com/ > >> > >> It might be the best solution waiting for an analysis. > >> > >> > >> -- > >> David Marchand > >> > > > > Hi David, > > > > My understanding is, removing the test would require removing it > > from the > DPDK unit tests, we are just running the fast-tests suite for the unit tests. > DPDK's unit test structure / framework does not allow removing or > customizing the suite of tests beyond the suites. > > https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2- > david.marchand@redhat.com/ > > > > > > In the lab, Brandon has been looking into and trying different > configurations for running the tests within the containers along the > lines of the CPU pinning requirements that might be assumed by the > unit tests. So far, everything he has tried has still had the similar > failures / issues. We are still looking into it, so the bug is not > sitting without action, just no final resolution. > > The mail I sent was not a comment for the investigation on UNH side. > The ask is for Cristian to have a look too. > > > -- > David Marchand Wojciech, Megha, Are you able to take a look at why is the RED autotest failing, please? Thanks, Cristian ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-15 17:26 ` Liguzinski, WojciechX @ 2021-11-18 22:10 ` Liguzinski, WojciechX 2021-11-19 7:26 ` Thomas Monjalon 0 siblings, 1 reply; 16+ messages in thread From: Liguzinski, WojciechX @ 2021-11-18 22:10 UTC (permalink / raw) To: Dumitrescu, Cristian, David Marchand, Lincoln Lavoie, Ajmera, Megha, Singh, Jasvinder Cc: dev, Aaron Conole, Thomas Monjalon, Yigit, Ferruh, ci, Zegota, AnnaX Hi, I was trying to reproduce this test failure, but for me RED tests are passing. I was running the exact test command like the one described in Bug 826 - 'red_autotest' on the current main branch. Here is an example when DPDK is build without RTE_SCHED_CMAN enabled, but with this flag set to true tests are also not failing. root@silpixa00400629:~/wojtek/dpdk/build/app/test# ./dpdk-test '-l 0-15' --file-prefix=red_autotest EAL: Detected CPU lcores: 96 EAL: Detected NUMA nodes: 2 EAL: Detected static linkage of DPDK EAL: Multi-process socket /var/run/dpdk/red_autotest/mp_socket EAL: Selected IOVA mode 'VA' EAL: VFIO support initialized TELEMETRY: No legacy callbacks, legacy socket not created APP: HPET is not enabled, using TSC as default timer RTE>>red_autotest -------------------------------------------------------------------------------- functional test 1 : use one rte_red configuration, increase average queue size to various levels, compare drop rate to drop probability avg queue size enqueued dropped drop prob % drop rate % diff % tolerance % 6 10000 0 0.0000 0.0000 0.0000 50.0000 12 10000 0 0.0000 0.0000 0.0000 50.0000 18 10000 0 0.0000 0.0000 0.0000 50.0000 24 10000 0 0.0000 0.0000 0.0000 50.0000 30 10000 0 0.0000 0.0000 0.0000 50.0000 36 9961 39 0.4167 0.3900 0.0000 50.0000 42 9898 102 1.0417 1.0200 0.0000 50.0000 48 9835 165 1.6667 1.6500 0.0000 50.0000 54 9785 215 2.2917 2.1500 0.0000 50.0000 60 9703 297 2.9167 2.9700 0.0000 50.0000 66 9627 373 3.5417 3.7300 0.0000 50.0000 72 9580 420 4.1667 4.2000 0.0000 50.0000 78 9511 489 4.7917 4.8900 0.0000 50.0000 84 9462 538 5.4167 5.3800 0.0000 50.0000 90 9398 602 6.0417 6.0200 0.0000 50.0000 96 9366 634 6.6667 6.3400 0.0000 50.0000 102 9267 733 7.2917 7.3300 0.0000 50.0000 108 9212 788 7.9167 7.8800 0.0000 50.0000 114 9146 854 8.5417 8.5400 0.0000 50.0000 120 9102 898 9.1667 8.9800 0.0000 50.0000 126 8984 1016 9.7917 10.1600 0.0000 50.0000 132 0 10000 100.0000 100.0000 0.0000 50.0000 138 0 10000 100.0000 100.0000 0.0000 50.0000 144 0 10000 100.0000 100.0000 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 2 : use several RED configurations, increase average queue size to just below maximum threshold, compare drop rate to drop probability RED config avg queue size min threshold max threshold drop prob % drop rate % diff % tolerance % 0 127 32 128 9.8958 10.0100 0.0000 50.0000 1 127 32 128 4.9479 4.9700 0.0000 50.0000 2 127 32 128 3.2986 2.6800 0.0000 50.0000 3 127 32 128 2.4740 1.7000 0.0000 50.0000 4 127 32 128 1.9792 1.2700 0.0000 50.0000 5 127 32 128 1.6493 1.0500 0.0000 50.0000 6 127 32 128 1.4137 0.8100 0.0000 50.0000 7 127 32 128 1.2370 0.7100 0.0000 50.0000 8 127 32 128 1.0995 0.6200 0.0000 50.0000 9 127 32 128 0.9896 0.5600 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 3 : use one RED configuration, increase average queue size to target level, dequeue all packets until queue is empty, confirm that average queue size is computed correctly while queue is empty q avg before q avg after expected difference % tolerance % result 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 5 : use several queues (each with its own run-time data), use several RED configurations (such that each configuration is shared by multiple queues), increase average queue size to just below maximum threshold, compare drop rate to drop probability, (this is a larger scale version of functional test 2) queue config avg queue size min threshold max threshold drop prob % drop rate % diff % tolerance % 0 0 127 32 128 9.8958 9.9200 0.0000 50.0000 1 0 127 32 128 9.8958 9.9700 0.0000 50.0000 2 1 127 32 128 4.9479 4.8600 0.0000 50.0000 3 1 127 32 128 4.9479 4.9400 0.0000 50.0000 -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- functional test 6 : use several queues (each with its own run-time data), use several RED configurations (such that each configuration is shared by multiple queues), increase average queue size to target level, dequeue all packets until queue is empty, confirm that average queue size is computed correctly while queue is empty (this is a larger scale version of functional test 3) queue config q avg before q avg after expected difference % tolerance % result 0 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 1 0 1022.0000 1022.0000 1016.0627 0.5843 5.0000 pass 2 1 1022.0000 1022.0000 1010.1483 1.1733 5.0000 pass 3 1 1022.0000 1022.0000 1010.1483 1.1733 5.0000 pass -------------------------------------<pass>------------------------------------- -------------------------------------------------------------------------------- overflow test 1 : use one RED configuration, increase average queue size to target level, check maximum number of bits requirte_red to represent avg_s avg queue size wq_log2 fraction bits max queue avg num bits enqueued dropped drop prob % drop rate % 1023 12 10 0xffc00000 32 0 941366 100.00 100.00 -------------------------------------<pass>------------------------------------- [total: 6, pass: 6] Test OK RTE>>quit Kind Regards, Wojtek -----Original Message----- From: Liguzinski, WojciechX Sent: Monday, November 15, 2021 6:27 PM To: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; David Marchand <david.marchand@redhat.com>; Lincoln Lavoie <lylavoie@iol.unh.edu>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com> Cc: dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org Subject: RE: [dpdk-dev] [Bug 826] red_autotest random failures Hi, Sure, I will have a look. Best Regards, Wojciech -----Original Message----- From: Dumitrescu, Cristian <cristian.dumitrescu@intel.com> Sent: Monday, November 15, 2021 12:51 PM To: David Marchand <david.marchand@redhat.com>; Lincoln Lavoie <lylavoie@iol.unh.edu>; Liguzinski, WojciechX <wojciechx.liguzinski@intel.com>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com> Cc: dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org Subject: RE: [dpdk-dev] [Bug 826] red_autotest random failures > -----Original Message----- > From: David Marchand <david.marchand@redhat.com> > Sent: Friday, November 12, 2021 2:16 PM > To: Lincoln Lavoie <lylavoie@iol.unh.edu> > Cc: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; dev > <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Thomas Monjalon > <thomas@monjalon.net>; Yigit, Ferruh <ferruh.yigit@intel.com>; > ci@dpdk.org > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > On Fri, Nov 12, 2021 at 3:11 PM Lincoln Lavoie <lylavoie@iol.unh.edu> wrote: > >> This failure keeps on popping in the CI. > >> The bug report is one month old, with no reply. > >> > >> > >> I sent a proposal of removing red_autotest from the list executed > >> by the > CI. > >> https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502- > 2-david.marchand@redhat.com/ > >> > >> It might be the best solution waiting for an analysis. > >> > >> > >> -- > >> David Marchand > >> > > > > Hi David, > > > > My understanding is, removing the test would require removing it > > from the > DPDK unit tests, we are just running the fast-tests suite for the unit tests. > DPDK's unit test structure / framework does not allow removing or > customizing the suite of tests beyond the suites. > > https://patchwork.dpdk.org/project/dpdk/patch/20211027140458.2502-2- > david.marchand@redhat.com/ > > > > > > In the lab, Brandon has been looking into and trying different > configurations for running the tests within the containers along the > lines of the CPU pinning requirements that might be assumed by the > unit tests. So far, everything he has tried has still had the similar > failures / issues. We are still looking into it, so the bug is not > sitting without action, just no final resolution. > > The mail I sent was not a comment for the investigation on UNH side. > The ask is for Cristian to have a look too. > > > -- > David Marchand Wojciech, Megha, Are you able to take a look at why is the RED autotest failing, please? Thanks, Cristian ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-18 22:10 ` Liguzinski, WojciechX @ 2021-11-19 7:26 ` Thomas Monjalon 2021-11-19 16:53 ` Dumitrescu, Cristian 0 siblings, 1 reply; 16+ messages in thread From: Thomas Monjalon @ 2021-11-19 7:26 UTC (permalink / raw) To: Dumitrescu, Cristian, David Marchand, Lincoln Lavoie, Ajmera, Megha, Singh, Jasvinder, Liguzinski, WojciechX Cc: dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX 18/11/2021 23:10, Liguzinski, WojciechX: > Hi, > > I was trying to reproduce this test failure, but for me RED tests are passing. > I was running the exact test command like the one described in Bug 826 - 'red_autotest' on the current main branch. The test is not always failing. There are some failing conditions, please find them. I think you should try in a container with more limited resources. ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-19 7:26 ` Thomas Monjalon @ 2021-11-19 16:53 ` Dumitrescu, Cristian 2021-11-19 17:25 ` Lincoln Lavoie 2021-11-22 8:17 ` David Marchand 0 siblings, 2 replies; 16+ messages in thread From: Dumitrescu, Cristian @ 2021-11-19 16:53 UTC (permalink / raw) To: Thomas Monjalon, David Marchand, Lincoln Lavoie, Ajmera, Megha, Singh, Jasvinder, Liguzinski, WojciechX Cc: dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX > -----Original Message----- > From: Thomas Monjalon <thomas@monjalon.net> > Sent: Friday, November 19, 2021 7:26 AM > To: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; David Marchand > <david.marchand@redhat.com>; Lincoln Lavoie <lylavoie@iol.unh.edu>; > Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder > <jasvinder.singh@intel.com>; Liguzinski, WojciechX > <wojciechx.liguzinski@intel.com> > Cc: dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Yigit, > Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org; Zegota, AnnaX > <annax.zegota@intel.com> > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > 18/11/2021 23:10, Liguzinski, WojciechX: > > Hi, > > > > I was trying to reproduce this test failure, but for me RED tests are passing. > > I was running the exact test command like the one described in Bug 826 - > 'red_autotest' on the current main branch. > > The test is not always failing. > There are some failing conditions, please find them. > I think you should try in a container with more limited resources. > Hi Thomas, This is not a fair request IMO. We want to avoid wasting everybody's time, including Wojciech's time. Can the bug originator provide the details on the setup to reproduce the failure, please? Thank you! On a different point, we should probably tweak our autotests to differentiate between logical failures and those failures related to resources not being available, and flag the test result accordingly in the report. For example, if memory allocation fails, the test should be flagged as "Not enough resources" instead of simply "Failed". In the first case, the next step should be fixing the test setup, while in the second case the next step should be fixing the code. What do people think on this? Regards, Cristian ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-19 16:53 ` Dumitrescu, Cristian @ 2021-11-19 17:25 ` Lincoln Lavoie 2021-11-24 7:48 ` Liguzinski, WojciechX 2021-11-22 8:17 ` David Marchand 1 sibling, 1 reply; 16+ messages in thread From: Lincoln Lavoie @ 2021-11-19 17:25 UTC (permalink / raw) To: Dumitrescu, Cristian Cc: Thomas Monjalon, David Marchand, Ajmera, Megha, Singh, Jasvinder, Liguzinski, WojciechX, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX [-- Attachment #1: Type: text/plain, Size: 3602 bytes --] Hi All, I'm not sure if it will help, but this is an example of a failing case in the CI: https://lab.dpdk.org/results/dashboard/patchsets/20222/ The test is running within a docker container. CI is set up to only allow one active unit test at a time, so the host might be running compile jobs, but not other unit tests. This ensures there isn't "competition" for resources like hugepages between two running unit test jobs. The host is actually a VM running on VMware vCenter, not a bare-metal host, the VM's sole purpose is running the docker jobs. The command to start the unit test run is pretty generic (script is below). #!/bin/bash #################################################### # $1 argument: extra arguments to send to meson test #################################################### # Exit on first command failure set -e # Extract dpdk.tar.gz tar xzfm dpdk.tar.gz # Compile DPDK cd dpdk meson build --werror ninja -C build install # Unit test cd build meson test --suite fast-tests -t 60 $1 I think a starting point is to understand if the unit test expects or makes assumptions on the system / environment. If it has sole access to a CPU core, minimum number of hugepages, etc. If it would help, I can also give you the DockerFile to build the container (note the RHEL images have to be built on a licensed Redhat server, based on being able to install the required packages). Cheers, Lincoln On Fri, Nov 19, 2021 at 11:54 AM Dumitrescu, Cristian < cristian.dumitrescu@intel.com> wrote: > > > > -----Original Message----- > > From: Thomas Monjalon <thomas@monjalon.net> > > Sent: Friday, November 19, 2021 7:26 AM > > To: Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; David Marchand > > <david.marchand@redhat.com>; Lincoln Lavoie <lylavoie@iol.unh.edu>; > > Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder > > <jasvinder.singh@intel.com>; Liguzinski, WojciechX > > <wojciechx.liguzinski@intel.com> > > Cc: dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Yigit, > > Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org; Zegota, AnnaX > > <annax.zegota@intel.com> > > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > > > 18/11/2021 23:10, Liguzinski, WojciechX: > > > Hi, > > > > > > I was trying to reproduce this test failure, but for me RED tests are > passing. > > > I was running the exact test command like the one described in Bug 826 > - > > 'red_autotest' on the current main branch. > > > > The test is not always failing. > > There are some failing conditions, please find them. > > I think you should try in a container with more limited resources. > > > > Hi Thomas, > > This is not a fair request IMO. We want to avoid wasting everybody's time, > including Wojciech's time. Can the bug originator provide the details on > the setup to reproduce the failure, please? Thank you! > > On a different point, we should probably tweak our autotests to > differentiate between logical failures and those failures related to > resources not being available, and flag the test result accordingly in the > report. For example, if memory allocation fails, the test should be flagged > as "Not enough resources" instead of simply "Failed". In the first case, > the next step should be fixing the test setup, while in the second case the > next step should be fixing the code. What do people think on this? > > Regards, > Cristian > -- *Lincoln Lavoie* Principal Engineer, Broadband Technologies 21 Madbury Rd., Ste. 100, Durham, NH 03824 lylavoie@iol.unh.edu https://www.iol.unh.edu +1-603-674-2755 (m) <https://www.iol.unh.edu> [-- Attachment #2: Type: text/html, Size: 6527 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-19 17:25 ` Lincoln Lavoie @ 2021-11-24 7:48 ` Liguzinski, WojciechX 2021-11-29 17:58 ` Brandon Lo 0 siblings, 1 reply; 16+ messages in thread From: Liguzinski, WojciechX @ 2021-11-24 7:48 UTC (permalink / raw) To: Lincoln Lavoie, Dumitrescu, Cristian Cc: Thomas Monjalon, David Marchand, Ajmera, Megha, Singh, Jasvinder, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX [-- Attachment #1.1: Type: text/plain, Size: 4724 bytes --] Hi, Thanks Lincoln, I will also have a try with such script. Cheers, Wojciech From: Lincoln Lavoie <lylavoie@iol.unh.edu> Sent: Friday, November 19, 2021 6:26 PM To: Dumitrescu, Cristian <cristian.dumitrescu@intel.com> Cc: Thomas Monjalon <thomas@monjalon.net>; David Marchand <david.marchand@redhat.com>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com>; Liguzinski, WojciechX <wojciechx.liguzinski@intel.com>; dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org; Zegota, AnnaX <annax.zegota@intel.com> Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures Hi All, I'm not sure if it will help, but this is an example of a failing case in the CI: https://lab.dpdk.org/results/dashboard/patchsets/20222/ The test is running within a docker container. CI is set up to only allow one active unit test at a time, so the host might be running compile jobs, but not other unit tests. This ensures there isn't "competition" for resources like hugepages between two running unit test jobs. The host is actually a VM running on VMware vCenter, not a bare-metal host, the VM's sole purpose is running the docker jobs. The command to start the unit test run is pretty generic (script is below). #!/bin/bash #################################################### # $1 argument: extra arguments to send to meson test #################################################### # Exit on first command failure set -e # Extract dpdk.tar.gz tar xzfm dpdk.tar.gz # Compile DPDK cd dpdk meson build --werror ninja -C build install # Unit test cd build meson test --suite fast-tests -t 60 $1 I think a starting point is to understand if the unit test expects or makes assumptions on the system / environment. If it has sole access to a CPU core, minimum number of hugepages, etc. If it would help, I can also give you the DockerFile to build the container (note the RHEL images have to be built on a licensed Redhat server, based on being able to install the required packages). Cheers, Lincoln On Fri, Nov 19, 2021 at 11:54 AM Dumitrescu, Cristian <cristian.dumitrescu@intel.com<mailto:cristian.dumitrescu@intel.com>> wrote: > -----Original Message----- > From: Thomas Monjalon <thomas@monjalon.net<mailto:thomas@monjalon.net>> > Sent: Friday, November 19, 2021 7:26 AM > To: Dumitrescu, Cristian <cristian.dumitrescu@intel.com<mailto:cristian.dumitrescu@intel.com>>; David Marchand > <david.marchand@redhat.com<mailto:david.marchand@redhat.com>>; Lincoln Lavoie <lylavoie@iol.unh.edu<mailto:lylavoie@iol.unh.edu>>; > Ajmera, Megha <megha.ajmera@intel.com<mailto:megha.ajmera@intel.com>>; Singh, Jasvinder > <jasvinder.singh@intel.com<mailto:jasvinder.singh@intel.com>>; Liguzinski, WojciechX > <wojciechx.liguzinski@intel.com<mailto:wojciechx.liguzinski@intel.com>> > Cc: dev <dev@dpdk.org<mailto:dev@dpdk.org>>; Aaron Conole <aconole@redhat.com<mailto:aconole@redhat.com>>; Yigit, > Ferruh <ferruh.yigit@intel.com<mailto:ferruh.yigit@intel.com>>; ci@dpdk.org<mailto:ci@dpdk.org>; Zegota, AnnaX > <annax.zegota@intel.com<mailto:annax.zegota@intel.com>> > Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures > > 18/11/2021 23:10, Liguzinski, WojciechX: > > Hi, > > > > I was trying to reproduce this test failure, but for me RED tests are passing. > > I was running the exact test command like the one described in Bug 826 - > 'red_autotest' on the current main branch. > > The test is not always failing. > There are some failing conditions, please find them. > I think you should try in a container with more limited resources. > Hi Thomas, This is not a fair request IMO. We want to avoid wasting everybody's time, including Wojciech's time. Can the bug originator provide the details on the setup to reproduce the failure, please? Thank you! On a different point, we should probably tweak our autotests to differentiate between logical failures and those failures related to resources not being available, and flag the test result accordingly in the report. For example, if memory allocation fails, the test should be flagged as "Not enough resources" instead of simply "Failed". In the first case, the next step should be fixing the test setup, while in the second case the next step should be fixing the code. What do people think on this? Regards, Cristian -- Lincoln Lavoie Principal Engineer, Broadband Technologies 21 Madbury Rd., Ste. 100, Durham, NH 03824 lylavoie@iol.unh.edu<mailto:lylavoie@iol.unh.edu> https://www.iol.unh.edu +1-603-674-2755 (m) [Image removed by sender.]<https://www.iol.unh.edu/> [-- Attachment #1.2: Type: text/html, Size: 10574 bytes --] [-- Attachment #2: image002.jpg --] [-- Type: image/jpeg, Size: 444 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-24 7:48 ` Liguzinski, WojciechX @ 2021-11-29 17:58 ` Brandon Lo 2021-11-30 7:51 ` Liguzinski, WojciechX 0 siblings, 1 reply; 16+ messages in thread From: Brandon Lo @ 2021-11-29 17:58 UTC (permalink / raw) To: Liguzinski, WojciechX Cc: Lincoln Lavoie, Dumitrescu, Cristian, Thomas Monjalon, David Marchand, Ajmera, Megha, Singh, Jasvinder, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX [-- Attachment #1: Type: text/plain, Size: 725 bytes --] On Wed, Nov 24, 2021 at 2:48 AM Liguzinski, WojciechX < wojciechx.liguzinski@intel.com> wrote: > Hi, > > > > Thanks Lincoln, I will also have a try with such script. > > > > Cheers, > > Wojciech > > Hello Wojciech, I also recommend trying to run the test with around 4GB of RAM and 2GB of hugepages to see if it fails. That is roughly the number of resources we have per machine that is completely dedicated to unit tests. The amount of RAM available can sometimes increase depending on how many jobs are running per machine, but 4GB is the lowest it can go for the unit test job. Thanks, Brandon -- Brandon Lo UNH InterOperability Laboratory 21 Madbury Rd, Suite 100, Durham, NH 03824 blo@iol.unh.edu www.iol.unh.edu [-- Attachment #2: Type: text/html, Size: 1676 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-29 17:58 ` Brandon Lo @ 2021-11-30 7:51 ` Liguzinski, WojciechX 2021-12-10 13:31 ` Liguzinski, WojciechX 0 siblings, 1 reply; 16+ messages in thread From: Liguzinski, WojciechX @ 2021-11-30 7:51 UTC (permalink / raw) To: Brandon Lo Cc: Lincoln Lavoie, Dumitrescu, Cristian, Thomas Monjalon, David Marchand, Ajmera, Megha, Singh, Jasvinder, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX [-- Attachment #1: Type: text/plain, Size: 1549 bytes --] Ok, thanks Brandon for the tip :) Let’s see if I can setup the machine with such configuration. Cheers, Wojciech From: Brandon Lo <blo@iol.unh.edu> Sent: Monday, November 29, 2021 6:58 PM To: Liguzinski, WojciechX <wojciechx.liguzinski@intel.com> Cc: Lincoln Lavoie <lylavoie@iol.unh.edu>; Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; Thomas Monjalon <thomas@monjalon.net>; David Marchand <david.marchand@redhat.com>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com>; dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org; Zegota, AnnaX <annax.zegota@intel.com> Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures On Wed, Nov 24, 2021 at 2:48 AM Liguzinski, WojciechX <wojciechx.liguzinski@intel.com<mailto:wojciechx.liguzinski@intel.com>> wrote: Hi, Thanks Lincoln, I will also have a try with such script. Cheers, Wojciech Hello Wojciech, I also recommend trying to run the test with around 4GB of RAM and 2GB of hugepages to see if it fails. That is roughly the number of resources we have per machine that is completely dedicated to unit tests. The amount of RAM available can sometimes increase depending on how many jobs are running per machine, but 4GB is the lowest it can go for the unit test job. Thanks, Brandon -- Brandon Lo UNH InterOperability Laboratory 21 Madbury Rd, Suite 100, Durham, NH 03824 blo@iol.unh.edu<mailto:blo@iol.unh.edu> www.iol.unh.edu<http://www.iol.unh.edu> [-- Attachment #2: Type: text/html, Size: 5152 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-30 7:51 ` Liguzinski, WojciechX @ 2021-12-10 13:31 ` Liguzinski, WojciechX 0 siblings, 0 replies; 16+ messages in thread From: Liguzinski, WojciechX @ 2021-12-10 13:31 UTC (permalink / raw) To: Brandon Lo Cc: Lincoln Lavoie, Dumitrescu, Cristian, Thomas Monjalon, David Marchand, Ajmera, Megha, Singh, Jasvinder, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX, Danilewicz, MarcinX [-- Attachment #1: Type: text/plain, Size: 3128 bytes --] Hi, Unfortunately, I haven’t been able to move the investigation much further. I have been running those tests on machines with higher amount of RAM than 4GB, but with hugepages set there to 1GB and using the script provided by Lincoln. For several runs red_autotest tests didn’t fail even once, not giving any clue what might be the cause of what’s happening on CI. +Adding Marcin Danilewicz To let you know, Marcin Danilewicz will be taking over my tasks, so for any further aspects please include or direct messages to him. Best Regards, Wojciech From: Liguzinski, WojciechX <wojciechx.liguzinski@intel.com> Sent: Tuesday, November 30, 2021 8:51 AM To: Brandon Lo <blo@iol.unh.edu> Cc: Lincoln Lavoie <lylavoie@iol.unh.edu>; Dumitrescu, Cristian <cristian.dumitrescu@intel.com>; Thomas Monjalon <thomas@monjalon.net>; David Marchand <david.marchand@redhat.com>; Ajmera, Megha <megha.ajmera@intel.com>; Singh, Jasvinder <jasvinder.singh@intel.com>; dev <dev@dpdk.org>; Aaron Conole <aconole@redhat.com>; Yigit, Ferruh <ferruh.yigit@intel.com>; ci@dpdk.org; Zegota, AnnaX <annax.zegota@intel.com> Subject: RE: [dpdk-dev] [Bug 826] red_autotest random failures Ok, thanks Brandon for the tip :) Let’s see if I can setup the machine with such configuration. Cheers, Wojciech From: Brandon Lo <blo@iol.unh.edu<mailto:blo@iol.unh.edu>> Sent: Monday, November 29, 2021 6:58 PM To: Liguzinski, WojciechX <wojciechx.liguzinski@intel.com<mailto:wojciechx.liguzinski@intel.com>> Cc: Lincoln Lavoie <lylavoie@iol.unh.edu<mailto:lylavoie@iol.unh.edu>>; Dumitrescu, Cristian <cristian.dumitrescu@intel.com<mailto:cristian.dumitrescu@intel.com>>; Thomas Monjalon <thomas@monjalon.net<mailto:thomas@monjalon.net>>; David Marchand <david.marchand@redhat.com<mailto:david.marchand@redhat.com>>; Ajmera, Megha <megha.ajmera@intel.com<mailto:megha.ajmera@intel.com>>; Singh, Jasvinder <jasvinder.singh@intel.com<mailto:jasvinder.singh@intel.com>>; dev <dev@dpdk.org<mailto:dev@dpdk.org>>; Aaron Conole <aconole@redhat.com<mailto:aconole@redhat.com>>; Yigit, Ferruh <ferruh.yigit@intel.com<mailto:ferruh.yigit@intel.com>>; ci@dpdk.org<mailto:ci@dpdk.org>; Zegota, AnnaX <annax.zegota@intel.com<mailto:annax.zegota@intel.com>> Subject: Re: [dpdk-dev] [Bug 826] red_autotest random failures On Wed, Nov 24, 2021 at 2:48 AM Liguzinski, WojciechX <wojciechx.liguzinski@intel.com<mailto:wojciechx.liguzinski@intel.com>> wrote: Hi, Thanks Lincoln, I will also have a try with such script. Cheers, Wojciech Hello Wojciech, I also recommend trying to run the test with around 4GB of RAM and 2GB of hugepages to see if it fails. That is roughly the number of resources we have per machine that is completely dedicated to unit tests. The amount of RAM available can sometimes increase depending on how many jobs are running per machine, but 4GB is the lowest it can go for the unit test job. Thanks, Brandon -- Brandon Lo UNH InterOperability Laboratory 21 Madbury Rd, Suite 100, Durham, NH 03824 blo@iol.unh.edu<mailto:blo@iol.unh.edu> www.iol.unh.edu<http://www.iol.unh.edu> [-- Attachment #2: Type: text/html, Size: 7735 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-19 16:53 ` Dumitrescu, Cristian 2021-11-19 17:25 ` Lincoln Lavoie @ 2021-11-22 8:17 ` David Marchand 2021-11-22 13:34 ` Lincoln Lavoie 1 sibling, 1 reply; 16+ messages in thread From: David Marchand @ 2021-11-22 8:17 UTC (permalink / raw) To: Dumitrescu, Cristian Cc: Thomas Monjalon, Lincoln Lavoie, Ajmera, Megha, Singh, Jasvinder, Liguzinski, WojciechX, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX On Fri, Nov 19, 2021 at 5:54 PM Dumitrescu, Cristian <cristian.dumitrescu@intel.com> wrote: > On a different point, we should probably tweak our autotests to differentiate between logical failures and those failures related to resources not being available, and flag the test result accordingly in the report. For example, if memory allocation fails, the test should be flagged as "Not enough resources" instead of simply "Failed". In the first case, the next step should be fixing the test setup, while in the second case the next step should be fixing the code. What do people think on this? In such case, the test must return TEST_SKIPPED. I did a pass for cores count / specific hw requirements, some time ago. See https://git.dpdk.org/dpdk/commit/?id=e0f4a0ed4237 -- David Marchand ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [dpdk-dev] [Bug 826] red_autotest random failures 2021-11-22 8:17 ` David Marchand @ 2021-11-22 13:34 ` Lincoln Lavoie 0 siblings, 0 replies; 16+ messages in thread From: Lincoln Lavoie @ 2021-11-22 13:34 UTC (permalink / raw) To: David Marchand Cc: Dumitrescu, Cristian, Thomas Monjalon, Ajmera, Megha, Singh, Jasvinder, Liguzinski, WojciechX, dev, Aaron Conole, Yigit, Ferruh, ci, Zegota, AnnaX [-- Attachment #1: Type: text/plain, Size: 1488 bytes --] On Mon, Nov 22, 2021 at 3:17 AM David Marchand <david.marchand@redhat.com> wrote: > On Fri, Nov 19, 2021 at 5:54 PM Dumitrescu, Cristian > <cristian.dumitrescu@intel.com> wrote: > > On a different point, we should probably tweak our autotests to > differentiate between logical failures and those failures related to > resources not being available, and flag the test result accordingly in the > report. For example, if memory allocation fails, the test should be flagged > as "Not enough resources" instead of simply "Failed". In the first case, > the next step should be fixing the test setup, while in the second case the > next step should be fixing the code. What do people think on this? > > In such case, the test must return TEST_SKIPPED. > > If the purpose of the component / function being tested is to get / create / reserve the resource(s), the failure might be valid. So it can't be applied across the board. But places where the test is checking other functionality, this might at least prevent some failures that are transient (i.e. based on what the test could "get" from the system at that moment in time). > I did a pass for cores count / specific hw requirements, some time ago. > See https://git.dpdk.org/dpdk/commit/?id=e0f4a0ed4237 > > > -- > David Marchand > > -- *Lincoln Lavoie* Principal Engineer, Broadband Technologies 21 Madbury Rd., Ste. 100, Durham, NH 03824 lylavoie@iol.unh.edu https://www.iol.unh.edu +1-603-674-2755 (m) <https://www.iol.unh.edu> [-- Attachment #2: Type: text/html, Size: 2996 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
end of thread, other threads:[~2021-12-10 13:31 UTC | newest] Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2021-10-08 7:23 [dpdk-dev] [Bug 826] red_autotest random failures bugzilla 2021-11-12 13:51 ` David Marchand 2021-11-12 14:10 ` Lincoln Lavoie 2021-11-12 14:15 ` David Marchand 2021-11-15 11:51 ` Dumitrescu, Cristian 2021-11-15 17:26 ` Liguzinski, WojciechX 2021-11-18 22:10 ` Liguzinski, WojciechX 2021-11-19 7:26 ` Thomas Monjalon 2021-11-19 16:53 ` Dumitrescu, Cristian 2021-11-19 17:25 ` Lincoln Lavoie 2021-11-24 7:48 ` Liguzinski, WojciechX 2021-11-29 17:58 ` Brandon Lo 2021-11-30 7:51 ` Liguzinski, WojciechX 2021-12-10 13:31 ` Liguzinski, WojciechX 2021-11-22 8:17 ` David Marchand 2021-11-22 13:34 ` Lincoln Lavoie
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).