From: Kevin Traynor
To: David Marchand
Cc: Bruce Richardson, Dariusz Sosnowski, dpdk stable
Subject: patch 'test/debug: fix crash with mlx5 devices' has been queued to stable release 24.11.4
Date: Fri, 31 Oct 2025 14:33:05 +0000
Message-ID: <20251031143421.324432-63-ktraynor@redhat.com>
In-Reply-To: <20251031143421.324432-1-ktraynor@redhat.com>
References: <20251031143421.324432-1-ktraynor@redhat.com>

Hi,

FYI, your patch has been queued to stable release 24.11.4

Note it hasn't been pushed to http://dpdk.org/browse/dpdk-stable yet.
It will be pushed if I get no objections before 11/05/25. So please
shout if anyone has objections.

Also note that after the patch there's a diff of the upstream commit vs the
patch applied to the branch. This will indicate if there was any rebasing
needed to apply to the stable branch. If there were code changes for rebasing
(ie: not only metadata diffs), please double check that the rebase was
correctly done.

Queued patches are on a temporary branch at:
https://github.com/kevintraynor/dpdk-stable

This queued commit can be viewed at:
https://github.com/kevintraynor/dpdk-stable/commit/c44513b1372b02a21c5c99edd7293faa792b0a6c

Thanks.
Kevin

---
From c44513b1372b02a21c5c99edd7293faa792b0a6c Mon Sep 17 00:00:00 2001
From: David Marchand
Date: Thu, 2 Oct 2025 17:36:50 +0200
Subject: [PATCH] test/debug: fix crash with mlx5 devices

[ upstream commit 2b403dd8fb37d0ba13723e44ffc7ee2c2795f838 ]

Running rte_exit() in a forked process means that shared memory will be
released by the child process before the parent process does the same.
This issue has been seen recently when some GHA virtual machine (with some
mlx5 devices) runs the debug_autotest unit test.

Instead, run rte_panic() and rte_exit() from a new DPDK process spawned
like for other recursive unit tests.

Bugzilla ID: 1796
Fixes: af75078fece3 ("first public release")

Signed-off-by: David Marchand
Acked-by: Bruce Richardson
Acked-by: Dariusz Sosnowski
---
 app/test/process.h    |  2 +-
 app/test/test.c       |  2 +
 app/test/test.h       |  2 +
 app/test/test_debug.c | 92 ++++++++++++++++++++++++++++++-------------
 4 files changed, 69 insertions(+), 29 deletions(-)

diff --git a/app/test/process.h b/app/test/process.h
index 9fb2bf481c..8e11d0b059 100644
--- a/app/test/process.h
+++ b/app/test/process.h
@@ -204,5 +204,5 @@ process_dup(const char *const argv[], int numargs, const char *env_value)
  */
 #ifdef RTE_EXEC_ENV_LINUX
-static char *
+static inline char *
 get_current_prefix(char *prefix, int size)
 {
diff --git a/app/test/test.c b/app/test/test.c
index 680351f6a3..5b69f40f3d 100644
--- a/app/test/test.c
+++ b/app/test/test.c
@@ -81,4 +81,6 @@ do_recursive_call(void)
 		{ "test_file_prefix", no_action },
 		{ "test_no_huge_flag", no_action },
+		{ "test_panic", test_panic },
+		{ "test_exit", test_exit },
 #ifdef RTE_LIB_TIMER
 #ifndef RTE_EXEC_ENV_WINDOWS
diff --git a/app/test/test.h b/app/test/test.h
index 15e23d297f..fd8cc10b53 100644
--- a/app/test/test.h
+++ b/app/test/test.h
@@ -175,5 +175,7 @@ int commands_init(void);
 int command_valid(const char *cmd);
 
+int test_exit(void);
 int test_mp_secondary(void);
+int test_panic(void);
 int test_timer_secondary(void);
 
diff --git a/app/test/test_debug.c b/app/test/test_debug.c
index 8ad6d40fcb..fe5dd5b02d 100644
--- a/app/test/test_debug.c
+++ b/app/test/test_debug.c
@@ -9,4 +9,16 @@
 
 #ifdef RTE_EXEC_ENV_WINDOWS
+int
+test_panic(void)
+{
+	printf("debug not supported on Windows, skipping test\n");
+	return TEST_SKIPPED;
+}
+int
+test_exit(void)
+{
+	printf("debug not supported on Windows, skipping test\n");
+	return TEST_SKIPPED;
+}
 static int
 test_debug(void)
@@ -26,5 +38,7 @@ test_debug(void)
 #include
 #include
-#include
+#include
+
+#include "process.h"
 
 /*
@@ -33,14 +47,12 @@ test_debug(void)
  */
 
-/* use fork() to test rte_panic() */
-static int
+static const char *test_args[7];
+
+int
 test_panic(void)
 {
-	int pid;
 	int status;
 
-	pid = fork();
-
-	if (pid == 0) {
+	if (getenv(RECURSIVE_ENV_VAR) != NULL) {
 		struct rlimit rl;
 
@@ -49,9 +61,6 @@ test_panic(void)
 		setrlimit(RLIMIT_CORE, &rl);
 		rte_panic("Test Debug\n");
-	} else if (pid < 0) {
-		printf("Fork Failed\n");
-		return -1;
 	}
-	wait(&status);
+	status = process_dup(test_args, RTE_DIM(test_args), "test_panic");
 	if(status == 0){
 		printf("Child process terminated normally!\n");
@@ -63,25 +72,14 @@ test_panic(void)
 }
 
-/* use fork() to test rte_exit() */
 static int
 test_exit_val(int exit_val)
 {
-	int pid;
+	char buf[5];
 	int status;
 
-	/* manually cleanup EAL memory, as the fork() below would otherwise
-	 * cause the same hugepages to be free()-ed multiple times.
-	 */
-	rte_service_finalize();
-
-	pid = fork();
-
-	if (pid == 0)
-		rte_exit(exit_val, __func__);
-	else if (pid < 0){
-		printf("Fork Failed\n");
-		return -1;
-	}
-	wait(&status);
+	sprintf(buf, "%d", exit_val);
+	if (setenv("TEST_DEBUG_EXIT_VAL", buf, 1) == -1)
+		rte_panic("Failed to set exit value in env\n");
+	status = process_dup(test_args, RTE_DIM(test_args), "test_exit");
 	printf("Child process status: %d\n", status);
 	if(!WIFEXITED(status) || WEXITSTATUS(status) != (uint8_t)exit_val){
@@ -93,9 +91,20 @@ test_exit_val(int exit_val)
 }
 
-static int
+int
 test_exit(void)
 {
 	int test_vals[] = { 0, 1, 2, 255, -1 };
 	unsigned i;
+
+	if (getenv(RECURSIVE_ENV_VAR) != NULL) {
+		int exit_val;
+
+		if (!getenv("TEST_DEBUG_EXIT_VAL"))
+			rte_panic("No exit value set in env\n");
+
+		exit_val = strtol(getenv("TEST_DEBUG_EXIT_VAL"), NULL, 0);
+		rte_exit(exit_val, __func__);
+	}
+
 	for (i = 0; i < RTE_DIM(test_vals); i++) {
 		if (test_exit_val(test_vals[i]) < 0)
@@ -129,4 +138,31 @@ static int
 test_debug(void)
 {
+#ifdef RTE_EXEC_ENV_FREEBSD
+	/* BSD target doesn't support prefixes at this point, and we also need to
+	 * run another primary process here.
+	 */
+	const char * prefix = "--no-shconf";
+#else
+	const char * prefix = "--file-prefix=debug";
+#endif
+	char core[10];
+
+	sprintf(core, "%d", rte_get_main_lcore());
+
+	test_args[0] = prgname;
+	test_args[1] = prefix;
+	test_args[2] = "-l";
+	test_args[3] = core;
+
+	if (rte_eal_has_hugepages()) {
+		test_args[4] = "";
+		test_args[5] = "";
+		test_args[6] = "";
+	} else {
+		test_args[4] = "--no-huge";
+		test_args[5] = "-m";
+		test_args[6] = "2048";
+	}
+
 	rte_dump_stack();
 	if (test_panic() < 0)
-- 
2.51.0

---
  Diff of the applied patch vs upstream commit (please double-check if non-empty:
---
--- -	2025-10-31 13:53:54.192058885 +0000
+++ 0063-test-debug-fix-crash-with-mlx5-devices.patch	2025-10-31 13:53:52.169523783 +0000
@@ -1 +1 @@
-From 2b403dd8fb37d0ba13723e44ffc7ee2c2795f838 Mon Sep 17 00:00:00 2001
+From c44513b1372b02a21c5c99edd7293faa792b0a6c Mon Sep 17 00:00:00 2001
@@ -5,0 +6,2 @@
+[ upstream commit 2b403dd8fb37d0ba13723e44ffc7ee2c2795f838 ]
+
@@ -16 +17,0 @@
-Cc: stable@dpdk.org
@@ -40 +41 @@
-index fd653cbbfd..8a4598baee 100644
+index 680351f6a3..5b69f40f3d 100644
@@ -51 +52 @@
-index ebc4864bf8..c6d7d23313 100644
+index 15e23d297f..fd8cc10b53 100644
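
For readers following the exit-value handoff in test_exit_val() and test_exit(), here is a minimal standalone sketch of the same idea: the parent stores the requested exit value in the TEST_DEBUG_EXIT_VAL environment variable, the child reads it back with strtol() and exits with it, and the parent compares the low 8 bits of the wait status. Assumptions, not part of the patch: the helper names run_child_with_exit_val() and exit_val_matches() are hypothetical, and the child is created with a plain fork() and ended with _exit() only to keep the sketch self-contained. The patch itself spawns a fresh DPDK process through process_dup() and calls rte_exit(), precisely because fork() is unsafe once EAL shared memory is involved.

```c
#define _POSIX_C_SOURCE 200809L

#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Same variable name as in the patch; everything else here is illustrative. */
#define EXIT_VAL_ENV "TEST_DEBUG_EXIT_VAL"

/*
 * Hypothetical helper: pass exit_val to a child through the environment,
 * let the child exit with it, and store the raw wait status in *status.
 * Returns 0 on success, -1 if the child could not be created or reaped.
 */
static int
run_child_with_exit_val(int exit_val, int *status)
{
	char buf[16];
	pid_t pid;

	assert(status != NULL);

	snprintf(buf, sizeof(buf), "%d", exit_val);
	if (setenv(EXIT_VAL_ENV, buf, 1) == -1)
		return -1;

	fflush(NULL); /* avoid duplicated stdio buffers in the child */
	pid = fork();
	if (pid < 0)
		return -1;

	if (pid == 0) {
		/* Child side: mirrors the getenv()/strtol() step in test_exit(). */
		const char *s = getenv(EXIT_VAL_ENV);

		if (s == NULL)
			abort();
		_exit((int)strtol(s, NULL, 0));
	}

	if (waitpid(pid, status, 0) != pid)
		return -1;
	return 0;
}

/*
 * Only the low 8 bits of an exit value reach the parent, hence the
 * (uint8_t) cast: an exit value of -1 is observed as 255.
 */
static int
exit_val_matches(int status, int exit_val)
{
	return WIFEXITED(status) && WEXITSTATUS(status) == (uint8_t)exit_val;
}
```

Calling run_child_with_exit_val() for each of 0, 1, 2, 255 and -1 (the values in test_vals[]) and checking the result with exit_val_matches() reproduces the comparison done in test_exit_val().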