From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from dpdk.org (dpdk.org [92.243.14.124]) by inbox.dpdk.org (Postfix) with ESMTP id 61D4AA00E6 for ; Fri, 12 Jul 2019 18:39:04 +0200 (CEST) Received: from [92.243.14.124] (localhost [127.0.0.1]) by dpdk.org (Postfix) with ESMTP id 2BBB11BDE6; Fri, 12 Jul 2019 18:39:03 +0200 (CEST) Received: from guri.nttv6.jp (guri.nttv6.jp [115.69.228.140]) by dpdk.org (Postfix) with ESMTP id 40FDC1BDE3 for ; Fri, 12 Jul 2019 18:39:01 +0200 (CEST) Received: from z.nttv6.jp (z.nttv6.jp [IPv6:2402:c800:ff06:6::f]) by guri.nttv6.jp (NTTv6MTA) with ESMTP id B916925F6BD for ; Sat, 13 Jul 2019 01:38:56 +0900 (JST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nttv6.jp; s=20180820; t=1562949537; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding:in-reply-to: references; bh=z5sbyXISaMeNwcubFaxR7dZUjd7rgWTaoeoYzlJzW/I=; b=EYPoDQzkxruh8QuvqbfTFF/2KdLiDLMjkhxptLB8kNKRCNaMKCNXWuNjHimySgNTYgmvkS AE3gZww++oe/39q8dOmKm9dvPbDr4lhVver63fh1Q+jX/7P01CcM9+FFM3Crj8iO0oY1ah 6dzNzrqbGv4noOtveVzaSEWJOUfOiHc= Received: from localhost (fujiko.nttv6.jp [IPv6:2402:c800:ff06:136::141]) by z.nttv6.jp (NTTv6MTA) with ESMTPSA id A83A57634EE; Sat, 13 Jul 2019 01:38:56 +0900 (JST) Date: Sat, 13 Jul 2019 01:38:53 +0900 (JST) Message-Id: <20190713.013853.751044529514409504.yasu@nttv6.jp> To: dev@dpdk.org From: Yasuhiro Ohara Organizaton: NTT Communications X-Mailer: Mew version 6.8 on Emacs 26.1 Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Authentication-Results: guri.nttv6.jp; spf=pass smtp.mailfrom=yasu@nttv6.jp Subject: [dpdk-dev] ConnectX-4/mlx5 crashes around rxq_cqe_comp_en? X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" Hi, I get a crash when I put a significant amount of load on ConnectX-4/mlx5, i.e., 50Gbps for 100GbE port. Thread 22 "lcore-slave-19" received signal SIGSEGV, Segmentation fault. [Switching to Thread 0x7fffe77ee700 (LWP 33519)] 0x0000555555f010a3 in _mm_storeu_si128 (__B=..., __P=0x10) at /usr/lib/gcc/x86_64-linux-gnu/7/include/emmintrin.h:721 721 *__P = __B; (gdb) bt #0 0x0000555555f010a3 in _mm_storeu_si128 (__B=..., __P=0x10) at /usr/lib/gcc/x86_64-linux-gnu/7/include/emmintrin.h:721 #1 rxq_cq_decompress_v (rxq=0x22c910ccc0, cq=0x22c8fd1800, elts=0x22c910d240) at /usr/local/dpdk-stable-18.11.2/drivers/net/mlx5/mlx5_rxtx_vec_sse.h:421 #2 0x0000555555f04b42 in rxq_burst_v (rxq=0x22c910ccc0, pkts=0x7fffe77eba40, pkts_n=32, err=0x7fffe77dc978) at /usr/local/dpdk-stable-18.11.2/drivers/net/mlx5/mlx5_rxtx_vec_sse.h:956 #3 0x0000555555f055ea in mlx5_rx_burst_vec (dpdk_rxq=0x22c910ccc0, pkts=0x7fffe77eba40, pkts_n=32) at /usr/local/dpdk-stable-18.11.2/drivers/net/mlx5/mlx5_rxtx_vec.c:238 #4 0x0000555555632772 in rte_eth_rx_burst (port_id=4, queue_id=5, rx_pkts=0x7fffe77eba40, nb_pkts=32) at /usr/local/dpdk-18.11/x86_64-native-linuxapp-gcc/include/rte_ethdev.h:3879 My environments are: Ubuntu 18.04.2 LTS 4.15.0-50-generic MLNX_OFED_LINUX-4.5-1.0.1.0-ubuntu18.04-x86_64 fw_ver: 12.17.2020 vendor_id: 0x02c9 vendor_part_id: 4115 hw_ver: 0x0 board_id: LNR3270110033 DPDK 18.11.2 It looks like the CQE compression is the crashing place. dpdk-stable-18.11.2/drivers/net/mlx5/mlx5_rxtx_vec_sse.h:956 953 /* Decompress the last CQE if compressed. */ 954 if (comp_idx < MLX5_VPMD_DESCS_PER_LOOP && comp_idx == n) { 955 assert(comp_idx == (nocmp_n % MLX5_VPMD_DESCS_PER_LOOP)); 956 rxq_cq_decompress_v(rxq, &cq[nocmp_n], &elts[nocmp_n]); And I'm wondering how I can disable rxq_cqe_comp_en devargs. 22.5.3. Run-time configuration rxq_cqe_comp_en parameter [int] Any information or guesses are appreciated. Best regards, Yasu