From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-it0-f52.google.com (mail-it0-f52.google.com [209.85.214.52]) by dpdk.org (Postfix) with ESMTP id 0D32F2BAA for ; Fri, 14 Jul 2017 00:33:42 +0200 (CEST) Received: by mail-it0-f52.google.com with SMTP id 188so5571852itx.0 for ; Thu, 13 Jul 2017 15:33:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=pp/3fnDYRWA3I+rhd3g0kHHE/WG98BLL3drZI21DkF4=; b=GNeogIw/uY1q5kd9Sg8VRVMLnuaGA9M8B/y40YrMPH0Bydwc1gu2ES5Yv19iQhY2Wu Cl8rTOOn8A/Yn3Phb5YWXujeDzEQFeg9AxFyfJRL33clQUKD6g28s35xFCxYj5mzNWq3 7j03XB+QbCBth6WVfAYMMbz4wxi4bhU6wwT+m4D3eYL4rDlTPUxABiyNH2uTNaYGAuLE 1q5OF5fOvAR23T+rgkw/WRYokhIxsn16ffVlhqczXml0OtGNwlL+UTZnukkzHPS6Pkc2 WZDI5boeBTgK1/aAStwJcpJ8FOwJM7RJzjvuwNKC6LbN9lRvwgKWng6shiTzVwvOymlQ 6okQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=pp/3fnDYRWA3I+rhd3g0kHHE/WG98BLL3drZI21DkF4=; b=jAXzwiVzN/0JKL2x4A/fRPwaIdtNIwRzi9Yi0p3Zjq7g2U91/elQfbfY9rQwHp7jeI pLMUK7Czn8v71Dyr7ariFTB3NjkRZy4kVhqSuDHmP0sDzTIr1LxHxBn0SDOtMj0/uGBA qdiSoiKO1crEnxqM0Iu7wVP5Zg1s7PddnbmMrjwblqWXzNc2BC9AUQ0WLXEy/SCmBcyZ x4V5YBafszZCG/TkrgJeOGN7tEKrKouu+iBpcU6jXI78llwCI+TQfjqi+/Lgk7fPJ95k e9g5BhmkzhWHjJE1e/fyIXaTqAKjsYxrE1FYHEvKveZqwD1RJ9MtpBa2uWVN89396fM6 4LGQ== X-Gm-Message-State: AIVw1101NYan+HO7x4uXyd9DlPNIYDWrQIQw142CLcaLHVtpIbu163HY rFGg8nsg+9NrxFNFAcaaT6RZgZHwM3Ld X-Received: by 10.36.189.198 with SMTP id x189mr1130302ite.56.1499985221038; Thu, 13 Jul 2017 15:33:41 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.133.86 with HTTP; Thu, 13 Jul 2017 15:33:40 -0700 (PDT) From: Alex Kiselev Date: Fri, 14 Jul 2017 01:33:40 +0300 Message-ID: To: users Content-Type: text/plain; charset="UTF-8" Subject: [dpdk-users] bonding driver LACP mode issues X-BeenThere: users@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK usage discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 13 Jul 2017 22:33:42 -0000 New info: I've added RTE_LOG call right after the rte_eth_tx_burst in the bond_ethdev_tx_burst_8023ad in the file rte_eth_bond_pmd.c bond_ethdev_tx_burst_8023ad(...) { ... num_tx_slave = rte_eth_tx_burst(slaves[i], bd_tx_q->queue_id, slave_bufs[i], slave_nb_pkts[i]); RTE_LOG(DEBUG, PMD, "rte_eth_tx_burst: port_id %hhu, queue_id %hu, num pkts %hu, " "was sent %hu, slow pkts %hhu\n", slaves[i], bd_tx_q->queue_id, slave_nb_pkts[i], num_tx_slave, slave_slow_nb_pkts[i]); ... } When my app stopped forward packets I set log_level to debug and got some new debug data. It turned out that the problem is tx queue_id #0. It's the only tx queue that stops sending packets on both slave ports 0 and 1. tail -f /run/the_router | grep 'was sent 0' | 'grep port_id 0' Jul 14 02:51:03 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 0, queue_id 0, num pkts 1, was sent 0, slow pkts 0 Jul 14 02:51:03 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 0, queue_id 0, num pkts 1, was sent 0, slow pkts 0 Jul 14 02:51:03 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 0, queue_id 0, num pkts 1, was sent 0, slow pkts 0 tail -f /run/the_router | grep 'was sent 0' | 'grep port_id 1' Jul 14 02:44:45 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 0, num pkts 1, was sent 0, slow pkts 0 Jul 14 02:44:46 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 0, num pkts 31, was sent 0, slow pkts 0 Jul 14 02:44:46 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 0, num pkts 1, was sent 0, slow pkts 0 The others tx queues are ok. Each packet that goes into tx queue other than #0 is sent to the wire. tail -f /run/the_router | grep -v 'was sent 0' | grep 'was sent' Jul 14 03:04:06 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 3, num pkts 4, was sent 4, slow pkts 0 Jul 14 03:04:06 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 2, num pkts 1, was sent 1, slow pkts 0 Jul 14 03:04:06 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 3, num pkts 1, was sent 1, slow pkts 0 Jul 14 03:04:06 bizin the_router.lag[22550]: PMD: rte_eth_tx_burst: port_id 1, queue_id 3, num pkts 1, was sent 1, slow pkts 0 I still have no clue what could cause such behavior and I am running out of ideas how to further debug it. Please, anybody, help! I would love to hear any ideas. -- Alex Kiselev