From: Matheus Salgueiro Castanho
Date: Mon, 27 Aug 2018 16:54:48 -0300
To: users@dpdk.org
Subject: [dpdk-users] Possible causes for packet drop during TX buffer flush

Hi all,

I've been working on a DPDK packet classifier, which receives packets through one port, performs classification + encapsulation, and then sends the packets out through a second port. The application runs inside a VM, virtualized with QEMU/KVM and using OVS-DPDK for networking.

Although functional, it shows some packet loss (< 0.1%) starting at 350 Mbps. I was able to reduce the loss through better resource allocation (isolated and pinned CPU cores, via libvirt for the VM and for the OVS PMD threads on the host), but some loss remains. What is even weirder is that the loss grows linearly from 350 Mbps to 650 Mbps, drops dramatically at 700 Mbps, and then increases again from 750 Mbps up to 950 Mbps.

Right now I'm struggling to find other performance bottlenecks that might be causing this issue. I was able to confirm that the application receives every packet sent by my traffic generator (the packet counts inside the application and at the source match), but for some reason packets get dropped during transmission.

I'm counting packet loss using rte_eth_tx_buffer_drop_callback(), registered through rte_eth_tx_buffer_set_err_callback(). As the documentation states, this callback is invoked when packets are dropped inside the rte_eth_tx_buffer() and rte_eth_tx_buffer_flush() APIs.

I imagined that packets might be dropped because of a full TX buffer, so I increased its size to 1024. That brought no improvement, so I set it back to the same value as the burst size, which is 64.

I then changed my callback function to record the current TX buffer length whenever a drop occurs. Testing at 650 Mbps, I found that 50% of the drop events happen with a single packet in the TX buffer, another 23% with 32 packets in it, and the rest with counts in between.
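For context, here is roughly how the TX buffer and the callback are wired up (a trimmed-down sketch, not my exact code; the drop_stats struct and names like tx_drop_cb are placeholders):

#include <rte_ethdev.h>
#include <rte_mbuf.h>
#include <rte_malloc.h>

#define BURST_SIZE 64

/* Histogram of how many packets were left unsent per drop event */
struct drop_stats {
	uint64_t drops;                 /* total packets dropped */
	uint64_t hist[BURST_SIZE + 1];  /* events indexed by unsent count */
};

/* Error callback: frees the unsent mbufs and records the event */
static void
tx_drop_cb(struct rte_mbuf **unsent, uint16_t count, void *userdata)
{
	struct drop_stats *st = userdata;
	uint16_t i;

	st->drops += count;
	if (count <= BURST_SIZE)
		st->hist[count]++;

	for (i = 0; i < count; i++)
		rte_pktmbuf_free(unsent[i]);
}

/* Per-port TX buffer setup, done once before the main loop */
static struct rte_eth_dev_tx_buffer *
setup_tx_buffer(uint16_t port_id, struct drop_stats *st)
{
	struct rte_eth_dev_tx_buffer *buffer;

	buffer = rte_zmalloc_socket("tx_buffer",
			RTE_ETH_TX_BUFFER_SIZE(BURST_SIZE), 0,
			rte_eth_dev_socket_id(port_id));
	if (buffer == NULL)
		return NULL;

	rte_eth_tx_buffer_init(buffer, BURST_SIZE);

	/* Replace the default drop callback with the instrumented one */
	if (rte_eth_tx_buffer_set_err_callback(buffer, tx_drop_cb, st) != 0)
		return NULL;

	return buffer;
}

The main loop then queues each packet with rte_eth_tx_buffer(port_id, 0, buffer, m) and flushes periodically with rte_eth_tx_buffer_flush(port_id, 0, buffer).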
So I have a few questions:

- Since I'm able to receive, process, and buffer all packets, what could be causing the drops?
- In which situations do rte_eth_tx_buffer() and rte_eth_tx_buffer_flush() drop packets? I couldn't get this information from the documentation.

I'd appreciate any kind of help.

Best Regards,
Matheus Castanho