From: "Wiles, Keith"
To: Harsh Patel
CC: "users@dpdk.org"
Subject: Re: [dpdk-users] Query on handling packets
Date: Wed, 14 Nov 2018 15:02:05 +0000
Message-ID: <6B171882-9794-4A80-977E-30BA3E52B3B5@intel.com>

On Nov 14, 2018, at 7:54 AM, Harsh Patel wrote:

Hello,

This is a link to the complete source code of our project: https://github.com/ns-3-dpdk-integration/ns-3-dpdk
For a description of the project, see https://ns-3-dpdk-integration.github.io/
Once you go through it, you will have a basic understanding of the project. A link to the installation instructions is provided on the github.io page.

In the code mentioned above, the master branch contains the implementation of the logic using rte_rings, which we mentioned at the very beginning of the discussion. There is a branch named "newrxtx" which contains the implementation following the logic you provided. We would like you to take a look at the code in the newrxtx branch (https://github.com/ns-3-dpdk-integration/ns-3-dpdk/tree/newrxtx).

In that branch, go to the ns-allinone-3.28.1/ns-3.28.1/src/fd-net-device/model/ directory. Here we have implemented the DpdkNetDevice model, which contains the code providing the interaction between ns-3 and DPDK. We would like you to take a look at our Read function (https://github.com/ns-3-dpdk-integration/ns-3-dpdk/blob/newrxtx/ns-allinone-3.28.1/ns-3.28.1/src/fd-net-device/model/dpdk-net-device.cc#L626) and Write function (https://github.com/ns-3-dpdk-integration/ns-3-dpdk/blob/newrxtx/ns-allinone-3.28.1/ns-3.28.1/src/fd-net-device/model/dpdk-net-device.cc#L576). These contain the logic you suggested.
I looked at the read and write routines briefly. The one thing that jumped out at me is that you copy the packet from an internal data buffer to the mbuf, or from the mbuf to the data buffer. You should try your hardest to remove these memcpy calls from the data path, as they will kill your performance. If you have to use memcpy, look at the rte_memcpy() routine, as it is highly optimized for DPDK. Even using DPDK's rte_memcpy() you will still see a big performance hit.

I did not look at where the buffer came from, but maybe you could allocate a pktmbuf pool (as you did) and, when your main code asks for a buffer, grab an mbuf, point to the start of the mbuf's data area and return that pointer instead. Then, when you get to the write or read routine, you find the start of the mbuf header based on the buffer address, or from some metadata attached to the buffer. Then you can call the rte_eth_tx_buffer() routine with that mbuf pointer. On the TX side the mbuf is freed by the driver (though it could sit on the TX done queue for a while), so just make sure you have enough buffers.

On the read side you also need to find the place where the buffer is allocated, allocate an mbuf there, and save the mbuf pointer in the buffer's metadata (if you have metadata per buffer); then you can free the mbuf at some point after you have processed the data buffer.
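Roughly, the idea looks like this. This is only an untested sketch; struct buf_meta, dpdk_buf_alloc() and dpdk_buf_write() are names made up here for illustration, and only the rte_* calls are real DPDK APIs. The TX buffer is assumed to have been set up earlier with rte_eth_tx_buffer_init().

#include <rte_ethdev.h>
#include <rte_mbuf.h>
#include <rte_mempool.h>

struct buf_meta {                /* per-buffer metadata kept by the caller */
    struct rte_mbuf *mbuf;       /* mbuf that owns the data area */
};

/* Hand out the mbuf's data area instead of a private buffer, so no
 * memcpy is needed later. The caller keeps 'meta' next to the pointer. */
static uint8_t *
dpdk_buf_alloc(struct rte_mempool *pool, struct buf_meta *meta)
{
    struct rte_mbuf *m = rte_pktmbuf_alloc(pool);
    if (m == NULL)
        return NULL;
    meta->mbuf = m;
    return rte_pktmbuf_mtod(m, uint8_t *);   /* start of packet data */
}

/* Write path: recover the mbuf from the metadata and queue it for TX.
 * The driver frees the mbuf after it has been transmitted. */
static void
dpdk_buf_write(uint16_t port, uint16_t queue,
               struct rte_eth_dev_tx_buffer *txbuf,
               struct buf_meta *meta, uint16_t len)
{
    struct rte_mbuf *m = meta->mbuf;

    m->data_len = len;
    m->pkt_len  = len;
    rte_eth_tx_buffer(port, queue, txbuf, m);   /* buffered, sent as a burst */
}

Keep in mind that rte_eth_tx_buffer() only hands packets to the driver once the buffer fills (or when you flush), so you also need a periodic rte_eth_tx_buffer_flush() in your send path so a lone packet does not sit in the buffer.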
I hope that is clear; I have meetings I must attend.

Can you go through this and suggest some changes, or find any mistakes in our code? If you need any help or have any doubts, ping us.

Thanks and Regards,
Harsh & Hrishikesh

On Tue, 13 Nov 2018 at 19:17, Wiles, Keith wrote:
>
> On Nov 12, 2018, at 8:25 PM, Harsh Patel wrote:
>
> > Hello,
> > It would be really helpful if you could provide us a link (for both Tx and Rx) to the project you mentioned earlier where you worked on a similar problem, if possible.
>
> At this time I can not provide a link. I will try and see what I can do, but do not hold your breath; it could be a while, as we have to go through a lot of legal stuff. Try the VTune tool from Intel for x86 systems if you can get a copy for your platform, as it can tell you a lot about the code and where the performance issues are located. If you are not running Intel x86 then my code may not work for you; I do not remember if you told me which platform.
>
> Thanks and Regards,
> Harsh & Hrishikesh.
>
> On Mon, 12 Nov 2018 at 01:15, Harsh Patel wrote:
> Thanks a lot for all the support. We are looking into our work as of now and will contact you once we are done checking it completely from our side. Thanks for the help.
>
> Regards,
> Harsh and Hrishikesh
>
> On Sat, 10 Nov 2018 at 11:47, Wiles, Keith wrote:
> Please make sure to send your emails in plain text format. The Mac mail program loves to use rich-text format if the original email uses it, even though I have told it to only send plain text :-(
>
> > On Nov 9, 2018, at 4:09 AM, Harsh Patel wrote:
> >
> > We have implemented the logic for Tx/Rx as you suggested. We compared the obtained throughput with another version of the same application that uses Linux raw sockets.
> > Unfortunately, the throughput we receive in our DPDK application is less by a good margin. Is there any way we can optimize our implementation, or anything that we are missing?
> >
>
> The PoC code I was developing for DAPI did not have any performance issues; it ran just as fast in my limited testing. I converted the l3fwd code and I saw 10G 64-byte wire rate, as I remember, using pktgen to generate the traffic.
>
> Not sure why you would see a big performance drop, but I do not know your application or code.
>
> > Thanks and regards
> > Harsh & Hrishikesh
> >
> > On Thu, 8 Nov 2018 at 23:14, Wiles, Keith wrote:
> >
> >> On Nov 8, 2018, at 4:58 PM, Harsh Patel wrote:
> >>
> >> Thanks for your insight on the topic. Transmission is working with the functions you mentioned. We tried to search for some similar functions for handling incoming packets but could not find anything. Can you help us with that as well?
> >>
> >
> > I do not know of a DPDK API set for the RX side. But in the DAPI (DPDK API) PoC I was working on and presented at the DPDK Summit last Sept., I did create an RX-side version. The issue is that it is a bit tangled up in the DAPI PoC.
> >
> > The basic concept is that a call to RX a single packet does an rx_burst of N packets, keeping them in an mbuf list. The code would spin waiting for mbufs to arrive, or return quickly if a flag was set. When it did find RX mbufs it would return just the single mbuf and keep the list of mbufs for later requests; once the list is empty it does another rx_burst call.
> >
> > Sorry, this is a really quick note on how it works. If you need more details we can talk more later.
> >
> >> Regards,
> >> Harsh and Hrishikesh.
> >>
> >> On Thu, 8 Nov 2018 at 14:26, Wiles, Keith wrote:
> >>
> >> > On Nov 8, 2018, at 8:24 AM, Harsh Patel wrote:
> >> >
> >> > Hi,
> >> > We are working on a project where we are trying to integrate DPDK with another software. We are able to obtain packets from the other environment into the DPDK environment in a one-by-one fashion. On the other hand, DPDK sends and receives data packets in bursts. We want to know if there is any functionality in DPDK to convert a single incoming packet into a burst of packets sent on the NIC and, similarly, to take a burst of packets read from the NIC and send them to the other environment sequentially?
> >>
> >> Search the docs or the lib/librte_ethdev directory for rte_eth_tx_buffer_init, rte_eth_tx_buffer, ...
> >>
> >> > Thanks and regards
> >> > Harsh Patel, Hrishikesh Hiraskar
> >> > NITK Surathkal
> >>
> >> Regards,
> >> Keith
> >
> > Regards,
> > Keith
>
> Regards,
> Keith

Regards,
Keith
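P.S. The RX side described in the quoted thread above (a single-packet read served out of a cached rx_burst) would look roughly like the following. Again, only an untested sketch; struct rx_cache and dpdk_read_one() are names invented here, and only the rte_* calls are real DPDK APIs.

#include <rte_ethdev.h>
#include <rte_mbuf.h>

#define RX_BURST_SIZE 32

struct rx_cache {
    struct rte_mbuf *pkts[RX_BURST_SIZE];
    uint16_t count;   /* packets currently held in the cache */
    uint16_t next;    /* index of the next packet to hand out */
};

/* Return one mbuf per call; refill the cache with rte_eth_rx_burst()
 * only when it runs empty. Returns NULL when nothing is available. */
static struct rte_mbuf *
dpdk_read_one(uint16_t port, uint16_t queue, struct rx_cache *c)
{
    if (c->next >= c->count) {
        c->count = rte_eth_rx_burst(port, queue, c->pkts, RX_BURST_SIZE);
        c->next = 0;
        if (c->count == 0)
            return NULL;          /* caller can poll again or back off */
    }
    return c->pkts[c->next++];    /* caller frees with rte_pktmbuf_free() */
}

The TX direction (single packet in, burst out on the wire) is what the rte_eth_tx_buffer_init()/rte_eth_tx_buffer()/rte_eth_tx_buffer_flush() routines mentioned in the thread already provide.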