From: Moti Haimovsky <motih@mellanox.com>
To: adrien.mazarguil@6wind.com
Cc: dev@dpdk.org, Moti Haimovsky <motih@mellanox.com>
Subject: [dpdk-dev] [RFC] net/mlx4: add TSO support
Date: Thu, 31 May 2018 19:21:26 +0300
Message-ID: <1527783686-1727-1-git-send-email-motih@mellanox.com>
TCP Segmentation Offload (TSO) is a feature that enables the TCP/IP
network stack to delegate the segmentation of a large TCP payload to the
NIC, thus saving compute resources.
This RFC proposes to add support for TSO to the MLX4 PMD.
Prerequisites:
In order for the PMD to recognize the TSO capabilities of the device
one has to use:
* RDMA-core v18.0 or above.
* Linux kernel 4.16 or above.
Assumptions:
* mlx4 PMD will follow the TSO support implemented in mlx5 PMD.
* PMD is backwards compatible.
** The PMD will continue to work with the kernels and RDMA-core
versions it supports today.
** The PMD will continue to work with devices not supporting TSO.
Changes proposed in the PMD for implementing TSO:
* At init, query the device for TSO support and for the maximum segment
size it supports.
The result also determines whether the PMD advertises support for TSO
(dev_info->tx_offload_capa |= DEV_TX_OFFLOAD_TCP_TSO;)
* Calling create-qp when creating a Tx queue will have to consider
the MAX TSO header size when calculating the actual queue buffer
size. This may be abstracted by calling ibv_create_qp_ex with
IBV_QP_INIT_ATTR_MAX_TSO_HEADER as comp flag rather than
ibv_create_qp.
If this breaks backwards compatibility then this calculation will
be done in the PMD code.
* Modify tx_burst function to:
** Check for TSO flag indication in the packets of the packet burst
(buf->ol_flags & PKT_TX_TCP_SEG).
** For a TSO packet, create the WQE appropriate for sending a TSO packet
and fill it with the packet info and the L2/L3/L4 headers.
* Modify Tx completion function to handle releasing of TSO packet
buffers that were transmitted.
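The init and tx_burst changes above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the proposed PMD code: every `sketch_*` / `SKETCH_*` name is hypothetical, the flag values are arbitrary stand-ins for DEV_TX_OFFLOAD_TCP_TSO and PKT_TX_TCP_SEG, and the "WQE" is reduced to the few values a TSO send would need (header length, payload length, MSS).

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-ins for DEV_TX_OFFLOAD_TCP_TSO and PKT_TX_TCP_SEG;
 * the bit positions here are arbitrary, not DPDK's. */
#define SKETCH_TX_OFFLOAD_TCP_TSO (1ULL << 0)
#define SKETCH_PKT_TX_TCP_SEG     (1ULL << 1)

/* Hypothetical per-device state filled in at init time. */
struct sketch_dev {
	bool     tso;             /* device reported TSO support */
	uint32_t max_tso;         /* max segment size reported by the device */
	uint64_t tx_offload_capa; /* capabilities advertised to applications */
};

/* Hypothetical, minimal view of the mbuf fields a TSO send needs. */
struct sketch_pkt {
	uint64_t ol_flags;
	uint32_t pkt_len;
	uint16_t l2_len, l3_len, l4_len;
	uint16_t tso_segsz;
};

/* Hypothetical summary of what the Tx routine would write into a WQE. */
struct sketch_wqe {
	bool     is_tso;
	uint32_t hdr_len;     /* L2+L3+L4 bytes copied into the WQE */
	uint32_t payload_len; /* bytes left for the NIC to segment */
	uint16_t mss;
};

/* Init step: advertise TSO only if the device query reported it. */
static void
sketch_dev_init(struct sketch_dev *dev, bool hw_tso, uint32_t hw_max_tso)
{
	dev->tso = hw_tso;
	dev->max_tso = hw_tso ? hw_max_tso : 0;
	dev->tx_offload_capa = 0;
	if (hw_tso)
		dev->tx_offload_capa |= SKETCH_TX_OFFLOAD_TCP_TSO;
}

/* Tx step: returns 0 on success, -1 if a TSO request cannot be honoured. */
static int
sketch_fill_wqe(const struct sketch_dev *dev, const struct sketch_pkt *pkt,
		struct sketch_wqe *wqe)
{
	uint32_t hdr_len;

	if (!(pkt->ol_flags & SKETCH_PKT_TX_TCP_SEG)) {
		/* Non-TSO path: nothing TSO-specific goes into the WQE. */
		wqe->is_tso = false;
		wqe->hdr_len = 0;
		wqe->payload_len = pkt->pkt_len;
		wqe->mss = 0;
		return 0;
	}
	if (!dev->tso)
		return -1;
	hdr_len = (uint32_t)pkt->l2_len + pkt->l3_len + pkt->l4_len;
	if (pkt->tso_segsz == 0 || hdr_len == 0 || hdr_len >= pkt->pkt_len)
		return -1;
	wqe->is_tso = true;
	wqe->hdr_len = hdr_len;
	wqe->payload_len = pkt->pkt_len - hdr_len;
	wqe->mss = pkt->tso_segsz;
	return 0;
}
```

The per-packet flag test in sketch_fill_wqe() is the branch that the performance concern below is about; a dedicated TSO routine would move it out of the non-TSO path.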
Concerns:
* Impact of changing Tx send routine on performance.
The performance of the tx_burst routine for non-TSO packets may be
affected just by placing the code that handles TSO packets in it,
so we may want to consider having a dedicated routine for TSO packets.
* No MAX-TSO parameter.
This is a cross-PMD issue that may need a separate mailing thread to handle.
As of today there is no way for the PMD to advertise the MAX-TSO
that it or its HW supports, as is done with other capabilities
(the indirection table size, for example;
see rte_eth_dev_info.reta_size in rte_ethdev.h).
Also, there is no DPDK parameter or constant value that the PMD
can use in order to know the MAX-TSO the system requires.
This prevents applications from determining the MAX-TSO that can be
used, leading to configuration mismatches that may cause transmit
failures or, in the best case, a less-than-optimal TSO configuration.
I propose to add a max_tso field in rte_eth_dev_info that will allow
the PMD to advertise the max TSO it supports. This can be used by
DPDK applications to determine what TSO size to use.
If this is a major change that cannot fit the 18.08 schedule then
I propose to add a MAX_TSO constant in rte_ethdev.h. The PMD will
compare this value with its own MAX-TSO, and if it cannot meet the
defined value it will not advertise that it is a TSO-capable device.
* Handling packets longer than MAX-TSO
In case a PMD is requested to send a TSO packet which is longer than
MAX-TSO, the PMD send routine should return with an error.
A different approach that can be used in the future is to apply GSO
to those packets using the GSO lib in DPDK.
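To make the MAX-TSO options above concrete, here is a minimal sketch of the three checks involved. All names are hypothetical: SKETCH_MAX_TSO stands in for the proposed MAX_TSO constant (its value is illustrative only), the dev_max_tso argument stands in for the proposed max_tso field, which does not exist in rte_eth_dev_info today, and the -1 return is just a placeholder for whatever error the send routine would report.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical constant standing in for the proposed MAX_TSO in
 * rte_ethdev.h; the value is illustrative only. */
#define SKETCH_MAX_TSO 65536u

/* Fallback proposal: advertise TSO only if the PMD can meet the constant. */
static bool
sketch_pmd_advertises_tso(uint32_t pmd_max_tso)
{
	return pmd_max_tso >= SKETCH_MAX_TSO;
}

/* Preferred proposal: the application clamps its TSO size to the value the
 * PMD would report in the (proposed, not yet existing) max_tso field. */
static uint32_t
sketch_app_tso_size(uint32_t wanted, uint32_t dev_max_tso)
{
	return wanted < dev_max_tso ? wanted : dev_max_tso;
}

/* Send-side check: reject a TSO packet that is longer than MAX-TSO. */
static int
sketch_check_tso_len(uint32_t pkt_len, uint32_t max_tso)
{
	return pkt_len > max_tso ? -1 : 0;
}
```

With a per-device field the application adapts to each port; with the constant, a device that falls short simply does not advertise TSO at all.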
I am interested in general design comments and concerns listed above.
Signed-off-by: Moti Haimovsky <motih@mellanox.com>
--
1.8.3.1