DPDK patches and discussions
 help / color / mirror / Atom feed
From: Ferruh Yigit <ferruh.yigit@intel.com>
To: "Wiles, Keith" <keith.wiles@intel.com>,
	Raslan Darawsheh <rasland@mellanox.com>
Cc: Thomas Monjalon <thomas@monjalon.net>,
	"dev@dpdk.org" <dev@dpdk.org>,
	Shahaf Shuler <shahafs@mellanox.com>,
	Ori Katz <orik@mellanox.com>
Subject: Re: [dpdk-dev] [PATCH v3 1/2] net/tap: change queue fd to be pointers to process private
Date: Wed, 3 Oct 2018 18:27:08 +0100	[thread overview]
Message-ID: <6d2b3acd-76fb-8931-b06d-a583ccdeb43d@intel.com> (raw)
In-Reply-To: <EF7090CA-F852-4162-9DA3-97A5037D0C5F@intel.com>

On 10/2/2018 1:58 PM, Wiles, Keith wrote:
> 
> 
>> On Oct 2, 2018, at 5:30 AM, Raslan Darawsheh <rasland@mellanox.com> wrote:
>>
>> Hi,
>>
>> what I'm really doing is simply do some private array for all the fd's that each process will allocate it separately which will allow that each process will be 
>> able to access the fd's for the queues in order not to overwrite the shared ones and it's working for me this way.
>>
>> Now coming to your comment I'm not sure I fully understand it but, what you are suggesting is to create an array which will be accessed by the process_id to store these fd's in it.
>> As far as I know we don't have something as process_id in dpdk we only have the system process id which is not relevant for the array of fd's.
>>
>> Please correct me if I'm wrong, 
>> I think this way we'll be limiting the number of secondary processes to number of queues by tap.
>> Meanwhile, in my solution we don't have such limitation.
> 
> I did not explain myself very well in the previous email let me try again.
> 
> Right now you have a pointer to an allocated piece of memory holding the FD’s  struct pmd_process_private *process_private; if we converted the pointer to an array of pmd_process_private structures using RTE_PMD_TAP_MAX_QUEUES as an index.
> 
> struct pmd_process_private {
> 	int rx_fd;
> 	int tx_fd;
> };
> 
> static struct pmd_process_private private_process[RTE_PMD_TAP_MAX_QUEUES]; /* each process has its own array */
> 
> struct rx_queue *rxq = pmd->dev->data->rx_queues[i];
> 
> struct rx_queue {
> 	struct pmd_process_private *priv;	// rxq->priv = &private_process[rx_queue_idx];
> . . .
> };
> 
> rxq->priv->rx_fd; // instead of the *rxq->fd seems shorter but less clear to me
> 
> Anyway my email compiler is not very good and maybe this will not work.

This should work, it is another kind of redirection.

But I think there are other problems, current implementation seems broken, I
will comment more detailed to the patch.
Briefly, rx_pkt_burst/tx_pkt_burst are not in eth_dev->data, but eth_dev because
they need to be private to process (can't share function pointers across
processes), this is the same problem.

> 
> We can leave the current design in place for now, till I have time to really look at doing a PoC for my suggestion.
> 
>>
>> Kindest regards,
>> Raslan Darawsheh
>>
>> -----Original Message-----
>> From: Wiles, Keith <keith.wiles@intel.com> 
>> Sent: Thursday, September 27, 2018 4:18 PM
>> To: Raslan Darawsheh <rasland@mellanox.com>
>> Cc: Thomas Monjalon <thomas@monjalon.net>; dev@dpdk.org; Shahaf Shuler <shahafs@mellanox.com>; Ori Katz <orik@mellanox.com>
>> Subject: Re: [PATCH v3 1/2] net/tap: change queue fd to be pointers to process private
>>
>>
>>
>>> On Sep 27, 2018, at 6:19 AM, Raslan Darawsheh <rasland@mellanox.com> wrote:
>>>
>>> change the fds for the queues to be pointers and add new process 
>>> private structure and make the queue fds point to it.
>>>
>>> Signed-off-by: Raslan Darawsheh <rasland@mellanox.com>
>>> ---
>>> drivers/net/tap/rte_eth_tap.c | 63 
>>> ++++++++++++++++++++++++-------------------
>>> drivers/net/tap/rte_eth_tap.h |  9 +++++--
>>> drivers/net/tap/tap_intr.c    |  4 +--
>>> 3 files changed, 44 insertions(+), 32 deletions(-)
>>>
>>> diff --git a/drivers/net/tap/rte_eth_tap.c 
>>> b/drivers/net/tap/rte_eth_tap.c index ad5ae98..8cc4552 100644
>>> --- a/drivers/net/tap/rte_eth_tap.c
>>> +++ b/drivers/net/tap/rte_eth_tap.c
>>> @@ -64,6 +64,7 @@
>>>
>>> static struct rte_vdev_driver pmd_tap_drv; static struct 
>>> rte_vdev_driver pmd_tun_drv;
>>> +static struct pmd_process_private *process_private;
>>
>> Maybe I do not see some minor point for making fd a pointer to fd when we could have an array of process_private[RTE_PMD_TAP_MAX_QUEUES] instead of a pointer type here. Then we do not need to allocate the memory each PMD and they would still have a private copy. Remove the array of rx/tx fds in the structure. This way it appears we can remove the code below that is making fd a pointer to fd. It just seems overly complex to me at the cost of a few more bytes of memory.
>>
>> This would remove int fd; from the structure and add a pointer to the pid_process_private instead, which is private by default.
>>
>> Did I miss some detail here that makes my comment wrong?
>>
>>>
>>> static const char *valid_arguments[] = {
>>> 	ETH_TAP_IFACE_ARG,
>>> @@ -331,7 +332,7 @@ pmd_rx_burst(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
>>> 		uint16_t data_off = rte_pktmbuf_headroom(mbuf);
>>> 		int len;
>>>
>>> -		len = readv(rxq->fd, *rxq->iovecs,
>>> +		len = readv(*rxq->fd, *rxq->iovecs,
>>> 			    1 +
>>> 			    (rxq->rxmode->offloads & DEV_RX_OFFLOAD_SCATTER ?
>>> 			     rxq->nb_rx_desc : 1));
>>> @@ -595,7 +596,7 @@ tap_write_mbufs(struct tx_queue *txq, uint16_t num_mbufs,
>>> 			tap_tx_l4_cksum(l4_cksum, l4_phdr_cksum, l4_raw_cksum);
>>>
>>> 		/* copy the tx frame data */
>>> -		n = writev(txq->fd, iovecs, j);
>>> +		n = writev(*txq->fd, iovecs, j);
>>> 		if (n <= 0)
>>> 			break;
>>> 		(*num_packets)++;
>>> @@ -976,13 +977,13 @@ tap_dev_close(struct rte_eth_dev *dev)
>>> 	tap_flow_implicit_flush(internals, NULL);
>>>
>>> 	for (i = 0; i < RTE_PMD_TAP_MAX_QUEUES; i++) {
>>> -		if (internals->rxq[i].fd != -1) {
>>> -			close(internals->rxq[i].fd);
>>> -			internals->rxq[i].fd = -1;
>>> +		if (*internals->rxq[i].fd != -1) {
>>> +			close(*internals->rxq[i].fd);
>>> +			*internals->rxq[i].fd = -1;
>>> 		}
>>> -		if (internals->txq[i].fd != -1) {
>>> -			close(internals->txq[i].fd);
>>> -			internals->txq[i].fd = -1;
>>> +		if (*internals->txq[i].fd != -1) {
>>> +			close(*internals->txq[i].fd);
>>> +			*internals->txq[i].fd = -1;
>>> 		}
>>> 	}
>>>
>>> @@ -1007,9 +1008,9 @@ tap_rx_queue_release(void *queue) {
>>> 	struct rx_queue *rxq = queue;
>>>
>>> -	if (rxq && (rxq->fd > 0)) {
>>> -		close(rxq->fd);
>>> -		rxq->fd = -1;
>>> +	if (rxq && rxq->fd && (*rxq->fd > 0)) {
>>> +		close(*rxq->fd);
>>> +		*rxq->fd = -1;
>>> 		rte_pktmbuf_free(rxq->pool);
>>> 		rte_free(rxq->iovecs);
>>> 		rxq->pool = NULL;
>>> @@ -1022,9 +1023,9 @@ tap_tx_queue_release(void *queue) {
>>> 	struct tx_queue *txq = queue;
>>>
>>> -	if (txq && (txq->fd > 0)) {
>>> -		close(txq->fd);
>>> -		txq->fd = -1;
>>> +	if (txq && txq->fd && (*txq->fd > 0)) {
>>> +		close(*txq->fd);
>>> +		*txq->fd = -1;
>>> 	}
>>> }
>>>
>>> @@ -1214,13 +1215,13 @@ tap_setup_queue(struct rte_eth_dev *dev,
>>> 	struct rte_gso_ctx *gso_ctx;
>>>
>>> 	if (is_rx) {
>>> -		fd = &rx->fd;
>>> -		other_fd = &tx->fd;
>>> +		fd = rx->fd;
>>> +		other_fd = tx->fd;
>>> 		dir = "rx";
>>> 		gso_ctx = NULL;
>>> 	} else {
>>> -		fd = &tx->fd;
>>> -		other_fd = &rx->fd;
>>> +		fd = tx->fd;
>>> +		other_fd = rx->fd;
>>> 		dir = "tx";
>>> 		gso_ctx = &tx->gso_ctx;
>>> 	}
>>> @@ -1331,7 +1332,7 @@ tap_rx_queue_setup(struct rte_eth_dev *dev,
>>> 	}
>>>
>>> 	TAP_LOG(DEBUG, "  RX TUNTAP device name %s, qid %d on fd %d",
>>> -		internals->name, rx_queue_id, internals->rxq[rx_queue_id].fd);
>>> +		internals->name, rx_queue_id, *internals->rxq[rx_queue_id].fd);
>>>
>>> 	return 0;
>>>
>>> @@ -1371,7 +1372,7 @@ tap_tx_queue_setup(struct rte_eth_dev *dev,
>>> 		return -1;
>>> 	TAP_LOG(DEBUG,
>>> 		"  TX TUNTAP device name %s, qid %d on fd %d csum %s",
>>> -		internals->name, tx_queue_id, internals->txq[tx_queue_id].fd,
>>> +		internals->name, tx_queue_id, *internals->txq[tx_queue_id].fd,
>>> 		txq->csum ? "on" : "off");
>>>
>>> 	return 0;
>>> @@ -1633,6 +1634,10 @@ eth_dev_tap_create(struct rte_vdev_device *vdev, char *tap_name,
>>> 		goto error_exit_nodev;
>>> 	}
>>>
>>> +	process_private = (struct pmd_process_private *)
>>> +		rte_zmalloc_socket(tap_name, sizeof(struct pmd_process_private),
>>> +			RTE_CACHE_LINE_SIZE, dev->device->numa_node);
>>> +
>>> 	pmd = dev->data->dev_private;
>>> 	pmd->dev = dev;
>>> 	snprintf(pmd->name, sizeof(pmd->name), "%s", tap_name); @@ -1669,8 
>>> +1674,10 @@ eth_dev_tap_create(struct rte_vdev_device *vdev, char *tap_name,
>>> 	/* Presetup the fds to -1 as being not valid */
>>> 	pmd->ka_fd = -1;
>>> 	for (i = 0; i < RTE_PMD_TAP_MAX_QUEUES; i++) {
>>> -		pmd->rxq[i].fd = -1;
>>> -		pmd->txq[i].fd = -1;
>>> +		process_private->rxq_fds[i] = -1;
>>> +		process_private->txq_fds[i] = -1;
>>> +		pmd->rxq[i].fd = &process_private->rxq_fds[i];
>>> +		pmd->txq[i].fd = &process_private->txq_fds[i];
>>> 	}
>>>
>>> 	if (pmd->type == ETH_TUNTAP_TYPE_TAP) { @@ -2089,13 +2096,13 @@ 
>>> rte_pmd_tap_remove(struct rte_vdev_device *dev)
>>> 		tap_nl_final(internals->nlsk_fd);
>>> 	}
>>> 	for (i = 0; i < RTE_PMD_TAP_MAX_QUEUES; i++) {
>>> -		if (internals->rxq[i].fd != -1) {
>>> -			close(internals->rxq[i].fd);
>>> -			internals->rxq[i].fd = -1;
>>> +		if (*internals->rxq[i].fd != -1) {
>>> +			close(*internals->rxq[i].fd);
>>> +			*internals->rxq[i].fd = -1;
>>> 		}
>>> -		if (internals->txq[i].fd != -1) {
>>> -			close(internals->txq[i].fd);
>>> -			internals->txq[i].fd = -1;
>>> +		if (*internals->txq[i].fd != -1) {
>>> +			close(*internals->txq[i].fd);
>>> +			*internals->txq[i].fd = -1;
>>> 		}
>>> 	}
>>>
>>> diff --git a/drivers/net/tap/rte_eth_tap.h 
>>> b/drivers/net/tap/rte_eth_tap.h index 44e2773..4146f5c 100644
>>> --- a/drivers/net/tap/rte_eth_tap.h
>>> +++ b/drivers/net/tap/rte_eth_tap.h
>>> @@ -46,7 +46,7 @@ struct rx_queue {
>>> 	struct rte_mempool *mp;         /* Mempool for RX packets */
>>> 	uint32_t trigger_seen;          /* Last seen Rx trigger value */
>>> 	uint16_t in_port;               /* Port ID */
>>> -	int fd;
>>> +	int *fd;
>>> 	struct pkt_stats stats;         /* Stats for this RX queue */
>>> 	uint16_t nb_rx_desc;            /* max number of mbufs available */
>>> 	struct rte_eth_rxmode *rxmode;  /* RX features */ @@ -56,7 +56,7 @@ 
>>> struct rx_queue { };
>>>
>>> struct tx_queue {
>>> -	int fd;
>>> +	int *fd;
>>> 	int type;                       /* Type field - TUN|TAP */
>>> 	uint16_t *mtu;                  /* Pointer to MTU from dev_data */
>>> 	uint16_t csum:1;                /* Enable checksum offloading */
>>> @@ -92,6 +92,11 @@ struct pmd_internals {
>>> 	int ka_fd;                        /* keep-alive file descriptor */
>>> };
>>>
>>> +struct pmd_process_private {
>>> +	int rxq_fds[RTE_PMD_TAP_MAX_QUEUES];
>>> +	int txq_fds[RTE_PMD_TAP_MAX_QUEUES]; };
>>> +
>>> /* tap_intr.c */
>>>
>>> int tap_rx_intr_vec_set(struct rte_eth_dev *dev, int set); diff --git 
>>> a/drivers/net/tap/tap_intr.c b/drivers/net/tap/tap_intr.c index 
>>> fc59018..a4b212d 100644
>>> --- a/drivers/net/tap/tap_intr.c
>>> +++ b/drivers/net/tap/tap_intr.c
>>> @@ -71,7 +71,7 @@ tap_rx_intr_vec_install(struct rte_eth_dev *dev)
>>> 		struct rx_queue *rxq = pmd->dev->data->rx_queues[i];
>>>
>>> 		/* Skip queues that cannot request interrupts. */
>>> -		if (!rxq || rxq->fd <= 0) {
>>> +		if (!rxq || !rxq->fd || *rxq->fd <= 0) {
>>> 			/* Use invalid intr_vec[] index to disable entry. */
>>> 			intr_handle->intr_vec[i] =
>>> 				RTE_INTR_VEC_RXTX_OFFSET +
>>> @@ -79,7 +79,7 @@ tap_rx_intr_vec_install(struct rte_eth_dev *dev)
>>> 			continue;
>>> 		}
>>> 		intr_handle->intr_vec[i] = RTE_INTR_VEC_RXTX_OFFSET + count;
>>> -		intr_handle->efds[count] = rxq->fd;
>>> +		intr_handle->efds[count] = *rxq->fd;
>>> 		count++;
>>> 	}
>>> 	if (!count)
>>> --
>>> 2.7.4
>>>
>>
>> Regards,
>> Keith
>>
> 
> Regards,
> Keith
> 

  reply	other threads:[~2018-10-03 17:27 UTC|newest]

Thread overview: 30+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-06-07 12:29 [dpdk-dev] [RFC] net/tap: add queues when attaching from secondary process Raslan Darawsheh
2018-06-07 19:09 ` Wiles, Keith
2018-06-07 23:24   ` Raslan Darawsheh
2018-06-08  2:52     ` Wiles, Keith
2018-06-12 12:46     ` Wiles, Keith
2018-06-12 13:21       ` Raslan Darawsheh
2018-07-20 10:57 ` [dpdk-dev] [PATCH v2] " Thomas Monjalon
2018-09-27 11:19   ` [dpdk-dev] [PATCH v3 1/2] net/tap: change queue fd to be pointers to process private Raslan Darawsheh
2018-09-27 11:19     ` [dpdk-dev] [PATCH v3 2/2] net/tap: add queues when attaching from secondary process Raslan Darawsheh
2018-09-27 13:04       ` Wiles, Keith
2018-09-27 18:53         ` Thomas Monjalon
2018-10-02 10:34       ` [dpdk-dev] [PATCH v4 1/2] net/tap: change queue fd to be pointers to process private Raslan Darawsheh
2018-10-02 10:34         ` [dpdk-dev] [PATCH v4 2/2] net/tap: add queues when attaching from secondary process Raslan Darawsheh
2018-10-02 10:41           ` Thomas Monjalon
2018-10-02 10:50             ` Raslan Darawsheh
2018-10-02 11:38               ` Thomas Monjalon
2018-10-03 16:28                 ` Ferruh Yigit
2018-10-02 10:43           ` Thomas Monjalon
2018-10-03 16:31             ` Ferruh Yigit
2018-10-03 18:00           ` Ferruh Yigit
2018-10-03 17:59         ` [dpdk-dev] [PATCH v4 1/2] net/tap: change queue fd to be pointers to process private Ferruh Yigit
2018-09-27 13:17     ` [dpdk-dev] [PATCH v3 " Wiles, Keith
2018-10-02 10:30       ` Raslan Darawsheh
2018-10-02 12:58         ` Wiles, Keith
2018-10-03 17:27           ` Ferruh Yigit [this message]
2018-07-20 11:15 ` [dpdk-dev] [PATCH v3] net/tap: add queues when attaching from secondary process Thomas Monjalon
2018-07-20 15:35   ` Wiles, Keith
2018-07-20 21:51     ` Thomas Monjalon
2018-07-21 13:44       ` Wiles, Keith
2018-08-23 11:51   ` Ferruh Yigit

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6d2b3acd-76fb-8931-b06d-a583ccdeb43d@intel.com \
    --to=ferruh.yigit@intel.com \
    --cc=dev@dpdk.org \
    --cc=keith.wiles@intel.com \
    --cc=orik@mellanox.com \
    --cc=rasland@mellanox.com \
    --cc=shahafs@mellanox.com \
    --cc=thomas@monjalon.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).