From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <jianfeng.tan@intel.com>
Received: from mga03.intel.com (mga03.intel.com [134.134.136.65])
 by dpdk.org (Postfix) with ESMTP id 2678137AA
 for <dev@dpdk.org>; Tue,  8 Mar 2016 18:12:14 +0100 (CET)
Received: from fmsmga004.fm.intel.com ([10.253.24.48])
 by orsmga103.jf.intel.com with ESMTP; 08 Mar 2016 09:11:22 -0800
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="5.22,557,1449561600"; d="scan'208";a="62229666"
Received: from ywang8-mobl1.ccr.corp.intel.com (HELO [10.255.31.199])
 ([10.255.31.199])
 by fmsmga004.fm.intel.com with ESMTP; 08 Mar 2016 09:11:18 -0800
To: "Ananyev, Konstantin" <konstantin.ananyev@intel.com>,
 "dev@dpdk.org" <dev@dpdk.org>
References: <1456795416-118189-1-git-send-email-jianfeng.tan@intel.com>
 <1457080687-114113-1-git-send-email-jianfeng.tan@intel.com>
 <2601191342CEEE43887BDE71AB97725836B185E5@irsmsx105.ger.corp.intel.com>
From: "Tan, Jianfeng" <jianfeng.tan@intel.com>
Message-ID: <56DF07B6.4010808@intel.com>
Date: Wed, 9 Mar 2016 01:11:18 +0800
User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:38.0) Gecko/20100101
 Thunderbird/38.6.0
MIME-Version: 1.0
In-Reply-To: <2601191342CEEE43887BDE71AB97725836B185E5@irsmsx105.ger.corp.intel.com>
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 7bit
Subject: Re: [dpdk-dev] [PATCH v2] examples/l3fwd: fix using packet type
	blindly
X-BeenThere: dev@dpdk.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: patches and discussions about DPDK <dev.dpdk.org>
List-Unsubscribe: <http://dpdk.org/ml/options/dev>,
 <mailto:dev-request@dpdk.org?subject=unsubscribe>
List-Archive: <http://dpdk.org/ml/archives/dev/>
List-Post: <mailto:dev@dpdk.org>
List-Help: <mailto:dev-request@dpdk.org?subject=help>
List-Subscribe: <http://dpdk.org/ml/listinfo/dev>,
 <mailto:dev-request@dpdk.org?subject=subscribe>
X-List-Received-Date: Tue, 08 Mar 2016 17:12:14 -0000

Hi Konstantin,

On 3/8/2016 2:51 AM, Ananyev, Konstantin wrote:
> Hi Jianfeng,
>
>> +/* Requirements:
>> + * 1. IP packets without extension;
>> + * 2. L4 payload should be either TCP or UDP.
>> + */
>> +int
>> +em_check_ptype(int portid)
>> +{
>> +	int i, ret;
>> +	int ptype_l3_ipv4_ext = 0;
>> +	int ptype_l3_ipv6_ext = 0;
>> +	int ptype_l4_tcp = 0;
>> +	int ptype_l4_udp = 0;
>> +
>> +	ret = rte_eth_dev_get_ptype_info(portid,
>> +					 RTE_PTYPE_L3_MASK | RTE_PTYPE_L4_MASK,
>> +					 NULL, 0);
>> +	if (ret <= 0)
>> +		return 0;
>> +
>> +	uint32_t ptypes[ret];
>> +
>> +	ret = rte_eth_dev_get_ptype_info(portid,
>> +					 RTE_PTYPE_L3_MASK | RTE_PTYPE_L4_MASK,
>> +					 ptypes, ret);
>> +	for (i = 0; i < ret; ++i) {
>> +		switch (ptypes[i]) {
>> +		case RTE_PTYPE_L3_IPV4_EXT:
>> +			ptype_l3_ipv4_ext = 1;
>> +			break;
>> +		case RTE_PTYPE_L3_IPV6_EXT:
>> +			ptype_l3_ipv6_ext = 1;
>> +			break;
>> +		case RTE_PTYPE_L4_TCP:
>> +			ptype_l4_tcp = 1;
>> +			break;
>> +		case RTE_PTYPE_L4_UDP:
>> +			ptype_l4_udp = 1;
>> +			break;
>> +		}
>> +	}
>> +
>> +	if (ptype_l3_ipv4_ext == 0)
>> +		printf("port %d cannot parse RTE_PTYPE_L3_IPV4_EXT\n", portid);
>> +	if (ptype_l3_ipv6_ext == 0)
>> +		printf("port %d cannot parse RTE_PTYPE_L3_IPV6_EXT\n", portid);
>> +	if (!ptype_l3_ipv4_ext || !ptype_l3_ipv6_ext)
>> +		return 0;
>> +
>> +	if (ptype_l4_tcp == 0)
>> +		printf("port %d cannot parse RTE_PTYPE_L4_TCP\n", portid);
>> +	if (ptype_l4_udp == 0)
>> +		printf("port %d cannot parse RTE_PTYPE_L4_UDP\n", portid);
>> +	if (ptype_l4_tcp || ptype_l4_udp)
>> +		return 1;
> Should probably be: if (ptype_l4_tcp && ptype_l4_udp)
> ?

Oops, yes, we need to make it as a strict checking here.

>
>> +
>> +	return 0;
>> +}
>> +
>> +static inline void
>> +em_parse_ptype(struct rte_mbuf *m)
>> +{
>> +	struct ether_hdr *eth_hdr;
>> +	uint32_t packet_type = RTE_PTYPE_UNKNOWN;
>> +	uint16_t ethertype;
>> +	void *l3;
>> +	int hdr_len;
>> +	struct ipv4_hdr *ipv4_hdr;
>> +	struct ipv6_hdr *ipv6_hdr;
>> +
>> +	eth_hdr = rte_pktmbuf_mtod(m, struct ether_hdr *);
>> +	ethertype = rte_be_to_cpu_16(eth_hdr->ether_type);
> I think you can avoid bswap here, if use constants in BE format
> for comparison instead:
>   
> ethertype = eth_hdr->ether_type;
> switch (ethertype) {
> case (rte_cpu_to_be_16(ETHER_TYPE_IPv4)):
> ...
>
> Same for lpm.

Great advice!

>
>> @@ -612,6 +622,14 @@ parse_args(int argc, char **argv)
>>   					return -1;
>>   				}
>>   			}
>> +
>> +			if (!strncmp(lgopts[option_index].name,
>> +				     CMD_LINE_OPT_PARSE_PTYPE,
>> +				     sizeof(CMD_LINE_OPT_PARSE_PTYPE))) {
>> +				printf("soft parse-ptype is enabled\n");
>> +				parse_ptype = 1;
>> +			}
>> +
>>   			break;
>>
>>   		default:
>> @@ -965,6 +983,40 @@ main(int argc, char **argv)
>>   			rte_eth_promiscuous_enable(portid);
>>   	}
>>
>> +	printf("\n");
>> +
>> +	for (lcore_id = 0; lcore_id < RTE_MAX_LCORE; lcore_id++) {
>> +		if (rte_lcore_is_enabled(lcore_id) == 0)
>> +			continue;
>> +		qconf = &lcore_conf[lcore_id];
>> +		for (queue = 0; queue < qconf->n_rx_queue; ++queue) {
>> +			portid = qconf->rx_queue_list[queue].port_id;
>> +			queueid = qconf->rx_queue_list[queue].queue_id;
>> +
>> +			ret = l3fwd_lkp.check_ptype(portid);
>> +			if (ret) {
>> +				printf("Port %d: built-in packet type info\n",
>> +				       portid);
>> +				continue;
>> +			}
>
> I thought that if parse_ptype != 0 we always want to use a SW parsing,
> no matter does HW support it or not, no?

Thanks for pointing this out. It's actually worth discussion. It depends 
on how --parse-ptype option means:
a. try best to use hw way, if fails, turns to sw way;
b. use sw way no matter hw way is workable.

Way a makes it portable to use the same cmd line of l3fwd for all devices.
Way b makes it more clear for this option.

I am actually more inclined to way b (your suggested way).

> So user can measure what is the performance difference between HW and SW versions?
> Another thing, that I forgot to ask earlier - with ptype querying in place, would we like to set:
> RTE_I40E_INC_VECTOR=y
> by default?

Thoughtful! I think you are right, and I'll double check with Tao Zhe.

Thanks,
Jianfeng

>
> Konstantin