From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mga05.intel.com (mga05.intel.com [192.55.52.43]) by dpdk.org (Postfix) with ESMTP id 56EB8FFA for ; Tue, 29 Aug 2017 18:53:07 +0200 (CEST) Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga105.fm.intel.com with ESMTP; 29 Aug 2017 09:53:06 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.41,445,1498546800"; d="scan'208";a="123714750" Received: from fyigit-mobl1.ger.corp.intel.com (HELO [10.237.220.57]) ([10.237.220.57]) by orsmga004.jf.intel.com with ESMTP; 29 Aug 2017 09:53:04 -0700 To: Shahaf Shuler , nelio.laranjeiro@6wind.com, adrien.mazarguil@6wind.com Cc: dev@dpdk.org References: <1503301622-14220-1-git-send-email-sagi@grimberg.me> From: Ferruh Yigit Message-ID: Date: Tue, 29 Aug 2017 17:53:03 +0100 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.3.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Subject: Re: [dpdk-dev] [PATCH v2 0/2] mlx5 high latency observed on send operations X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 29 Aug 2017 16:53:07 -0000 On 8/27/2017 7:47 AM, Shahaf Shuler wrote: > from sagi@grimberg.me: > > When measuring latency when running a latency critical workload on mlx5 pmd drivers we noticed high latency can occur due to delayed doorbell record update flush. > > This can be reproduced using the simple program [1] against testpmd macswap fwd mode. This utility sends a raw ethernet frame to the dpdk port and measures the time between send and the received mirrored frame. > > This patchset guarantees immediate doorbell updates visibility by making the doorbell a non-cacheble memory. > In addition, we relax the memory barrier for dma-able memory. > > Without this fix the tsc delta was 3550760-5993019 cycles (which translates to 2-6 ms on 1.7 GHz processor). > > With the fix applied the tsc delta reduced to 17740-29663 (wich translates to 9-17 us). > > on v2: > * replace compiler barrier with rte_io_wmb. > > Shahaf Shuler (2): > net/mlx5: replace memory barrier type > net/mlx5: don't map doorbell register to write combining Series applied to dpdk-next-net/master, thanks.