From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-we0-f181.google.com (mail-we0-f181.google.com [74.125.82.181]) by dpdk.org (Postfix) with ESMTP id 495969A9B for ; Wed, 25 Feb 2015 11:47:18 +0100 (CET) Received: by wesk11 with SMTP id k11so2694052wes.11 for ; Wed, 25 Feb 2015 02:47:18 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:cc:subject:date:message-id:organization :user-agent:in-reply-to:references:mime-version :content-transfer-encoding:content-type; bh=tIxzH6udKxiGuqSAxSRuLDleNXM2UdijeiP1F9Ni3qo=; b=gOtN4eqc0t+jU32JSer8fgGrkpEoEenK16YRh4N6/zJEGraqeaYZHjIPCIetpN7TGl 6Mw2aFrN3t0zNE5ohJ6pH8y7lhWDO62KnyoVNUr//H7yUSus1d54tKS6z/VqGqWfxIFi jouuKST3WVVensv58YY/YVklrc/BqwHwCxbW9txoCh5q1O79mjXiFXOrxOqDRia22dYb KaIVWKhu2oBhfkuJKVBTEItWxiJK+KsUy8EyYY7RvcsoAva27KeExmmDkqRytyK5i22b qMCFjkX33+LZrZPQbAPoSid5fjfBSQULBE3cWuZIiUcGTWGw0toOIqCeqirrA+FwpMrt TxnA== X-Gm-Message-State: ALoCoQnKTzmicIRLp03/e4v1y+wZ9AqBbdmRXZaYRe0mkjcgV16rQEalBUYH94DPVhlWy8B4l6AQ X-Received: by 10.194.81.104 with SMTP id z8mr4811113wjx.45.1424861238099; Wed, 25 Feb 2015 02:47:18 -0800 (PST) Received: from xps13.localnet (136-92-190-109.dsl.ovh.fr. [109.190.92.136]) by mx.google.com with ESMTPSA id v6sm18330950wix.8.2015.02.25.02.47.16 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 25 Feb 2015 02:47:17 -0800 (PST) From: Thomas Monjalon To: "Wang, Zhihong" , konstantin.ananyev@intel.com Date: Wed, 25 Feb 2015 11:46:44 +0100 Message-ID: <2838998.kCC4edMq5Z@xps13> Organization: 6WIND User-Agent: KMail/4.14.4 (Linux/3.18.4-1-ARCH; KDE/4.14.4; x86_64; ; ) In-Reply-To: References: <1422499127-11689-1-git-send-email-zhihong.wang@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="us-ascii" Cc: dev@dpdk.org Subject: Re: [dpdk-dev] [PATCH v2 0/4] DPDK memcpy optimization X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 25 Feb 2015 10:47:18 -0000 > > This patch set optimizes memcpy for DPDK for both SSE and AVX platforms. > > It also extends memcpy test coverage with unaligned cases and more test > > points. > > > > Optimization techniques are summarized below: > > > > 1. Utilize full cache bandwidth > > > > 2. Enforce aligned stores > > > > 3. Apply load address alignment based on architecture features > > > > 4. Make load/store address available as early as possible > > > > 5. General optimization techniques like inlining, branch reducing, prefetch > > pattern access > > > > -------------- > > Changes in v2: > > > > 1. Reduced constant test cases in app/test/test_memcpy_perf.c for fast > > build > > > > 2. Modified macro definition for better code readability & safety > > > > Zhihong Wang (4): > > app/test: Disabled VTA for memcpy test in app/test/Makefile > > app/test: Removed unnecessary test cases in app/test/test_memcpy.c > > app/test: Extended test coverage in app/test/test_memcpy_perf.c > > lib/librte_eal: Optimized memcpy in arch/x86/rte_memcpy.h for both SSE > > and AVX platforms > > Acked-by: Pablo de Lara Applied, thanks for the great work! Note: we are still looking for a maintainer of x86 EAL.