From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-pf1-f176.google.com (mail-pf1-f176.google.com [209.85.210.176]) by dpdk.org (Postfix) with ESMTP id 58D1F1B4B0 for ; Sun, 23 Dec 2018 13:43:33 +0100 (CET) Received: by mail-pf1-f176.google.com with SMTP id c73so4721288pfe.13 for ; Sun, 23 Dec 2018 04:43:32 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance-com.20150623.gappssmtp.com; s=20150623; h=user-agent:date:subject:from:to:message-id:thread-topic :mime-version:content-transfer-encoding; bh=ASHLrIL/XxFGQEb84Qe65F1KvcxW6v9rL6XhHSTBqfk=; b=kVd1SfyQ485ma666BmzCZ/6/Gh/MJMZ2YX9iSV8QeYhG5qXXq4SLW0FfOiHOKmiAe3 kDk8qTR/xvFHUKCwIB0q+qqsKe3BOHszgk72wHHn2Y07UezLy4MKsiWHpzqxI1FGIlDp vLYb3U8FZFIIBrVs8kJp1ydFrsIe8V+l8b1VDBwBjm4J86mwmjwDN1S9nfWpehNXBSKl 5DB/q1LEHcqsFoo64AVCkZhENtVZvs0Clbj/2RCIVZeR+5rS89+M2+0zQubJsw4pwE7b yBKGNQ4li7hiVn9DOPvJBsV0E1xN05pFvDZw7ds3WimvTsslae+5LAm7JwJgxF2asnMF EWQw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:user-agent:date:subject:from:to:message-id :thread-topic:mime-version:content-transfer-encoding; bh=ASHLrIL/XxFGQEb84Qe65F1KvcxW6v9rL6XhHSTBqfk=; b=VYmB+b8IwHRah66YH4pFUbqKQFeXUqScmpwmAR9bhCff4dljBCIxGacA0Has5xUVnu xw5Sg08MJ+Ze/Yv/q0aYYxZITx6L9cpO26PEo/f5hZsytO25vpY12irklOd+JAUQKMbm /XQD5qwzOcsN46wx2vEEuH7NhztKX6EYaNNGzMBqGc9x/bqZFFKzRk9U757YHyDNoZtg X7dDl4z6LlLGiWttMCL+0aylGErwJjsIBMQcege1saGX+ovVcg9uBag5VQlnaaj1uRPM RFMQDiCTs2K+rxUsV/T9qe53MEAkEdGnrPJtxh9qr0VPh9gHk3spQZSCe2blbcqZ1Qvr 8D/A== X-Gm-Message-State: AJcUukclhuIdCtTllxjii9M63kdbEAx8iLz38jsDZEzYrJgrkOUfcHWE cxvuRKGzlbu56n9VNqxo1VNUMyqc725y+g== X-Google-Smtp-Source: ALg8bN4pu8lrxGg79ivP6ptfLB30cQpGb+Wn2Rec85V2099YcHqQLtQOyTiy0G0D++XLbjhdrQPdPw== X-Received: by 2002:a63:e101:: with SMTP id z1mr9060318pgh.310.1545569011861; Sun, 23 Dec 2018 04:43:31 -0800 (PST) Received: from [10.2.196.1] ([61.120.150.67]) by smtp.gmail.com with ESMTPSA id o189sm44669816pfg.117.2018.12.23.04.43.30 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 23 Dec 2018 04:43:31 -0800 (PST) User-Agent: Microsoft-MacOutlook/10.14.0.181208 Date: Sun, 23 Dec 2018 20:43:22 +0800 From: zhuangyan To: Message-ID: <84917CEF-AC2A-4938-97A1-E9303A4CBA2D@bytedance.com> Thread-Topic: vmovdqa64 instruction: how to be disabled in skylake CPU Mime-version: 1.0 Content-type: text/plain; charset="UTF-8" Content-transfer-encoding: quoted-printable Subject: [dpdk-dev] vmovdqa64 instruction: how to be disabled in skylake CPU X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 23 Dec 2018 12:43:33 -0000 Hello, Is there any way to disable the generation of =E2=80=9Cvmovdqa64=E2=80=9D instruction d= uring the compilation of dpdk library on Intel Skylake CPU? My dev VM uses Intel Skylake CPU while my test machine uses Intel uses Inte= l(R) Xeon(R) CPU E5-2630 v4 that is some sort of Intel Broadwell CPU. And my generated executable file of dpdk application contains "vmovdqa64" i= nstruction that is NOT supported in Intel Broadwell CPU so my dpdk applicati= on fails to be run. I tried MACHINE_CFLAGS=3D -march=3Dnative/broadwell/ivybridge within dpdk-stabl= e-17.11.2/mk/machine/native/rte.vars.mk, however it does not work. Additionally, my gcc version is gcc (Debian 4.9.2-10+deb8u1) 4.9.2. - My dev VM zhuangyan@devbox-zhuangyan:~/my-dpdk$ lscpu Architecture:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 x86_64 CPU op-mode(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32-bit, 64-bit Byte Order:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 Little Endian CPU(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4 On-line CPU(s) list:=C2=A0=C2=A0 0-3 Thread(s) per core:=C2=A0=C2=A0=C2=A0 1 Core(s) per socket:=C2=A0=C2=A0=C2=A0 1 Socket(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4 NUMA node(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 1 Vendor ID:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 GenuineIntel CPU family:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 6 Model:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 85 Model name:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 Intel Xeon Processor (Skylake, IBRS) Stepping:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4 CPU MHz: =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A02294.608 BogoMIPS:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4589.21 Hypervisor vendor:=C2=A0=C2=A0=C2=A0=C2=A0 KVM Virtualization type:=C2=A0=C2=A0 full L1d cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32K L1i cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32K L2 cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4096K L3 cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 16384K NUMA node0 CPU(s):=C2=A0=C2=A0=C2=A0=C2=A0 0-3 - My test VM root@n10-023-013:~# lscpu =20 Architecture:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 x86_64 CPU op-mode(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32-bit, 64-bit Byte Order:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 Little Endian CPU(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 40 On-line CPU(s) list:=C2=A0=C2=A0 0-39 Thread(s) per core:=C2=A0=C2=A0=C2=A0 2 Core(s) per socket: =C2=A0=C2=A0=C2=A010 Socket(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 2 NUMA node(s):=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 2 Vendor ID:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 GenuineIntel CPU family:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 6 Model:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 79 Model name:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20GHz Stepping:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 1 CPU MHz:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 2401.281 CPU max MHz:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 3100.0000 CPU min MHz:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 1200.0000 BogoMIPS:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 4401.20 Virtualization:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 VT-x L1d cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32K L1i cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 32K L2 cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 256K L3 cache:=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0 25600K NUMA node0 CPU(s):=C2=A0=C2=A0=C2=A0=C2=A0 0-9,20-29 NUMA node1 CPU(s):=C2=A0=C2=A0=C2=A0=C2=A0 10-19,30-39 =C2=A0 zhuangyan@devbox-zhuangyan:~/ttgw-3/bin$ grep -rn "vmovdqa64" ttgw-obj 346885: 5463ab: 62 f1 fd 48 6f 3d 4b vmovdqa64 0x9bd4b(%rip),%zmm7 = # 5e2100 <__func__.24871+0x60> 346889: 5463c2: 62 f1 fd 48 6f 15 f4 vmovdqa64 0x9bcf4(%rip),%zmm2 = # 5e20c0 <__func__.24871+0x20> 346891: 5463cc: 62 f1 fd 48 6f 25 6a vmovdqa64 0x9bd6a(%rip),%zmm4 = # 5e2140 <__func__.24871+0xa0> 346893: 5463d6: 62 f1 fd 48 6f 1d a0 vmovdqa64 0x9bda0(%rip),%zmm3 = # 5e2180 <__func__.24871+0xe0> 346896: 5463e6: 62 f1 fd 48 6f f4 vmovdqa64 %zmm4,%zmm6 346898: 5463f2: 62 e1 fd 48 6f c3 vmovdqa64 %zmm3,%zmm16 346899: 5463f8: 62 e1 fd 48 6f cb vmovdqa64 %zmm3,%zmm17 346906: 546424: 62 f1 fd 48 6f cc vmovdqa64 %zmm4,%zmm1 . . . =20 =20 Regards, Yan(Andrew) Zhuang zhuangyan@bytedance.com =20 =20