From: "Jianjian Huo"
Date: Mon, 13 Nov 2017 13:52:54 +0800
Subject: Re: [dpdk-dev] DPDK memory error check and offline bad pages
Message-ID: <2A950137-2E48-41B5-AFD3-5E38E4AAD7D5@alibaba-inc.com>
In-Reply-To: <8585190C-A984-4563-BF89-97CEFE6B87AB@alibaba-inc.com>

Does anyone have any idea on this? I can't believe DPDK doesn't support such an important feature. This is going to be a showstopper for a real production system.

-Jianjian

On 11/7/17, 1:13 PM, "Jianjian Huo" wrote:

    Hi dpdk developers,

    I have a question about how the DPDK memory module treats memory errors.

    The Linux kernel has mechanisms (mcelog and EDAC) to monitor the memory controller and report correctable/uncorrectable memory errors. With some configurations, if memory errors exceed a threshold, the system can offline bad memory pages so that applications do not access them and crash. Do we have a similar mechanism in DPDK?

    Thanks,
    Jianjian
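
To make the kernel-side behaviour described in the quoted question concrete, here is a minimal sketch in C of a threshold check against the EDAC error counters. It is not a DPDK API and says nothing about what DPDK itself provides. It assumes the EDAC sysfs layout in which each memory controller directory `mcN` exposes `ce_count` (corrected errors) and `ue_count` (uncorrected errors); the function names, the `edac_root` parameter, and the threshold policy are hypothetical and chosen only for illustration.

```c
#include <stdio.h>

/* Read one unsigned decimal counter from a sysfs-style text file.
 * Returns 0 on success, -1 if the file cannot be opened or parsed. */
int read_counter(const char *path, unsigned long long *out)
{
    FILE *f = fopen(path, "r");
    if (f == NULL)
        return -1;
    int parsed = (fscanf(f, "%llu", out) == 1);
    fclose(f);
    return parsed ? 0 : -1;
}

/* Hypothetical policy: flag a controller on any uncorrected error, or once
 * the corrected-error count reaches ce_threshold. */
int needs_attention(unsigned long long ce, unsigned long long ue,
                    unsigned long long ce_threshold)
{
    return ue > 0 || ce >= ce_threshold;
}

/* Check controller number mc under edac_root (assumed layout:
 * <edac_root>/mc<N>/ce_count and <edac_root>/mc<N>/ue_count).
 * Returns 1 if the policy flags it, 0 if not, -1 if counters are unreadable. */
int check_memory_controller(const char *edac_root, int mc,
                            unsigned long long ce_threshold)
{
    char path[256];
    unsigned long long ce = 0, ue = 0;

    snprintf(path, sizeof(path), "%s/mc%d/ce_count", edac_root, mc);
    if (read_counter(path, &ce) != 0)
        return -1;

    snprintf(path, sizeof(path), "%s/mc%d/ue_count", edac_root, mc);
    if (read_counter(path, &ue) != 0)
        return -1;

    return needs_attention(ce, ue, ce_threshold);
}
```

On a system with an EDAC driver loaded, `edac_root` would typically be `/sys/devices/system/edac/mc`. The sketch only detects that a threshold was crossed; actually offlining a page is a separate kernel operation (for example via the soft-offline sysfs interface, where the kernel is built with memory-failure support), and whether that interacts safely with memory an application has already mapped is exactly the open question in this thread.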