From: "Tan, Jianfeng"
To: "Wiles, Keith"
Cc: Jianjian Huo, "dev@dpdk.org"
Date: Tue, 14 Nov 2017 08:03:59 +0800
Subject: Re: [dpdk-dev] DPDK memory error check and offline bad pages
References: <8585190C-A984-4563-BF89-97CEFE6B87AB@alibaba-inc.com> <4F0B926E-5BC4-43C4-8DD6-FCEF4B811795@intel.com>
In-Reply-To: <4F0B926E-5BC4-43C4-8DD6-FCEF4B811795@intel.com>
List-Id: DPDK patches and discussions

On 11/14/2017 5:40 AM, Wiles, Keith wrote:
>
>> On Nov 12, 2017, at 11:08 PM, Tan, Jianfeng wrote:
>>
>> Hi Jianjian,
>>
>>> -----Original Message-----
>>> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Jianjian Huo
>>> Sent: Wednesday, November 8, 2017 5:13 AM
>>> To: dev@dpdk.org
>>> Subject: [dpdk-dev] DPDK memory error check and offline bad pages
>>>
>>> Hi dpdk developers,
>>>
>>> I have a question regarding how DPDK memory module treats memory
>>> errors.
>> You mean hardware error which cannot be fixed by ECC?
>>
>>> In Linux kernel, it has mechanism (mcelog and EDAC) to monitor the memory
>>> controller and report correctable/uncorrectable memory errors. Using some
>>> configurations, if memory errors exceed threshold, system can offline bad
>>> memory pages and avoid applications to access/crash.
>> DPDK app is just one of applications. Are there any framework to notify such error to applications?
>> To notify is the first thing, to recover is another thing which takes more effort.
>>
>>> Do we have similar mechanism in DPDK?
>> No, as far as I know.
>
> Because DPDK runs as a normal user space application in Linux then the current features in the Linux Kernel can be used correct?

I suppose so, but AFAIK we have not explicitly leveraged any of those
features yet. Relying on them implicitly tends to be problematic, because
DPDK translates virtual addresses to physical addresses only once, at the
very beginning.

Thanks,
Jianfeng

>
>> Thanks,
>> Jianfeng
>>
>>> Thanks,
>>> Jianjian
>>>
> Regards,
> Keith
>
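
To illustrate the point about one-time address translation: below is a
minimal sketch, not DPDK code, of how a user-space process on Linux can look
up the physical address behind a virtual address through /proc/self/pagemap,
and how a translation cached at startup could be compared against the
kernel's current view. The helper names (entry_to_phys, virt_to_phys_now,
translation_is_stale) are hypothetical and exist only for this example. It
assumes the documented pagemap entry layout (bit 63 = present, bits 0-54 =
page frame number). On recent kernels the frame number reads as zero without
CAP_SYS_ADMIN, so the sketch treats that case as "unknown".

```c
#ifndef _XOPEN_SOURCE
#define _XOPEN_SOURCE 700 /* for pread() */
#endif
#include <assert.h>
#include <fcntl.h>
#include <stdint.h>
#include <unistd.h>

/* One 64-bit /proc/self/pagemap entry, per the Linux kernel documentation:
 * bit 63 = page present in RAM, bits 0-54 = page frame number (PFN). */
#define PAGEMAP_PRESENT   (UINT64_C(1) << 63)
#define PAGEMAP_PFN_MASK  ((UINT64_C(1) << 55) - 1)

/* Returned when no physical address can be determined. */
#define PHYS_ADDR_UNKNOWN UINT64_MAX

/* Hypothetical helper: decode one pagemap entry into a physical address. */
static uint64_t
entry_to_phys(uint64_t entry, uint64_t page_size, uint64_t offset_in_page)
{
    assert(page_size > 0 && offset_in_page < page_size);

    if (!(entry & PAGEMAP_PRESENT))
        return PHYS_ADDR_UNKNOWN;

    uint64_t pfn = entry & PAGEMAP_PFN_MASK;
    if (pfn == 0) /* PFN hidden from unprivileged readers */
        return PHYS_ADDR_UNKNOWN;

    return pfn * page_size + offset_in_page;
}

/* Hypothetical helper: ask the kernel for the current physical address. */
static uint64_t
virt_to_phys_now(const void *vaddr)
{
    long ps = sysconf(_SC_PAGESIZE);
    if (ps <= 0)
        return PHYS_ADDR_UNKNOWN;

    uint64_t page_size = (uint64_t)ps;
    uint64_t va = (uint64_t)(uintptr_t)vaddr;
    uint64_t entry = 0;

    int fd = open("/proc/self/pagemap", O_RDONLY);
    if (fd < 0)
        return PHYS_ADDR_UNKNOWN;

    /* The file holds one 8-byte entry per virtual page. */
    off_t off = (off_t)((va / page_size) * sizeof(entry));
    ssize_t n = pread(fd, &entry, sizeof(entry), off);
    close(fd);
    if (n != (ssize_t)sizeof(entry))
        return PHYS_ADDR_UNKNOWN;

    return entry_to_phys(entry, page_size, va % page_size);
}

/* Hypothetical check: returns 1 only if the kernel reports a physical
 * address that differs from the one cached earlier, 0 otherwise. */
static int
translation_is_stale(const void *vaddr, uint64_t cached_phys)
{
    uint64_t now = virt_to_phys_now(vaddr);
    return now != PHYS_ADDR_UNKNOWN && now != cached_phys;
}
```

An application that caches the result of virt_to_phys_now() at startup and
never calls it again would not see a later change to the mapping, which is
the situation described above. Whether and when the kernel actually changes
the mapping of a hugepage after a memory error depends on the kernel
configuration and is not covered by this sketch.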