From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mga05.intel.com (mga05.intel.com [192.55.52.43]) by dpdk.org (Postfix) with ESMTP id B1DA9200 for ; Mon, 16 Jul 2018 17:09:06 +0200 (CEST) X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga005.jf.intel.com ([10.7.209.41]) by fmsmga105.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 16 Jul 2018 08:09:05 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.51,361,1526367600"; d="scan'208";a="240694739" Received: from aburakov-mobl.ger.corp.intel.com (HELO [10.237.220.102]) ([10.237.220.102]) by orsmga005.jf.intel.com with ESMTP; 16 Jul 2018 08:09:01 -0700 To: "Eads, Gage" , "dev@dpdk.org" References: <9184057F7FC11744A2107296B6B8EB1E446E9A79@FMSMSX108.amr.corp.intel.com> From: "Burakov, Anatoly" Message-ID: Date: Mon, 16 Jul 2018 16:09:00 +0100 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <9184057F7FC11744A2107296B6B8EB1E446E9A79@FMSMSX108.amr.corp.intel.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Subject: Re: [dpdk-dev] DPDK and forked processes X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 Jul 2018 15:09:07 -0000 On 16-Jul-18 4:00 PM, Eads, Gage wrote: > Hi all, > > Does DPDK support forking secondary processes after executing > rte_eal_init()? The l2fwd_fork example and at least one application > (OpenEM: https://sourceforge.net/projects/eventmachine/) use this model, > and they do so by fixing up the EAL internals (e.g. manually changing > process_type from primary to secondary) at the start of the child > process. This feels like a hack, and I can’t find any documentation > describing this model. > > Moreover, this approach doesn’t appear to be compatible with recent EAL > changes. For instance, the multi-process communication creates a couple > handler threads (“rte_mp_handle” and “rte_mp_async”) during EAL > initialization. The child processes won’t inherit these threads, and so > won’t be able to participate in multi-process comms. This means the > reworked memory subsystem and upcoming device hotplug support > (http://mails.dpdk.org/archives/dev/2018-July/107704.html) won’t work > with this fork-after-init model. > > This is just one example – there may be other features/subsystems that > won’t work. As far as I can tell there is no official stance (though the > l2fwd_fork example implies it’s supported, IMO); I think either DPDK > should either drop the example and not support this model, or support it > and either document its limitations or resolve them. This model could be > an interesting way to run multi-process DPDK on an ASLR-enabled system, > but supporting this wouldn’t be trivial. > > Thanks, > > Gage > I think it's a very bad idea to use such a model in recent versions of DPDK. As you have correctly pointed out, IPC will not work in such a scenario, and given how our memory subsystem relies on IPC, this is a recipe for memory corruption and divergent memory maps (since technically both initial and forked processes believe they are primary). Even hacking rte_config to make DPDK think it's a secondary process will not work, because the initialization has already completed, but all of the threads (IPC, interrupt, etc.) are gone and correct IPC socket was not created, which means the process becomes invisible to the primary for all intents and purposes. We _could_ introduce some kind of "official DPDK fork" function that would fork the process and then restart interrupt, IPC etc. stuff on an already running instance of DPDK, but that seems like a workaround for a problem that shouldn't exist in the first place, because such usage is fundamentally incompatible with DPDK as it stands now. -- Thanks, Anatoly