From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mailout1.w1.samsung.com (mailout1.w1.samsung.com [210.118.77.11]) by dpdk.org (Postfix) with ESMTP id F3F7311DE for ; Mon, 11 Jul 2016 13:47:59 +0200 (CEST) Received: from eucpsbgm2.samsung.com (unknown [203.254.199.245]) by mailout1.w1.samsung.com (Oracle Communications Messaging Server 7.0.5.31.0 64bit (built May 5 2014)) with ESMTP id <0OA500LW5FFYDL90@mailout1.w1.samsung.com> for dev@dpdk.org; Mon, 11 Jul 2016 12:47:58 +0100 (BST) X-AuditID: cbfec7f5-f792a6d000001302-8a-5783876d02e0 Received: from eusync3.samsung.com ( [203.254.199.213]) by eucpsbgm2.samsung.com (EUCPMTA) with SMTP id 5D.28.04866.D6783875; Mon, 11 Jul 2016 12:47:57 +0100 (BST) Received: from [106.109.129.180] by eusync3.samsung.com (Oracle Communications Messaging Server 7.0.5.31.0 64bit (built May 5 2014)) with ESMTPA id <0OA500CIHFFW3G70@eusync3.samsung.com>; Mon, 11 Jul 2016 12:47:57 +0100 (BST) To: Yuanhan Liu References: <1463748604-27251-1-git-send-email-i.maximets@samsung.com> <20160701073506.GQ2831@yliu-dev.sh.intel.com> <577CE930.2070007@samsung.com> <20160706122446.GO26521@yliu-dev.sh.intel.com> <577F9328.1030901@samsung.com> <20160710131731.GS26521@yliu-dev.sh.intel.com> <20160711083825.GY26521@yliu-dev.sh.intel.com> <57836BE0.2070401@samsung.com> <20160711110503.GZ26521@yliu-dev.sh.intel.com> Cc: dev@dpdk.org, Huawei Xie , Dyasly Sergey , Heetae Ahn , Jianfeng Tan , Stephen Hemminger From: Ilya Maximets Message-id: <5783876C.1050103@samsung.com> Date: Mon, 11 Jul 2016 14:47:56 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.8.0 MIME-version: 1.0 In-reply-to: <20160711110503.GZ26521@yliu-dev.sh.intel.com> Content-type: text/plain; charset=windows-1252 Content-transfer-encoding: 7bit X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFrrLLMWRmVeSWpSXmKPExsVy+t/xq7q57c3hBnsf61q8+7SdyWLa59vs Fu0zzzJZXGn/yW7RPfsLm8Xk2VIWi+/IWVyfcIHVgcPj14KlrB6L97xk8ph3MtCj5+Q8Jo++ LasYA1ijuGxSUnMyy1KL9O0SuDI+/t/IXDBVrGJb73fGBsajgl2MnBwSAiYS86bvYoewxSQu 3FvP1sXIxSEksJRRonfGNFYI5wWjxJxfc8CqhAX8JNZumMQIYosI6Eo8nbOOFcQWEpjOLHHl eRFIA7PAXUaJDa2PmUASbAI6EqdWHwFq4ODgFdCS2HfZACTMIqAqcW7xRrBeUYEIiVnbf4CV 8woISvyYfI8FxOYUsJZ42N8C1sosoCdx/6IWSJhZQF5i85q3zBMYBWYh6ZiFUDULSdUCRuZV jKKppckFxUnpuUZ6xYm5xaV56XrJ+bmbGCGB/nUH49JjVocYBTgYlXh4A7Y3hQuxJpYVV+Ye YpTgYFYS4S1tag4X4k1JrKxKLcqPLyrNSS0+xCjNwaIkzjtz1/sQIYH0xJLU7NTUgtQimCwT B6dUA+P6j3P/il/6dn+Cc2XCMiXFZh6ntMVO2mlXku9OcyufKx4q+fXTiZqCKr+s1eZyaVaL yrfpLf9vErlhqcyXV++C/8271ddQEnRjjbjwI3urrVVnuOVZ5xQ7s5c+ldo14dcZlvLMHU/v CS2fosLaabTo7gRXXeX9pbOW7dv7NT+/WzT+teaB7zFKLMUZiYZazEXFiQCKyE9RcAIAAA== Subject: Re: [dpdk-dev] [PATCH] vhost: fix segfault on bad descriptor address. X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 Jul 2016 11:48:00 -0000 On 11.07.2016 14:05, Yuanhan Liu wrote: > On Mon, Jul 11, 2016 at 12:50:24PM +0300, Ilya Maximets wrote: >> On 11.07.2016 11:38, Yuanhan Liu wrote: >>> On Sun, Jul 10, 2016 at 09:17:31PM +0800, Yuanhan Liu wrote: >>>> On Fri, Jul 08, 2016 at 02:48:56PM +0300, Ilya Maximets wrote: >>>>> >>>>> Another point is that crash constantly happens on queue_id=3 (second RX queue) in >>>>> my scenario. It is newly allocated virtqueue while reconfiguration from rxq=1 to >>>>> rxq=2. >>>> >>>> That's a valuable message: what's your DPDK HEAD commit while triggering >>>> this issue? >> >> fbfd99551ca3 ("mbuf: add raw allocation function") >> >>> >>> I guess I have understood what goes wrong in you case. >>> >>> I would guess that your vhost has 2 queues (here I mean queue-pairs, >>> including one Tx and Rx queue; below usage is the same) configured, >>> so does to your QEMU. However, you just enabled 1 queue while starting >>> testpmd inside the guest, and you want to enable 2 queues by running >>> following testpmd commands: >>> >>> stop >>> port stop all >>> port config all rxq 2 >>> port config all txq 2 >>> port start all >>> >>> Badly, that won't work for current virtio PMD implementation, and what's >>> worse, it triggers a vhost crash, the one you saw. >>> >>> Here is how it comes. Since you just enabled 1 queue while starting >>> testpmd, it will setup 1 queue only, meaning only one queue's **valid** >>> information will be sent to vhost. You might see SET_VRING_ADDR >>> (and related vhost messages) for the other queue as well, but they >>> are just the dummy messages: they don't include any valid/real >>> information about the 2nd queue: the driver don't setup it after all. >>> >>> So far, so good. It became broken when you run above commands. Those >>> commands do setup for the 2nd queue, however, they failed to trigger >>> the QEMU virtio device to start the vhost-user negotiation, meaning >>> no SET_VRING_ADDR will be sent for the 2nd queue, leaving vhost >>> untold and not updated. >>> >>> What's worse, above commands trigger the QEMU to send SET_VRING_ENABLE >>> messages, to enable all the vrings. And since the vrings for the 2nd >>> queue are not properly configured, the crash happens. >> >> Hmm, why 2nd queue works properly with my fix to vhost in this case? > > Hmm, really? You are sure that data flows in your 2nd queue after those > commands? From what I know is that your patch just avoid a crash, but > does not fix it. Oh, sorry. Yes, it doesn't work. With my patch applied I have a QEMU hang. >>> So maybe we should do virtio reset on port start? >> >> I guess it was removed by this patch: >> a85786dc816f ("virtio: fix states handling during initialization"). > > Seems yes. > > --yliu > >