DPDK patches and discussions
 help / color / mirror / Atom feed
From: Maxime Coquelin <maxime.coquelin@redhat.com>
To: dev@dpdk.org, tiwei.bie@intel.com, zhihong.wang@intel.com,
	jfreimann@redhat.com, nicknickolaev@gmail.com,
	i.maximets@samsung.com, bruce.richardson@intel.com,
	alejandro.lucero@netronome.com
Cc: dgilbert@redhat.com, stable@dpdk.org
Subject: Re: [dpdk-dev] [PATCH v7 00/19] vhost: add postcopy live-migration support
Date: Tue, 16 Oct 2018 10:37:53 +0200	[thread overview]
Message-ID: <9e0f6b4b-92c0-cee2-134b-05a5c63da676@redhat.com> (raw)
In-Reply-To: <20181012124047.17884-1-maxime.coquelin@redhat.com>



On 10/12/2018 02:40 PM, Maxime Coquelin wrote:
> In this v7:
> - Move vh_result enum declaration to vhost.h (Ilya)
> - Remove check in read_fd_message() for more fds received
>    than requested as already checked with MSG_CTRUNC (Ilya)
> - Don't return an error in vhost-crypto post message handler
>    if not a crypto message as all messages go through it.
> 
> With classic live-migration, the VM runs on source while its
> content is being migrated to destination. When pages already
> migrated to destination are dirtied by the source, they get
> copied until both source and destination memory converge.
> At that time, the source is stopped and destination is
> started.
> 
> With postcopy live-migration, the VM is started on destination
> before all the memory has been migrated. When the VM tries to
> access a page that haven't been migrated yet, a pagefault is
> triggered, handled by userfaultfd which pauses the thread.
> A Qemu thread in charge of postcopy request the source for
> the missing page. Once received and mapped, the paused thread
> gets resumed.
> 
> Userfaultfd supports handling faults from a different process,
> and Qemu supports postcopy with vhost-user backends since
> v2.12.
> 
> One problem encountered with classic live-migration for VMs
> relying on vhost-user backends is that when the traffic is
> high (e.g. PVP), it happens that it never converges as
> pages gets dirtied at a faster rate than they are copied
> to the destination.
> It is expected this problem sould be solved with using
> postcopy, as rings memory and buffers will be copied once,
> when destination will pagefault on them.
> 
> Note that it will certainly require a rebase to apply on top
> of Nikolay's vhost-user message handling rework.
> 
> Steps to test postcopy:
> 1. Run DPDK's Testpmd application on source:
> ./install/bin/testpmd -m 512 --file-prefix=src -l 0,2 -n 4 \
>    --vdev 'net_vhost0,iface=/tmp/vu-src' -- --portmask=1 -i \
>    --rxq=1 --txq=1 --nb-cores=1 --eth-peer=0,52:54:00:11:22:12 \
>    --no-mlockall
> 
> 2. Run DPDK's Testpmd application on destination:
> ./install/bin/testpmd -m 512 --file-prefix=dst -l 0,2 -n 4 \
>    --vdev 'net_vhost0,iface=/tmp/vu-dst,postcopy-support=1' -- --portmask=1 -i \
>    --rxq=1 --txq=1 --nb-cores=1 --eth-peer=0,52:54:00:11:22:12 \
>    --no-mlockall
> 
> 3. Launch VM on source:
> ./x86_64-softmmu/qemu-system-x86_64 -enable-kvm -m 3G -smp 2 -cpu host \
>    -object memory-backend-file,id=mem,size=3G,mem-path=/dev/shm,share=on \
>    -numa node,memdev=mem -mem-prealloc \
>    -chardev socket,id=char0,path=/tmp/vu-src \
>    -netdev type=vhost-user,id=mynet1,chardev=char0,vhostforce \
>    -device virtio-net-pci,netdev=mynet1 /home/virt/rhel7.6-1-clone.qcow2 \
>    -net none -vnc :0 -monitor stdio
> 
> 4. Launch VM on destination:
> ./x86_64-softmmu/qemu-system-x86_64 -enable-kvm -m 3G -smp 2 -cpu host \
>    -object memory-backend-file,id=mem,size=3G,mem-path=/dev/shm,share=on \
>    -numa node,memdev=mem -mem-prealloc \
>    -chardev socket,id=char0,path=/tmp/vu-dst \
>    -netdev type=vhost-user,id=mynet1,chardev=char0,vhostforce \
>    -device virtio-net-pci,netdev=mynet1 /home/virt/rhel7.6-1-clone.qcow2 \
>    -net none -vnc :1 -monitor stdio -incoming tcp::8888
> 
> 5. In both testpmd prompts, start flooding the virtio-net device:
> testpmd> set fwd txonly
> testpmd> start
> 
> 6. In destination's Qemu monitor, enable postcopy:
> (qemu) migrate_set_capability postcopy-ram on
> 
> 7. In source's Qemu monitor, enable postcopy and launch migration:
> (qemu) migrate_set_capability postcopy-ram on
> (qemu) migrate -d tcp:0:8888
> (qemu) migrate_start_postcopy
> 
> Maxime Coquelin (19):
>    vhost: fix messages results handling
>    vhost: fix return code of messages requiring replies
>    vhost: clarify reply-ack in case a reply was already sent
>    vhost: fix payload size of reply
>    vhost: fix error handling when mem table gets updated
>    vhost: define postcopy protocol flag
>    vhost: add number of fds to vhost-user messages and use it
>    vhost: pass socket fd to message handling callbacks
>    vhost: enable fds passing when sending vhost-user messages
>    vhost: add config flag for postcopy feature
>    vhost: introduce postcopy's advise message
>    vhost: add support for postcopy's listen message
>    vhost: register new regions with userfaultfd
>    vhost: avoid useless VhostUserMemory copy
>    vhost: send userfault range addresses back to qemu
>    vhost: add support to postcopy's end request
>    vhost: restrict postcopy live-migration enablement
>    net/vhost: add parameter to enable postcopy support
>    vhost: enable postcopy protocol feature
> 
>   config/common_linuxapp              |   1 +
>   doc/guides/nics/vhost.rst           |   5 +
>   doc/guides/prog_guide/vhost_lib.rst |   8 +
>   drivers/net/vhost/rte_eth_vhost.c   |  13 ++
>   lib/librte_vhost/meson.build        |   2 +
>   lib/librte_vhost/rte_vhost.h        |   5 +
>   lib/librte_vhost/socket.c           |  46 +++-
>   lib/librte_vhost/vhost.h            |  32 ++-
>   lib/librte_vhost/vhost_crypto.c     |  25 +-
>   lib/librte_vhost/vhost_user.c       | 344 +++++++++++++++++++++-------
>   lib/librte_vhost/vhost_user.h       |  12 +-
>   11 files changed, 377 insertions(+), 116 deletions(-)
> 

Applied to dpdk-next-virtio/master.

Maxime

      parent reply	other threads:[~2018-10-16  8:38 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-10-12 12:40 Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 01/19] vhost: fix messages results handling Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 02/19] vhost: fix return code of messages requiring replies Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 03/19] vhost: clarify reply-ack in case a reply was already sent Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 04/19] vhost: fix payload size of reply Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 05/19] vhost: fix error handling when mem table gets updated Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 06/19] vhost: define postcopy protocol flag Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 07/19] vhost: add number of fds to vhost-user messages and use it Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 08/19] vhost: pass socket fd to message handling callbacks Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 09/19] vhost: enable fds passing when sending vhost-user messages Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 10/19] vhost: add config flag for postcopy feature Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 11/19] vhost: introduce postcopy's advise message Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 12/19] vhost: add support for postcopy's listen message Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 13/19] vhost: register new regions with userfaultfd Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 14/19] vhost: avoid useless VhostUserMemory copy Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 15/19] vhost: send userfault range addresses back to qemu Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 16/19] vhost: add support to postcopy's end request Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 17/19] vhost: restrict postcopy live-migration enablement Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 18/19] net/vhost: add parameter to enable postcopy support Maxime Coquelin
2018-10-12 12:40 ` [dpdk-dev] [PATCH v7 19/19] vhost: enable postcopy protocol feature Maxime Coquelin
2018-10-16  8:37 ` Maxime Coquelin [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9e0f6b4b-92c0-cee2-134b-05a5c63da676@redhat.com \
    --to=maxime.coquelin@redhat.com \
    --cc=alejandro.lucero@netronome.com \
    --cc=bruce.richardson@intel.com \
    --cc=dev@dpdk.org \
    --cc=dgilbert@redhat.com \
    --cc=i.maximets@samsung.com \
    --cc=jfreimann@redhat.com \
    --cc=nicknickolaev@gmail.com \
    --cc=stable@dpdk.org \
    --cc=tiwei.bie@intel.com \
    --cc=zhihong.wang@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).