From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: <qian.q.xu@intel.com> Received: from mga09.intel.com (mga09.intel.com [134.134.136.24]) by dpdk.org (Postfix) with ESMTP id CF0EA2716 for <dev@dpdk.org>; Wed, 24 Aug 2016 09:26:11 +0200 (CEST) Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by orsmga102.jf.intel.com with ESMTP; 24 Aug 2016 00:26:11 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.28,569,1464678000"; d="scan'208";a="1046343906" Received: from fmsmsx105.amr.corp.intel.com ([10.18.124.203]) by fmsmga002.fm.intel.com with ESMTP; 24 Aug 2016 00:26:10 -0700 Received: from shsmsx151.ccr.corp.intel.com (10.239.6.50) by FMSMSX105.amr.corp.intel.com (10.18.124.203) with Microsoft SMTP Server (TLS) id 14.3.248.2; Wed, 24 Aug 2016 00:26:09 -0700 Received: from shsmsx102.ccr.corp.intel.com ([169.254.2.147]) by SHSMSX151.ccr.corp.intel.com ([169.254.3.194]) with mapi id 14.03.0301.000; Wed, 24 Aug 2016 15:26:07 +0800 From: "Xu, Qian Q" <qian.q.xu@intel.com> To: Yuanhan Liu <yuanhan.liu@linux.intel.com>, "dev@dpdk.org" <dev@dpdk.org> CC: Maxime Coquelin <maxime.coquelin@redhat.com> Thread-Topic: [dpdk-dev] [PATCH 1/6] vhost: simplify memory regions handling Thread-Index: AQHR/RSbAYYg7bj93EeBFvKYzjWSwaBXtuFw Date: Wed, 24 Aug 2016 07:26:07 +0000 Message-ID: <82F45D86ADE5454A95A89742C8D1410E03322F77@shsmsx102.ccr.corp.intel.com> References: <1471939839-29778-1-git-send-email-yuanhan.liu@linux.intel.com> <1471939839-29778-2-git-send-email-yuanhan.liu@linux.intel.com> In-Reply-To: <1471939839-29778-2-git-send-email-yuanhan.liu@linux.intel.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.239.127.40] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Subject: Re: [dpdk-dev] [PATCH 1/6] vhost: simplify memory regions handling X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: patches and discussions about DPDK <dev.dpdk.org> List-Unsubscribe: <http://dpdk.org/ml/options/dev>, <mailto:dev-request@dpdk.org?subject=unsubscribe> List-Archive: <http://dpdk.org/ml/archives/dev/> List-Post: <mailto:dev@dpdk.org> List-Help: <mailto:dev-request@dpdk.org?subject=help> List-Subscribe: <http://dpdk.org/ml/listinfo/dev>, <mailto:dev-request@dpdk.org?subject=subscribe> X-List-Received-Date: Wed, 24 Aug 2016 07:26:12 -0000 I want to apply the patch on the latest DPDK, see below commit ID but faile= d since no vhost.h and vhost-user.h files. So do you have any dependency on= other patches?=20 commit 28d8abaf250c3fb4dcb6416518f4c54b4ae67205 Author: Deirdre O'Connor <deirdre.o.connor@intel.com> Date: Mon Aug 22 17:20:08 2016 +0100 doc: fix patchwork link Fixes: 58abf6e77c6b ("doc: add contributors guide") Reported-by: Jon Loeliger <jdl@netgate.com> Signed-off-by: Deirdre O'Connor <deirdre.o.connor@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com> -----Original Message----- From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Yuanhan Liu Sent: Tuesday, August 23, 2016 4:11 PM To: dev@dpdk.org Cc: Maxime Coquelin <maxime.coquelin@redhat.com>; Yuanhan Liu <yuanhan.liu@= linux.intel.com> Subject: [dpdk-dev] [PATCH 1/6] vhost: simplify memory regions handling Due to history reason (that vhost-cuse comes before vhost-user), some field= s for maintaining the vhost-user memory mappings (such as mmapped address a= nd size, with those we then can unmap on destroy) are kept in "orig_region_= map" struct, a structure that is defined only in vhost-user source file. The right way to go is to remove the structure and move all those fields in= to virtio_memory_region struct. But we simply can't do that before, because= it breaks the ABI. Now, thanks to the ABI refactoring, it's never been a blocking issue any mo= re. And here it goes: this patch removes orig_region_map and redefines virt= io_memory_region, to include all necessary info. With that, we can simplify the guest/host address convert a bit. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> --- lib/librte_vhost/vhost.h | 49 ++++++------ lib/librte_vhost/vhost_user.c | 172 +++++++++++++++++---------------------= ---- 2 files changed, 90 insertions(+), 131 deletions(-) diff --git a/lib/librte_vhost/vhost.h b/lib/librte_vhost/vhost.h index c2df= c3c..df2107b 100644 --- a/lib/librte_vhost/vhost.h +++ b/lib/librte_vhost/vhost.h @@ -143,12 +143,14 @@ struct virtio_net { * Information relating to memory regions including offsets to * addresses in QEMUs memory file. */ -struct virtio_memory_regions { - uint64_t guest_phys_address; - uint64_t guest_phys_address_end; - uint64_t memory_size; - uint64_t userspace_address; - uint64_t address_offset; +struct virtio_memory_region { + uint64_t guest_phys_addr; + uint64_t guest_user_addr; + uint64_t host_user_addr; + uint64_t size; + void *mmap_addr; + uint64_t mmap_size; + int fd; }; =20 =20 @@ -156,12 +158,8 @@ struct virtio_memory_regions { * Memory structure includes region and mapping information. */ struct virtio_memory { - /* Base QEMU userspace address of the memory file. */ - uint64_t base_address; - uint64_t mapped_address; - uint64_t mapped_size; uint32_t nregions; - struct virtio_memory_regions regions[0]; + struct virtio_memory_region regions[0]; }; =20 =20 @@ -200,26 +198,23 @@ extern uint64_t VHOST_FEATURES; #define MAX_VHOST_DEVICE 1024 extern struct virtio_net *vhost_devices[MAX_VHOST_DEVICE]; =20 -/** - * Function to convert guest physical addresses to vhost virtual addresses= . - * This is used to convert guest virtio buffer addresses. - */ +/* Convert guest physical Address to host virtual address */ static inline uint64_t __attribute__((always_inline)) -gpa_to_vva(struct v= irtio_net *dev, uint64_t guest_pa) +gpa_to_vva(struct virtio_net *dev, uint64_t gpa) { - struct virtio_memory_regions *region; - uint32_t regionidx; - uint64_t vhost_va =3D 0; - - for (regionidx =3D 0; regionidx < dev->mem->nregions; regionidx++) { - region =3D &dev->mem->regions[regionidx]; - if ((guest_pa >=3D region->guest_phys_address) && - (guest_pa <=3D region->guest_phys_address_end)) { - vhost_va =3D region->address_offset + guest_pa; - break; + struct virtio_memory_region *reg; + uint32_t i; + + for (i =3D 0; i < dev->mem->nregions; i++) { + reg =3D &dev->mem->regions[i]; + if (gpa >=3D reg->guest_phys_addr && + gpa < reg->guest_phys_addr + reg->size) { + return gpa - reg->guest_phys_addr + + reg->host_user_addr; } } - return vhost_va; + + return 0; } =20 struct virtio_net_device_ops const *notify_ops; diff --git a/lib/librte_vh= ost/vhost_user.c b/lib/librte_vhost/vhost_user.c index eee99e9..d2071fd 100= 644 --- a/lib/librte_vhost/vhost_user.c +++ b/lib/librte_vhost/vhost_user.c @@ -74,18 +74,6 @@ static const char *vhost_message_str[VHOST_USER_MAX] =3D= { [VHOST_USER_SEND_RARP] =3D "VHOST_USER_SEND_RARP", }; =20 -struct orig_region_map { - int fd; - uint64_t mapped_address; - uint64_t mapped_size; - uint64_t blksz; -}; - -#define orig_region(ptr, nregions) \ - ((struct orig_region_map *)RTE_PTR_ADD((ptr), \ - sizeof(struct virtio_memory) + \ - sizeof(struct virtio_memory_regions) * (nregions))) - static uint64_t get_blk_size(int fd) { @@ -99,18 +87,17 @@ get_blk_size(int fd) static void free_mem_region(stru= ct virtio_net *dev) { - struct orig_region_map *region; - unsigned int idx; + uint32_t i; + struct virtio_memory_region *reg; =20 if (!dev || !dev->mem) return; =20 - region =3D orig_region(dev->mem, dev->mem->nregions); - for (idx =3D 0; idx < dev->mem->nregions; idx++) { - if (region[idx].mapped_address) { - munmap((void *)(uintptr_t)region[idx].mapped_address, - region[idx].mapped_size); - close(region[idx].fd); + for (i =3D 0; i < dev->mem->nregions; i++) { + reg =3D &dev->mem->regions[i]; + if (reg->host_user_addr) { + munmap(reg->mmap_addr, reg->mmap_size); + close(reg->fd); } } } @@ -120,7 +107,7 @@ vhost_backend_cleanup(struct virtio_net *dev) { if (dev->mem) { free_mem_region(dev); - free(dev->mem); + rte_free(dev->mem); dev->mem =3D NULL; } if (dev->log_addr) { @@ -286,25 +273,23 @@ numa_realloc(struct virtio_net *dev, int index __rte_= unused) * used to convert the ring addresses to our address space. */ static uint64_t -qva_to_vva(struct virtio_net *dev, uint64_t qemu_va) +qva_to_vva(struct virtio_net *dev, uint64_t qva) { - struct virtio_memory_regions *region; - uint64_t vhost_va =3D 0; - uint32_t regionidx =3D 0; + struct virtio_memory_region *reg; + uint32_t i; =20 /* Find the region where the address lives. */ - for (regionidx =3D 0; regionidx < dev->mem->nregions; regionidx++) { - region =3D &dev->mem->regions[regionidx]; - if ((qemu_va >=3D region->userspace_address) && - (qemu_va <=3D region->userspace_address + - region->memory_size)) { - vhost_va =3D qemu_va + region->guest_phys_address + - region->address_offset - - region->userspace_address; - break; + for (i =3D 0; i < dev->mem->nregions; i++) { + reg =3D &dev->mem->regions[i]; + + if (qva >=3D reg->guest_user_addr && + qva < reg->guest_user_addr + reg->size) { + return qva - reg->guest_user_addr + + reg->host_user_addr; } } - return vhost_va; + + return 0; } =20 /* @@ -391,11 +376,13 @@ static int vhost_user_set_mem_table(struct virtio_net *dev, struct VhostUserMsg *pmsg= ) { struct VhostUserMemory memory =3D pmsg->payload.memory; - struct virtio_memory_regions *pregion; - uint64_t mapped_address, mapped_size; - unsigned int idx =3D 0; - struct orig_region_map *pregion_orig; + struct virtio_memory_region *reg; + void *mmap_addr; + uint64_t mmap_size; + uint64_t mmap_offset; uint64_t alignment; + uint32_t i; + int fd; =20 /* Remove from the data plane. */ if (dev->flags & VIRTIO_DEV_RUNNING) { @@ -405,14 +392,12 @@ vhost_user_s= et_mem_table(struct virtio_net *dev, struct VhostUserMsg *pmsg) =20 if (dev->mem) { free_mem_region(dev); - free(dev->mem); + rte_free(dev->mem); dev->mem =3D NULL; } =20 - dev->mem =3D calloc(1, - sizeof(struct virtio_memory) + - sizeof(struct virtio_memory_regions) * memory.nregions + - sizeof(struct orig_region_map) * memory.nregions); + dev->mem =3D rte_zmalloc("vhost-mem-table", sizeof(struct virtio_memory) = + + sizeof(struct virtio_memory_region) * memory.nregions, 0); if (dev->mem =3D=3D NULL) { RTE_LOG(ERR, VHOST_CONFIG, "(%d) failed to allocate memory for dev->mem\n", @@ -421,22 +406,17 @@ = vhost_user_set_mem_table(struct virtio_net *dev, struct VhostUserMsg *pmsg) } dev->mem->nregions =3D memory.nregions; =20 - pregion_orig =3D orig_region(dev->mem, memory.nregions); - for (idx =3D 0; idx < memory.nregions; idx++) { - pregion =3D &dev->mem->regions[idx]; - pregion->guest_phys_address =3D - memory.regions[idx].guest_phys_addr; - pregion->guest_phys_address_end =3D - memory.regions[idx].guest_phys_addr + - memory.regions[idx].memory_size; - pregion->memory_size =3D - memory.regions[idx].memory_size; - pregion->userspace_address =3D - memory.regions[idx].userspace_addr; - - /* This is ugly */ - mapped_size =3D memory.regions[idx].memory_size + - memory.regions[idx].mmap_offset; + for (i =3D 0; i < memory.nregions; i++) { + fd =3D pmsg->fds[i]; + reg =3D &dev->mem->regions[i]; + + reg->guest_phys_addr =3D memory.regions[i].guest_phys_addr; + reg->guest_user_addr =3D memory.regions[i].userspace_addr; + reg->size =3D memory.regions[i].memory_size; + reg->fd =3D fd; + + mmap_offset =3D memory.regions[i].mmap_offset; + mmap_size =3D reg->size + mmap_offset; =20 /* mmap() without flag of MAP_ANONYMOUS, should be called * with length argument aligned with hugepagesz at older @@ -446,67 +426= ,51 @@ vhost_user_set_mem_table(struct virtio_net *dev, struct VhostUserMsg= *pmsg) * to avoid failure, make sure in caller to keep length * aligned. */ - alignment =3D get_blk_size(pmsg->fds[idx]); + alignment =3D get_blk_size(fd); if (alignment =3D=3D (uint64_t)-1) { RTE_LOG(ERR, VHOST_CONFIG, "couldn't get hugepage size through fstat\n"); goto err_mmap; } - mapped_size =3D RTE_ALIGN_CEIL(mapped_size, alignment); + mmap_size =3D RTE_ALIGN_CEIL(mmap_size, alignment); =20 - mapped_address =3D (uint64_t)(uintptr_t)mmap(NULL, - mapped_size, - PROT_READ | PROT_WRITE, MAP_SHARED, - pmsg->fds[idx], - 0); + mmap_addr =3D mmap(NULL, mmap_size, + PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0); =20 - RTE_LOG(INFO, VHOST_CONFIG, - "mapped region %d fd:%d to:%p sz:0x%"PRIx64" " - "off:0x%"PRIx64" align:0x%"PRIx64"\n", - idx, pmsg->fds[idx], (void *)(uintptr_t)mapped_address, - mapped_size, memory.regions[idx].mmap_offset, - alignment); - - if (mapped_address =3D=3D (uint64_t)(uintptr_t)MAP_FAILED) { + if (mmap_addr =3D=3D MAP_FAILED) { RTE_LOG(ERR, VHOST_CONFIG, - "mmap qemu guest failed.\n"); + "mmap region %u failed.\n", i); goto err_mmap; } =20 - pregion_orig[idx].mapped_address =3D mapped_address; - pregion_orig[idx].mapped_size =3D mapped_size; - pregion_orig[idx].blksz =3D alignment; - pregion_orig[idx].fd =3D pmsg->fds[idx]; - - mapped_address +=3D memory.regions[idx].mmap_offset; + reg->mmap_addr =3D mmap_addr; + reg->mmap_size =3D mmap_size; + reg->host_user_addr =3D (uint64_t)(uintptr_t)mmap_addr + mmap_offset; =20 - pregion->address_offset =3D mapped_address - - pregion->guest_phys_address; - - if (memory.regions[idx].guest_phys_addr =3D=3D 0) { - dev->mem->base_address =3D - memory.regions[idx].userspace_addr; - dev->mem->mapped_address =3D - pregion->address_offset; - } - - LOG_DEBUG(VHOST_CONFIG, - "REGION: %u GPA: %p QEMU VA: %p SIZE (%"PRIu64")\n", - idx, - (void *)(uintptr_t)pregion->guest_phys_address, - (void *)(uintptr_t)pregion->userspace_address, - pregion->memory_size); + RTE_LOG(INFO, VHOST_CONFIG, + "guest memory region %u, size: 0x%" PRIx64 "\n" + "\t guest physical addr: 0x%" PRIx64 "\n" + "\t guest virtual addr: 0x%" PRIx64 "\n" + "\t host virtual addr: 0x%" PRIx64 "\n" + "\t mmap addr : 0x%" PRIx64 "\n" + "\t mmap size : 0x%" PRIx64 "\n" + "\t mmap align: 0x%" PRIx64 "\n" + "\t mmap off : 0x%" PRIx64 "\n", + i, reg->size, + reg->guest_phys_addr, + reg->guest_user_addr, + reg->host_user_addr, + (uint64_t)(uintptr_t)mmap_addr, + mmap_size, + alignment, + mmap_offset); } =20 return 0; =20 err_mmap: - while (idx--) { - munmap((void *)(uintptr_t)pregion_orig[idx].mapped_address, - pregion_orig[idx].mapped_size); - close(pregion_orig[idx].fd); - } - free(dev->mem); + free_mem_region(dev); + rte_free(dev->mem); dev->mem =3D NULL; return -1; } -- 1.9.0