* [dpdk-dev] no hugepage with UIO poll-mode driver @ 2015-11-25 5:45 Younghwan Go 2015-11-25 6:19 ` Tan, Jianfeng 0 siblings, 1 reply; 13+ messages in thread From: Younghwan Go @ 2015-11-25 5:45 UTC (permalink / raw) To: dev Hello, I'm trying to create rte mempool with my own memory scheme (shared virtual memory between CPU & GPU) other than using hugepage. I'm using IGB UIO module with Ethernet 10G 2P X520 Adapter NIC. I first tried to disable hugepage by inserting "--no-huge" option, but it gave an error at rte_mempool_create function. Then I found in rte_eal.h file that said "no-huge mode cannot be used with UIO poll-mode drivers like igb/ixgbe". Is there a way to switch DPDK code to not use hugepage with ixgbe driver or is it not possible to do so? Thank you, Younghwan ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 5:45 [dpdk-dev] no hugepage with UIO poll-mode driver Younghwan Go @ 2015-11-25 6:19 ` Tan, Jianfeng 2015-11-25 6:39 ` Younghwan Go 0 siblings, 1 reply; 13+ messages in thread From: Tan, Jianfeng @ 2015-11-25 6:19 UTC (permalink / raw) To: Younghwan Go, dev Hi Younghwan When you specify --no-huge, by default, only 64M memory is used. Can you use -m 1024 to Mmap 1G memory to have a try? Jianfeng > -----Original Message----- > From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Younghwan Go > Sent: Wednesday, November 25, 2015 1:46 PM > To: dev@dpdk.org > Subject: [dpdk-dev] no hugepage with UIO poll-mode driver > > Hello, > > I'm trying to create rte mempool with my own memory scheme (shared > virtual memory between CPU & GPU) other than using hugepage. I'm using > IGB UIO module with Ethernet 10G 2P X520 Adapter NIC. > > I first tried to disable hugepage by inserting "--no-huge" option, but it gave > an error at rte_mempool_create function. Then I found in rte_eal.h file that > said "no-huge mode cannot be used with UIO poll-mode drivers like > igb/ixgbe". > > Is there a way to switch DPDK code to not use hugepage with ixgbe driver or > is it not possible to do so? > > Thank you, > Younghwan ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 6:19 ` Tan, Jianfeng @ 2015-11-25 6:39 ` Younghwan Go 2015-11-25 10:08 ` Bruce Richardson 0 siblings, 1 reply; 13+ messages in thread From: Younghwan Go @ 2015-11-25 6:39 UTC (permalink / raw) To: Tan, Jianfeng, dev Hi Jianfeng, Thanks for the email. rte mempool was successfully created without any error. Now the next problem is that rte_eth_rx_burst() is always returning 0 as if there was no packet to receive... Do you have any suggestion on what might be causing this issue? In the meantime, I will be digging through ixgbe driver code to see what's going on. Thank you, Younghwan 2015-11-25 오후 3:19에 Tan, Jianfeng 이(가) 쓴 글: > Hi Younghwan > When you specify --no-huge, by default, only 64M memory is used. Can you use -m 1024 to > Mmap 1G memory to have a try? > > Jianfeng > >> -----Original Message----- >> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Younghwan Go >> Sent: Wednesday, November 25, 2015 1:46 PM >> To: dev@dpdk.org >> Subject: [dpdk-dev] no hugepage with UIO poll-mode driver >> >> Hello, >> >> I'm trying to create rte mempool with my own memory scheme (shared >> virtual memory between CPU & GPU) other than using hugepage. I'm using >> IGB UIO module with Ethernet 10G 2P X520 Adapter NIC. >> >> I first tried to disable hugepage by inserting "--no-huge" option, but it gave >> an error at rte_mempool_create function. Then I found in rte_eal.h file that >> said "no-huge mode cannot be used with UIO poll-mode drivers like >> igb/ixgbe". >> >> Is there a way to switch DPDK code to not use hugepage with ixgbe driver or >> is it not possible to do so? >> >> Thank you, >> Younghwan ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 6:39 ` Younghwan Go @ 2015-11-25 10:08 ` Bruce Richardson 2015-11-25 10:23 ` Thomas Monjalon 0 siblings, 1 reply; 13+ messages in thread From: Bruce Richardson @ 2015-11-25 10:08 UTC (permalink / raw) To: Younghwan Go; +Cc: dev On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > Hi Jianfeng, > > Thanks for the email. rte mempool was successfully created without any > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > as if there was no packet to receive... Do you have any suggestion on what > might be causing this issue? In the meantime, I will be digging through > ixgbe driver code to see what's going on. > > Thank you, > Younghwan > The problem is that with --no-huge we don't have the physical address of the memory to write to the network card. That's what it's marked as for testing only. /Bruce > 2015-11-25 오후 3:19에 Tan, Jianfeng 이(가) 쓴 글: > >Hi Younghwan > >When you specify --no-huge, by default, only 64M memory is used. Can you use -m 1024 to > >Mmap 1G memory to have a try? > > > >Jianfeng > > > >>-----Original Message----- > >>From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Younghwan Go > >>Sent: Wednesday, November 25, 2015 1:46 PM > >>To: dev@dpdk.org > >>Subject: [dpdk-dev] no hugepage with UIO poll-mode driver > >> > >>Hello, > >> > >>I'm trying to create rte mempool with my own memory scheme (shared > >>virtual memory between CPU & GPU) other than using hugepage. I'm using > >>IGB UIO module with Ethernet 10G 2P X520 Adapter NIC. > >> > >>I first tried to disable hugepage by inserting "--no-huge" option, but it gave > >>an error at rte_mempool_create function. Then I found in rte_eal.h file that > >>said "no-huge mode cannot be used with UIO poll-mode drivers like > >>igb/ixgbe". > >> > >>Is there a way to switch DPDK code to not use hugepage with ixgbe driver or > >>is it not possible to do so? > >> > >>Thank you, > >>Younghwan > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 10:08 ` Bruce Richardson @ 2015-11-25 10:23 ` Thomas Monjalon 2015-11-25 11:00 ` Bruce Richardson 0 siblings, 1 reply; 13+ messages in thread From: Thomas Monjalon @ 2015-11-25 10:23 UTC (permalink / raw) To: Bruce Richardson; +Cc: dev 2015-11-25 10:08, Bruce Richardson: > On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > > Hi Jianfeng, > > > > Thanks for the email. rte mempool was successfully created without any > > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > > as if there was no packet to receive... Do you have any suggestion on what > > might be causing this issue? In the meantime, I will be digging through > > ixgbe driver code to see what's going on. > > > > Thank you, > > Younghwan > > > > The problem is that with --no-huge we don't have the physical address of the memory > to write to the network card. That's what it's marked as for testing only. Even with rte_mem_virt2phy() + rte_mem_lock_page() ? ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 10:23 ` Thomas Monjalon @ 2015-11-25 11:00 ` Bruce Richardson 2015-11-25 11:03 ` Thomas Monjalon 0 siblings, 1 reply; 13+ messages in thread From: Bruce Richardson @ 2015-11-25 11:00 UTC (permalink / raw) To: Thomas Monjalon; +Cc: dev On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: > 2015-11-25 10:08, Bruce Richardson: > > On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > > > Hi Jianfeng, > > > > > > Thanks for the email. rte mempool was successfully created without any > > > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > > > as if there was no packet to receive... Do you have any suggestion on what > > > might be causing this issue? In the meantime, I will be digging through > > > ixgbe driver code to see what's going on. > > > > > > Thank you, > > > Younghwan > > > > > > > The problem is that with --no-huge we don't have the physical address of the memory > > to write to the network card. That's what it's marked as for testing only. > > Even with rte_mem_virt2phy() + rte_mem_lock_page() ? > With no-huge, we just set up a single memory segment at startup and set its "physaddr" to be the virtual address. /Bruce /* hugetlbfs can be disabled */ if (internal_config.no_hugetlbfs) { addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); if (addr == MAP_FAILED) { RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, strerror(errno)); return -1; } mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; mcfg->memseg[0].addr = addr; mcfg->memseg[0].hugepage_sz = RTE_PGSIZE_4K; mcfg->memseg[0].len = internal_config.memory; mcfg->memseg[0].socket_id = 0; return 0; } ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 11:00 ` Bruce Richardson @ 2015-11-25 11:03 ` Thomas Monjalon 2015-11-25 12:02 ` Bruce Richardson 0 siblings, 1 reply; 13+ messages in thread From: Thomas Monjalon @ 2015-11-25 11:03 UTC (permalink / raw) To: Bruce Richardson; +Cc: dev 2015-11-25 11:00, Bruce Richardson: > On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: > > 2015-11-25 10:08, Bruce Richardson: > > > On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > > > > Hi Jianfeng, > > > > > > > > Thanks for the email. rte mempool was successfully created without any > > > > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > > > > as if there was no packet to receive... Do you have any suggestion on what > > > > might be causing this issue? In the meantime, I will be digging through > > > > ixgbe driver code to see what's going on. > > > > > > > > Thank you, > > > > Younghwan > > > > > > > > > > The problem is that with --no-huge we don't have the physical address of the memory > > > to write to the network card. That's what it's marked as for testing only. > > > > Even with rte_mem_virt2phy() + rte_mem_lock_page() ? > > > With no-huge, we just set up a single memory segment at startup and set its > "physaddr" to be the virtual address. > > /* hugetlbfs can be disabled */ > if (internal_config.no_hugetlbfs) { > addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, > MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); > if (addr == MAP_FAILED) { > RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, > strerror(errno)); > return -1; > } > mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: /* * the pfn (page frame number) are bits 0-54 (see * pagemap.txt in linux Documentation) */ physaddr = ((page & 0x7fffffffffffffULL) * page_size) + ((unsigned long)virtaddr % page_size); ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 11:03 ` Thomas Monjalon @ 2015-11-25 12:02 ` Bruce Richardson 2015-11-25 13:22 ` Thomas Monjalon 0 siblings, 1 reply; 13+ messages in thread From: Bruce Richardson @ 2015-11-25 12:02 UTC (permalink / raw) To: Thomas Monjalon; +Cc: dev On Wed, Nov 25, 2015 at 12:03:05PM +0100, Thomas Monjalon wrote: > 2015-11-25 11:00, Bruce Richardson: > > On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: > > > 2015-11-25 10:08, Bruce Richardson: > > > > On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > > > > > Hi Jianfeng, > > > > > > > > > > Thanks for the email. rte mempool was successfully created without any > > > > > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > > > > > as if there was no packet to receive... Do you have any suggestion on what > > > > > might be causing this issue? In the meantime, I will be digging through > > > > > ixgbe driver code to see what's going on. > > > > > > > > > > Thank you, > > > > > Younghwan > > > > > > > > > > > > > The problem is that with --no-huge we don't have the physical address of the memory > > > > to write to the network card. That's what it's marked as for testing only. > > > > > > Even with rte_mem_virt2phy() + rte_mem_lock_page() ? > > > > > With no-huge, we just set up a single memory segment at startup and set its > > "physaddr" to be the virtual address. > > > > /* hugetlbfs can be disabled */ > > if (internal_config.no_hugetlbfs) { > > addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, > > MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); > > if (addr == MAP_FAILED) { > > RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, > > strerror(errno)); > > return -1; > > } > > mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; > > rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: > > /* > * the pfn (page frame number) are bits 0-54 (see > * pagemap.txt in linux Documentation) > */ > physaddr = ((page & 0x7fffffffffffffULL) * page_size) > + ((unsigned long)virtaddr % page_size); > Yes, you are right. I was not aware that that function was used as part of the mempool init, but now I see that "rte_mempool_virt2phy()" does indeed call that function if hugepages are disabled, so my bad. /Bruce ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 12:02 ` Bruce Richardson @ 2015-11-25 13:22 ` Thomas Monjalon 2015-11-25 13:44 ` Sergio Gonzalez Monroy 0 siblings, 1 reply; 13+ messages in thread From: Thomas Monjalon @ 2015-11-25 13:22 UTC (permalink / raw) To: Bruce Richardson; +Cc: dev 2015-11-25 12:02, Bruce Richardson: > On Wed, Nov 25, 2015 at 12:03:05PM +0100, Thomas Monjalon wrote: > > 2015-11-25 11:00, Bruce Richardson: > > > On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: > > > > 2015-11-25 10:08, Bruce Richardson: > > > > > On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > > > > > > Hi Jianfeng, > > > > > > > > > > > > Thanks for the email. rte mempool was successfully created without any > > > > > > error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > > > > > > as if there was no packet to receive... Do you have any suggestion on what > > > > > > might be causing this issue? In the meantime, I will be digging through > > > > > > ixgbe driver code to see what's going on. > > > > > > > > > > > > Thank you, > > > > > > Younghwan > > > > > > > > > > > > > > > > The problem is that with --no-huge we don't have the physical address of the memory > > > > > to write to the network card. That's what it's marked as for testing only. > > > > > > > > Even with rte_mem_virt2phy() + rte_mem_lock_page() ? > > > > > > > With no-huge, we just set up a single memory segment at startup and set its > > > "physaddr" to be the virtual address. > > > > > > /* hugetlbfs can be disabled */ > > > if (internal_config.no_hugetlbfs) { > > > addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, > > > MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); > > > if (addr == MAP_FAILED) { > > > RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, > > > strerror(errno)); > > > return -1; > > > } > > > mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; > > > > rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: > > > > /* > > * the pfn (page frame number) are bits 0-54 (see > > * pagemap.txt in linux Documentation) > > */ > > physaddr = ((page & 0x7fffffffffffffULL) * page_size) > > + ((unsigned long)virtaddr % page_size); > > > > Yes, you are right. I was not aware that that function was used as part of the > mempool init, but now I see that "rte_mempool_virt2phy()" does indeed call that > function if hugepages are disabled, so my bad. Do you think we could move --no-huge in the main section (not only for testing)? ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 13:22 ` Thomas Monjalon @ 2015-11-25 13:44 ` Sergio Gonzalez Monroy 2015-11-25 14:12 ` Ananyev, Konstantin 0 siblings, 1 reply; 13+ messages in thread From: Sergio Gonzalez Monroy @ 2015-11-25 13:44 UTC (permalink / raw) To: Thomas Monjalon; +Cc: dev On 25/11/2015 13:22, Thomas Monjalon wrote: > 2015-11-25 12:02, Bruce Richardson: >> On Wed, Nov 25, 2015 at 12:03:05PM +0100, Thomas Monjalon wrote: >>> 2015-11-25 11:00, Bruce Richardson: >>>> On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: >>>>> 2015-11-25 10:08, Bruce Richardson: >>>>>> On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: >>>>>>> Hi Jianfeng, >>>>>>> >>>>>>> Thanks for the email. rte mempool was successfully created without any >>>>>>> error. Now the next problem is that rte_eth_rx_burst() is always returning 0 >>>>>>> as if there was no packet to receive... Do you have any suggestion on what >>>>>>> might be causing this issue? In the meantime, I will be digging through >>>>>>> ixgbe driver code to see what's going on. >>>>>>> >>>>>>> Thank you, >>>>>>> Younghwan >>>>>>> >>>>>> The problem is that with --no-huge we don't have the physical address of the memory >>>>>> to write to the network card. That's what it's marked as for testing only. >>>>> Even with rte_mem_virt2phy() + rte_mem_lock_page() ? >>>>> >>>> With no-huge, we just set up a single memory segment at startup and set its >>>> "physaddr" to be the virtual address. >>>> >>>> /* hugetlbfs can be disabled */ >>>> if (internal_config.no_hugetlbfs) { >>>> addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, >>>> MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); >>>> if (addr == MAP_FAILED) { >>>> RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, >>>> strerror(errno)); >>>> return -1; >>>> } >>>> mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; >>> rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: >>> >>> /* >>> * the pfn (page frame number) are bits 0-54 (see >>> * pagemap.txt in linux Documentation) >>> */ >>> physaddr = ((page & 0x7fffffffffffffULL) * page_size) >>> + ((unsigned long)virtaddr % page_size); >>> >> Yes, you are right. I was not aware that that function was used as part of the >> mempool init, but now I see that "rte_mempool_virt2phy()" does indeed call that >> function if hugepages are disabled, so my bad. > Do you think we could move --no-huge in the main section (not only for testing)? Hi, I think the main issue is going to be the HW descriptors queues. AFAIK drivers now call rte_eth_dma_zone_reserve, which is basically a wrapper around rte_memzone_reserve, to get a chunk of phys memory, and in the case of --no-huge is not going to be really phys contiguous. Ideally we would move and expand the functionality of dma_zone reserve API to the EAL, so we could detect what page size we have and set the boundary for such page size. dma_zone_reserve does something similar to work on Xen target by reserving memzones on 2MB boundary. Sergio ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 13:44 ` Sergio Gonzalez Monroy @ 2015-11-25 14:12 ` Ananyev, Konstantin 2015-11-26 4:47 ` Younghwan Go 0 siblings, 1 reply; 13+ messages in thread From: Ananyev, Konstantin @ 2015-11-25 14:12 UTC (permalink / raw) To: Gonzalez Monroy, Sergio, Thomas Monjalon; +Cc: dev > -----Original Message----- > From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Sergio Gonzalez Monroy > Sent: Wednesday, November 25, 2015 1:44 PM > To: Thomas Monjalon > Cc: dev@dpdk.org > Subject: Re: [dpdk-dev] no hugepage with UIO poll-mode driver > > On 25/11/2015 13:22, Thomas Monjalon wrote: > > 2015-11-25 12:02, Bruce Richardson: > >> On Wed, Nov 25, 2015 at 12:03:05PM +0100, Thomas Monjalon wrote: > >>> 2015-11-25 11:00, Bruce Richardson: > >>>> On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: > >>>>> 2015-11-25 10:08, Bruce Richardson: > >>>>>> On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: > >>>>>>> Hi Jianfeng, > >>>>>>> > >>>>>>> Thanks for the email. rte mempool was successfully created without any > >>>>>>> error. Now the next problem is that rte_eth_rx_burst() is always returning 0 > >>>>>>> as if there was no packet to receive... Do you have any suggestion on what > >>>>>>> might be causing this issue? In the meantime, I will be digging through > >>>>>>> ixgbe driver code to see what's going on. > >>>>>>> > >>>>>>> Thank you, > >>>>>>> Younghwan > >>>>>>> > >>>>>> The problem is that with --no-huge we don't have the physical address of the memory > >>>>>> to write to the network card. That's what it's marked as for testing only. > >>>>> Even with rte_mem_virt2phy() + rte_mem_lock_page() ? > >>>>> > >>>> With no-huge, we just set up a single memory segment at startup and set its > >>>> "physaddr" to be the virtual address. > >>>> > >>>> /* hugetlbfs can be disabled */ > >>>> if (internal_config.no_hugetlbfs) { > >>>> addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, > >>>> MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); > >>>> if (addr == MAP_FAILED) { > >>>> RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, > >>>> strerror(errno)); > >>>> return -1; > >>>> } > >>>> mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; > >>> rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: > >>> > >>> /* > >>> * the pfn (page frame number) are bits 0-54 (see > >>> * pagemap.txt in linux Documentation) > >>> */ > >>> physaddr = ((page & 0x7fffffffffffffULL) * page_size) > >>> + ((unsigned long)virtaddr % page_size); > >>> > >> Yes, you are right. I was not aware that that function was used as part of the > >> mempool init, but now I see that "rte_mempool_virt2phy()" does indeed call that > >> function if hugepages are disabled, so my bad. > > Do you think we could move --no-huge in the main section (not only for testing)? > Hi, > > I think the main issue is going to be the HW descriptors queues. > AFAIK drivers now call rte_eth_dma_zone_reserve, which is basically a > wrapper around > rte_memzone_reserve, to get a chunk of phys memory, and in the case of > --no-huge is > not going to be really phys contiguous. > > Ideally we would move and expand the functionality of dma_zone reserve > API to the EAL, > so we could detect what page size we have and set the boundary for such > page size. > dma_zone_reserve does something similar to work on Xen target by > reserving memzones > on 2MB boundary. With xen we have a special kernel driver that allocates physically continuous chunks of memory for us. So we can guarantee that each such chunk would be at least 2MB long. That's enough to allocate HW rings (max HW ring size for let say ixgbe is ~64KB). Here there is absolutely no guarantee that memory allocated by kernel will be memory continuous. Of course you can search though all pages that you allocated and most likely you'll find a continuous chunk big enough for that. Another problem - mbufs. You need to be sure that each mbuf doesn't cross page boundary (in case next page is not adjacent to current one). So you'll probably need to use rte_mempool_xmem_create() to allocate mbufs from no hugepages. BTW, as I remember with vfio in place you should be able to do IO with no-hugepages options, no? As it relies on vfio ability to setup IOMMU tables for you. Konstantin > > Sergio ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-25 14:12 ` Ananyev, Konstantin @ 2015-11-26 4:47 ` Younghwan Go 2015-11-26 14:37 ` Kyle Larose 0 siblings, 1 reply; 13+ messages in thread From: Younghwan Go @ 2015-11-26 4:47 UTC (permalink / raw) To: dev Hello, Thank you all for helping us understand on issues with no hugepage option. As Konstantin mentioned at the end, I tried using VFIO module instead of IGB UIO module. I enabled all necessary parameters (IOMMU, virtualization, vfio-pci, VFIO permission) and ran my code with no hugepage option. At first, it seemed to receive packets fine, but after a while, it stopped receiving packets. I could temporarily remove this issue by not calling rte_eth_tx_burst(). Also, when I looked at the received packets, they all contained 0s instead of actual data. Was there anything that I missed in running with VFIO? I'm curious if no hugepage with no hugepage option was confirmed to run with VFIO. Thank you, Younghwan 2015-11-25 오후 11:12에 Ananyev, Konstantin 이(가) 쓴 글: > >> -----Original Message----- >> From: dev [mailto:dev-bounces@dpdk.org] On Behalf Of Sergio Gonzalez Monroy >> Sent: Wednesday, November 25, 2015 1:44 PM >> To: Thomas Monjalon >> Cc: dev@dpdk.org >> Subject: Re: [dpdk-dev] no hugepage with UIO poll-mode driver >> >> On 25/11/2015 13:22, Thomas Monjalon wrote: >>> 2015-11-25 12:02, Bruce Richardson: >>>> On Wed, Nov 25, 2015 at 12:03:05PM +0100, Thomas Monjalon wrote: >>>>> 2015-11-25 11:00, Bruce Richardson: >>>>>> On Wed, Nov 25, 2015 at 11:23:57AM +0100, Thomas Monjalon wrote: >>>>>>> 2015-11-25 10:08, Bruce Richardson: >>>>>>>> On Wed, Nov 25, 2015 at 03:39:17PM +0900, Younghwan Go wrote: >>>>>>>>> Hi Jianfeng, >>>>>>>>> >>>>>>>>> Thanks for the email. rte mempool was successfully created without any >>>>>>>>> error. Now the next problem is that rte_eth_rx_burst() is always returning 0 >>>>>>>>> as if there was no packet to receive... Do you have any suggestion on what >>>>>>>>> might be causing this issue? In the meantime, I will be digging through >>>>>>>>> ixgbe driver code to see what's going on. >>>>>>>>> >>>>>>>>> Thank you, >>>>>>>>> Younghwan >>>>>>>>> >>>>>>>> The problem is that with --no-huge we don't have the physical address of the memory >>>>>>>> to write to the network card. That's what it's marked as for testing only. >>>>>>> Even with rte_mem_virt2phy() + rte_mem_lock_page() ? >>>>>>> >>>>>> With no-huge, we just set up a single memory segment at startup and set its >>>>>> "physaddr" to be the virtual address. >>>>>> >>>>>> /* hugetlbfs can be disabled */ >>>>>> if (internal_config.no_hugetlbfs) { >>>>>> addr = mmap(NULL, internal_config.memory, PROT_READ | PROT_WRITE, >>>>>> MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); >>>>>> if (addr == MAP_FAILED) { >>>>>> RTE_LOG(ERR, EAL, "%s: mmap() failed: %s\n", __func__, >>>>>> strerror(errno)); >>>>>> return -1; >>>>>> } >>>>>> mcfg->memseg[0].phys_addr = (phys_addr_t)(uintptr_t)addr; >>>>> rte_mem_virt2phy() does not use memseg.phys_addr but /proc/self/pagemap: >>>>> >>>>> /* >>>>> * the pfn (page frame number) are bits 0-54 (see >>>>> * pagemap.txt in linux Documentation) >>>>> */ >>>>> physaddr = ((page & 0x7fffffffffffffULL) * page_size) >>>>> + ((unsigned long)virtaddr % page_size); >>>>> >>>> Yes, you are right. I was not aware that that function was used as part of the >>>> mempool init, but now I see that "rte_mempool_virt2phy()" does indeed call that >>>> function if hugepages are disabled, so my bad. >>> Do you think we could move --no-huge in the main section (not only for testing)? >> Hi, >> >> I think the main issue is going to be the HW descriptors queues. >> AFAIK drivers now call rte_eth_dma_zone_reserve, which is basically a >> wrapper around >> rte_memzone_reserve, to get a chunk of phys memory, and in the case of >> --no-huge is >> not going to be really phys contiguous. >> >> Ideally we would move and expand the functionality of dma_zone reserve >> API to the EAL, >> so we could detect what page size we have and set the boundary for such >> page size. >> dma_zone_reserve does something similar to work on Xen target by >> reserving memzones >> on 2MB boundary. > With xen we have a special kernel driver that allocates physically continuous > chunks of memory for us. > So we can guarantee that each such chunk would be at least 2MB long. > That's enough to allocate HW rings (max HW ring size for let say ixgbe is ~64KB). > Here there is absolutely no guarantee that memory allocated by kernel will be memory continuous. > Of course you can search though all pages that you allocated and most likely you'll find a continuous > chunk big enough for that. > Another problem - mbufs. > You need to be sure that each mbuf doesn't cross page boundary > (in case next page is not adjacent to current one). > So you'll probably need to use rte_mempool_xmem_create() to allocate mbufs from no hugepages. > BTW, as I remember with vfio in place you should be able to do IO with no-hugepages options, no? > As it relies on vfio ability to setup IOMMU tables for you. > Konstantin > >> Sergio ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [dpdk-dev] no hugepage with UIO poll-mode driver 2015-11-26 4:47 ` Younghwan Go @ 2015-11-26 14:37 ` Kyle Larose 0 siblings, 0 replies; 13+ messages in thread From: Kyle Larose @ 2015-11-26 14:37 UTC (permalink / raw) To: Younghwan Go; +Cc: dev On Wed, Nov 25, 2015 at 11:47 PM, Younghwan Go <yhwan@ndsl.kaist.edu> wrote: > Hello, > > Thank you all for helping us understand on issues with no hugepage option. > > As Konstantin mentioned at the end, I tried using VFIO module instead of IGB > UIO module. I enabled all necessary parameters (IOMMU, virtualization, > vfio-pci, VFIO permission) and ran my code with no hugepage option. > > At first, it seemed to receive packets fine, but after a while, it stopped > receiving packets. I could temporarily remove this issue by not calling > rte_eth_tx_burst(). Also, when I looked at the received packets, they all > contained 0s instead of actual data. In my experience, if there's all zeroes it's because the NIC didn't actually write any packet data. In my case, to prove that, I initialized the mbuf data to a well-known pattern, and saw that after packet reception, the mbuf data was still that pattern, not the data I actually sent in the packet. From my analysis, I concluded that the ixgbe NIC was failing to actually write the data to memory because the corresponding PCI transaction was failing. The PCI transaction failed because something was going wrong with how IOMMU was mapping my VM's guest physical address to the host's physical address, and how that interacted with the PCI tree. I don't know if that's what you're seeing. However, it may be that the physical address being given to the NIC for the mbuf is invalid, or cannot be properly mapped to a real physical address if you're using IOMMU. When this happens, can you check the pcie AER registers on the NIC, if they exist? Maybe they'll show that something happened. Thanks, Kyle > Was there anything that I missed in > running with VFIO? I'm curious if no hugepage with no hugepage option was > confirmed to run with VFIO. > > Thank you, > Younghwan > > 2015-11-25 오후 11:12에 Ananyev, Konstantin 이(가) 쓴 글: > ^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2015-11-26 14:37 UTC | newest] Thread overview: 13+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2015-11-25 5:45 [dpdk-dev] no hugepage with UIO poll-mode driver Younghwan Go 2015-11-25 6:19 ` Tan, Jianfeng 2015-11-25 6:39 ` Younghwan Go 2015-11-25 10:08 ` Bruce Richardson 2015-11-25 10:23 ` Thomas Monjalon 2015-11-25 11:00 ` Bruce Richardson 2015-11-25 11:03 ` Thomas Monjalon 2015-11-25 12:02 ` Bruce Richardson 2015-11-25 13:22 ` Thomas Monjalon 2015-11-25 13:44 ` Sergio Gonzalez Monroy 2015-11-25 14:12 ` Ananyev, Konstantin 2015-11-26 4:47 ` Younghwan Go 2015-11-26 14:37 ` Kyle Larose
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).