This specific feature is not about NIC activity, it’s about make GPU memory visible and accessible to the CPU so there isn’t an asynchronous hidden invalidation procedure.

That’s the reason for my comment to change the name from rte_gpu_mem_pin() to rte_gpu_mem_to_cpu() (or similar).

 

The scenario you are proposing can happen but in a different context, this is not the case.

 

Thanks

EA

 

From: John Alexander <John.Alexander@datapath.co.uk>
Date: Tuesday, 4 January 2022 at 18:31
To: Elena Agostini <eagostini@nvidia.com>, NBU-Contact-Thomas Monjalon (EXTERNAL) <thomas@monjalon.net>
Cc: dev@dpdk.org <dev@dpdk.org>
Subject: RE: [PATCH v2] gpudev: pin GPU memory

External email: Use caution opening links or attachments

 

What happens when the Nvidia GPU driver kernel callback occurs to invalidate the pinned GPU memory region?  Doesn’t the NIC need to cease all DMA transfers to/from that region before the kernel callback can complete?

 

John Alexander

Senior Software Engineer

Bemrose House, Bemrose Park, Wayzgoose Drive

Derby

DE21 6XQ

+44 (0)1332 294 441

 | 

www.datapath.co.uk

LinkedIn

Twitter

YouTube

Vote for Datapath

Datapath Ltd.  Registered Number: 1609392.  Registered in England at Bemrose House, Bemrose Park, Wayzgoose Drive, Derby. DE21 6XQ.

From: Elena Agostini <eagostini@nvidia.com>
Sent: 04 January 2022 13:55
To: NBU-Contact-Thomas Monjalon (EXTERNAL) <thomas@monjalon.net>
Cc: dev@dpdk.org
Subject: Re: [PATCH v2] gpudev: pin GPU memory

 

CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and know the content is safe.

> 04/01/2022 03:41, eagostini@nvidia.com:

> > From: Elena Agostini <eagostini@nvidia.com>

> >

> > Enable the possibility to make a GPU memory area accessible from

> > the CPU.

> >

> > GPU memory has to be allocated via rte_gpu_mem_alloc().

> >

> > This patch allows the gpudev library to pin, through the GPU driver,

> > a chunk of GPU memory and to return a memory pointer usable

> > by the CPU to access the GPU memory area.

> >

> > Signed-off-by: Elena Agostini <eagostini@nvidia.com>

> [...]

> > +/**

> > + * @warning

> > + * @b EXPERIMENTAL: this API may change without prior notice.

> > + *

> > + * Pin a chunk of GPU memory to make it accessible from the CPU

>

> You should define what means "pin" exactly.

> Which properties should we expect?

>

 

Thanks for reviewing, this is the kind of discussion I wanted to have.

Maybe "pin" is too GDRCopy specific oriented.

Here I want to make a GPU memory buffer visible from the CPU. In case

of NVIDIA, this means the GPU memory address has to be pinned (virtual address

doesn't change) and dma-mapped.

 

Maybe the name should be more like rte_gpu_mem_to_cpu() that's more

explicative and generic.

 

 

> > + * using the memory pointer returned by the function.

>

> Which function should return the pointer?

> rte_gpu_mem_pin is returning an int.

 

Oversight, will fix it.

 

>

>

> > + * GPU memory has to be allocated via rte_gpu_mem_alloc().

>

> Why pinning is not done by rte_gpu_mem_alloc()?

> Should it be a flag?

 

rte_gpu_mem_alloc() allocate virtual memory on the GPU that doesn't have

to be necessarily shared (pinned) to make it visible from CPU.

 

>

> > + *

> > + * @param dev_id

> > + *   Device ID requiring pinned memory.

> > + * @param size

> > + *   Number of bytes to pin.

> > + *   Requesting 0 will do nothing.

> > + * @param ptr

> > + *   Pointer to the GPU memory area to be pinned.

> > + *   NULL is a no-op accepted value.

> > +

> > + * @return

> > + *   A pointer to the pinned GPU memory usable by the CPU, otherwise NULL and rte_errno is set:

> > + *   - ENODEV if invalid dev_id

> > + *   - EINVAL if reserved flags

>

> Which reserved flags?

>

> > + *   - ENOTSUP if operation not supported by the driver

> > + *   - E2BIG if size is higher than limit

> > + *   - ENOMEM if out of space

>

> Is out of space relevant for pinning?

 

Yes, let me add it

 

>

> > + *   - EPERM if driver error

> > + */

> > +__rte_experimental

> > +int rte_gpu_mem_pin(int16_t dev_id, size_t size, void *ptr);