ucx
ucx copied to clipboard
UCT/CUDA_IPC: Cache for mempool import operation
What
Follow up to https://github.com/openucx/ucx/pull/9982. This PR caches the operation that imports remotely exported handle for a custom CUDA memory pool as the mapping operation via cuMemPoolImportFromShareableHandle
is expensive.