feat: Adding UCX support for cacheTransceiver
Support for KvCache transfer over UCXX backend instead of MPI. To enable the UCX backend the following environment variable need to be set: TRTLLM_USE_UCX_KVCACHE=1
Also keeping @pcastonguay @schetlur-nv for vis about this UCX backend support MR for dis-agg serving.
Thanks June
/bot run --add-multi-gpu-test
PR_Github #436 [ run ] triggered by Bot
PR_Github #436 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #374 completed with status: 'FAILURE'
/bot run --add-multi-gpu-test
PR_Github #489 [ run ] triggered by Bot
PR_Github #489 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #421 completed with status: 'FAILURE'
/bot run --add-multi-gpu-test
PR_Github #509 [ run ] triggered by Bot
PR_Github #509 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #436 completed with status: 'FAILURE'
Since all the commits in this PR are already included in #3101 and have been merged, this PR will be closed. Thank you, @RoeyAzran1992 !