ucx
ucx copied to clipboard
What‘s the best practice for intra-node GPU communication when using Active Message?
There are 8 cards in one node. Should I create the endpoints for the rest of 7 cards in GPUx? Or I need to use different methods when intra-node GPU communication? Or use one UCX context for all 8 cards?