TensorRT
TensorRT copied to clipboard
Adding rank based logging for torch distributed examples
This PR
- Adds rank based logging for the distributed examples
- Corrects the fallback to pytorch case for NCCL converters
- This with #3830 provides utilities for running distributed tensor parallel examples using torch.distributed