DeepSpeed
DeepSpeed copied to clipboard
Add FALCON Auto-TP Support
This PR adds the support for running Falcon-40B on multiple A100 GPUs. To run the test for this model you can use this PR on DeepSpeedExample repo.
@RezaYazdaniAminabadi could we merge this?