DeepSpeedExamples
DeepSpeedExamples copied to clipboard
Mistral and Orca Training
Hello, good work everyone.
Is training these models suitable for this Structure? If not, what should be changed? Or can you add it?