DeepSpeedExamples
DeepSpeedExamples copied to clipboard
If I use a self-improved transformer architecture, can it support?
The customized model is not in your "Supported Models" list. Can it benefit from Deepspeed chat?