DeepSpeed-MII icon indicating copy to clipboard operation
DeepSpeed-MII copied to clipboard

Request to support additional model architectures

Open sumitsahaykoantek opened this issue 8 months ago • 0 comments

Please add support for Mosaic MPT models and some other architectures with less than 1b parameters.

Also, it would be great if there can be some instructions how someone can contribute to this project for adding support of a new model architecture or custom model.

sumitsahaykoantek avatar Nov 14 '23 10:11 sumitsahaykoantek