DeepSpeed-MII
DeepSpeed-MII copied to clipboard
Request to support additional model architectures
Please add support for Mosaic MPT models and some other architectures with less than 1b parameters.
Also, it would be great if there can be some instructions how someone can contribute to this project for adding support of a new model architecture or custom model.