Is there any plan to support MarianMTModel?
Is there any plan to support MarianMTModel?here,https://huggingface.co/Helsinki-NLP/opus-mt-it-en
Is it a standard Transformer model? If it is, we can write an export script to export from Marian model to LightSeq model, and then infer without any modification.
Is it a standard Transformer model? If it is, we can write an export script to export from Marian model to LightSeq model, and then infer without any modification.
Yes, looking forward to this script.
Same question, I also want to use lightseq to accelerate MarianMTModel.
Is it a standard Transformer model? If it is, we can write an export script to export from Marian model to LightSeq model, and then infer without any modification.
MarianMTModel's architecture is basically same with BartForContionalGeneration. However it use swish activation instead of gelu.
Lightseq doesn't support MarianMTModel. https://github.com/bytedance/lightseq/issues/422 @dengcunqin @qiubinyang