Adding NLLB-200 - MoE - 54.5B for no language left behind

Open PierreColombo opened this issue 2 years ago • 5 comments

System Info

Hello @LysandreJik, thanks a lot for your work on No Language Left Behind.

Is there any plan to add the 54.5B model?

Kindest regards

Who can help?

No response

Information

  • [ ] The official example scripts
  • [ ] My own modified scripts

Tasks

  • [ ] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • [ ] My own task or dataset (give details below)

Reproduction

Improvement

Expected behavior

Improvement

PierreColombo avatar Jan 25 '23 15:01 PierreColombo

WDYT @ArthurZucker @younesbelkada given your work on MoEs?

LysandreJik avatar Jan 25 '23 19:01 LysandreJik

Sure, we can add this to the to-dos. @PierreColombo, could you add the link to the open-sourced checkpoints?

ArthurZucker avatar Jan 26 '23 13:01 ArthurZucker

Hi, thanks for your positive answer.

Code is here: https://github.com/facebookresearch/fairseq/tree/nllb

Checkpoints are here: https://tinyurl.com/nllb200moe54bmodel

Thanks!

PierreColombo avatar Jan 26 '23 13:01 PierreColombo

Hi all, this would be greatly appreciated! Thanks.

nunonmg avatar Jan 26 '23 13:01 nunonmg

Also cc @sheonhan re. NLLB

julien-c avatar Jan 26 '23 13:01 julien-c

+1, would love to see it!

gsarti avatar Jan 30 '23 13:01 gsarti

+1 here.

BrightXiaoHan avatar Feb 03 '23 01:02 BrightXiaoHan

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions[bot] avatar Feb 27 '23 15:02 github-actions[bot]

unstale?

julien-c avatar Mar 06 '23 12:03 julien-c

We went for the fairseq implementation :'(

PierreColombo avatar Mar 06 '23 12:03 PierreColombo

Friendly ping @ArthurZucker

sgugger avatar Mar 06 '23 14:03 sgugger

Yes! @sheonhan mentioned wanting to take this; otherwise I will gladly sprint!

ArthurZucker avatar Mar 06 '23 15:03 ArthurZucker

Since I'm working on the Image Completion Transformer at the moment, I might be blocking the folks who want to use it ASAP, so you should go ahead, @ArthurZucker!

sheonhan avatar Mar 06 '23 15:03 sheonhan