T5 MoE docs need updates

Open jokerwyt opened this issue 1 year ago • 5 comments

There is no description of T5 MoE support in docs/t5_guide.md. Updates are needed, thanks!

Another question: is the MoE support in examples/pytorch/t5/translate_example.py thoroughly tested? I found some suspicious bugs in it.

jokerwyt • Apr 17 '23 09:04

We don't have a public checkpoint to demo.

byshiue • Apr 17 '23 09:04

I see, thank you. But it's very important for us users to be able to follow this work. I'd appreciate it if you could provide some checkpoints and detailed docs about T5 MoE support.

jokerwyt • Apr 17 '23 09:04

Do the kernels even work? I initialized an MoE T5 with random weights, but I keep getting internal errors from the CUTLASS MoE GEMM kernel. Any thoughts? (image attached)

Also, Switch Transformers is pretty much the MoE version of T5. The weights are publicly available, for example on Hugging Face.

taehyunzzz • Jul 13 '23 11:07

Hello, has this problem been fixed? How can I use FT with T5-MoE? Please advise.

FlyingPotatoZ • Aug 22 '23 07:08

+1

WeiMa01 • Aug 22 '23 07:08