bhsueh_NV
bhsueh_NV
Can you try on latest main branch? If you still encounter same problem, please provide the reproduce steps to reproduce this issue, thanks.
We may have fixed some bugs affecting the stability. The latest branch also provide some similar features in the gpt model, they may be helpful. I am not sure the...
We only verified on the docker image we mention in the document.
For `pip install tensorflow`, you should use TF docker directly. For warning in pytorch docker, it is fine.
Close this bug because it is inactivated. Feel free to re-open this issue if you still have any problem.
We don't see the key "lm_head.weight" in the t5 model we test. The models we test are standard T5 like https://huggingface.co/t5-small. I guess the name of `lm_head` in the checkpoint...
Hi, Liudeep. The `lm_head.weight` converting is added in converter of latest release. https://github.com/NVIDIA/FasterTransformer/blob/main/examples/pytorch/t5/utils/huggingface_t5_ckpt_convert.py
Close this bug because it is inactivated. Feel free to re-open this issue if you still have any problem.
We will solve the issue 2 and 3 first. And for issue 1, we will take some time to check. Do you encounter any problem due to lacking -lmpi_cxx?
> > We will solve the issue 2 and 3 first. > > And for issue 1, we will take some time to check. > > Do you encounter any...