bhsueh_NV comments

Results 639 comments of


                                            bhsueh_NV

make TFusedMHAKernelFactory thread_local

This PR has bug when we run multi-GPU BERT with multi-thread, so we fix this issue in latest release directly. Thank you for the feedback and PR.

possible bug in len penalty -- assumption that all logits have the same sign?

Hi, kgimpel. Thanks for your fallback. This is really a bug. We will fix it in next release.

possible bug in len penalty -- assumption that all logits have the same sign?

Hi, kgimpel. This bug is fixed in latest main branch.

possible bug in len penalty -- assumption that all logits have the same sign?

Close this bug because it is inactivated. Feel free to re-open this issue if you still have any problem.

question about GPT-3's computational complexity

I don't know what input/output you use, what computing cost you expect, and what you want to ask.

Add TensorFlow Ops for T5

Thanks for your feedback, we will consider it.

Add TensorFlow Ops for T5

> @byshiue will the FT op be in the roadmap for the next release? TF op turns out to be faster than th op from the decoder(decoding) benchmark and is...

Possible Bug in Context Likelihood

Thank you for the comment and discussion. As you say, this is a bug, and we have fixed it in latest release.

Possible Bug in Context Likelihood

Close this bug because it is inactivated. Feel free to re-open this issue if you still have any problem.

Converting t5-3b plus size model to tensorrt

I see you set data_type to fp32, which requires 12 GB to store the model. In such case, bs 32 + beam width 4 + sequence length 128 may be...