Junyang Lin

Results 173 comments of Junyang Lin

It is not available for us to release the processed data. You can try downloading from the official website.

The related setup of this task is mentioned in the paper.

Never used this before. I think it is good enough, as it has 48GB memory. Maybe for huge models you should use relatively small batch sizes.

Did not try with this task by finetuning seriously. We'll figure our a solution in the near future.

I have provided the code but not the script. I'll update it soon.

Which version of pytorch are you using? This is because the problem of in-place operation in the new version of pytorch, for example, >=1.10

Not yet. Sorry about that. Would you mind telling us in which scenarios you need such models?

For what reason you consider about using topp sampling? For this repo, we do not have relevant experience. Perhaps it is still better to use beam search following our practice...

Yes, this is one significant problem of the current model. One way to tackle this problem is to compute the average probabilities of the output logits, and set a threshold...