Vidyaranya

Results 2 issues of Vidyaranya

What changes are to be made to adapt the divided space time attention to a join space-time attention model?

This addresses Issue 642. When the stop token is \n\n the generation should stop after generation two new lines. Check the previous token that is generated and if it is...

cla signed