Alex Li
Results
2
issues of
Alex Li
Thanks for the great work, especially the scaling curves! I'm particularly interested in how you trained the Transformer++ models. Is there any public codebase that you used to train them?...
Great work! Is it possible to released pretrained checkpoints of the other model sizes (e.g., S, B, L)? I'm aware that their generation performance is not that great. However, they'd...