Alex Li

Results 2 issues of Alex Li

Thanks for the great work, especially the scaling curves! I'm particularly interested in how you trained the Transformer++ models. Is there any public codebase that you used to train them?...

Great work! Is it possible to released pretrained checkpoints of the other model sizes (e.g., S, B, L)? I'm aware that their generation performance is not that great. However, they'd...