leejason
Results
13
issues of
leejason
trafficstars
> No, gpt-neo models are not compatible with this codebase Thank you very much for the update. How can I make it work if I modify the "6B_roto_256.json" with the...
Is it possible to train RWKV on TPU? If not, what would be needed?
Any suggestion for benchmarking CTRL with GPT-2? Say, loss value, PPL, or any metric to measure text generation quality?