leejason

Results 13 issues of leejason
trafficstars

> No, gpt-neo models are not compatible with this codebase Thank you very much for the update. How can I make it work if I modify the "6B_roto_256.json" with the...

Is it possible to train RWKV on TPU? If not, what would be needed?

Any suggestion for benchmarking CTRL with GPT-2? Say, loss value, PPL, or any metric to measure text generation quality?