RWKV-LM
RWKV on TPU?
Is it possible to train RWKV on TPU? If not, what would be needed?
On NVIDIA GPUs you need a CUDA kernel for efficient training. I wonder if we can do something similar for TPUs.
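One possible starting point, offered as a rough sketch rather than a tested port: express the time-mixing recurrence as a `jax.lax.scan`, which XLA can compile for TPU. The code below assumes the RWKV-4 style WKV recurrence (per-channel decay `w`, bonus `u`, keys `k`, values `v`). The function name `wkv_scan` and the shapes are my own choices, not something from the repo, and this naive form skips the max-exponent trick that is normally needed for numerical stability. Whether a plain scan is fast enough on TPU is an open question.

```python
import jax
import jax.numpy as jnp


def wkv_scan(w, u, k, v):
    """Naive WKV recurrence over time (assumed RWKV-4 form, not stabilized).

    k, v: arrays of shape (T, C); w, u: arrays of shape (C,).
    w is the (positive) per-channel decay, u the bonus for the current token.
    """

    def step(carry, kv):
        a, b = carry                      # running numerator / denominator, shape (C,)
        k_t, v_t = kv
        e_uk = jnp.exp(u + k_t)           # weight of the current token
        out = (a + e_uk * v_t) / (b + e_uk)
        decay = jnp.exp(-w)
        e_k = jnp.exp(k_t)
        a = decay * a + e_k * v_t         # fold the current token into the state
        b = decay * b + e_k
        return (a, b), out

    init = (jnp.zeros_like(w), jnp.zeros_like(w))
    _, out = jax.lax.scan(step, init, (k, v))  # scans along the leading (time) axis
    return out


# Hypothetical usage: jit-compile once; gradients come from jax.grad through the scan.
wkv_jit = jax.jit(wkv_scan)
```

Since each output is a weighted average of the values seen so far, a quick sanity check is to pass constant `v` and confirm the output is the same constant.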