taichi
taichi copied to clipboard
Add bfloat16 support
Concisely describe the proposed feature I'm trying to use Taichi to speed up traceformers-like neural network. I encounter float16 type overflow and have to use float32 where it is not necessary. The bfloat16 type can help with this, but it is unsupported in taichi now