ocannl
ocannl copied to clipboard
Enable mixed precision (e.g. softmax attention using single-precision, rest using half-precision)
It would probably mean emitting different instructions in Ndarray. Propagating the setting in Formula might be tricky since it needs to be partially top-down.