Keren Zhou

Results 18 issues of Keren Zhou

To compute `(a * b) % c`, if `a * b` is greater than `max(int32_t)`, then the final result is incorrect. Reproducer: https://github.com/Jokeren/triton-samples/blob/main/int64-mod.py
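A minimal sketch of this failure mode in plain Python (not Triton code). It assumes the product is truncated to a 32-bit signed integer before the modulo is taken, which is one plausible way the result goes wrong. The helper names `wrap_int32`, `mulmod_int32`, and `mulmod_wide` are hypothetical and exist only for this illustration.

```python
def wrap_int32(x: int) -> int:
    """Return the value a 32-bit signed integer would hold after storing x."""
    x &= 0xFFFFFFFF  # keep only the low 32 bits
    # Reinterpret the top bit as the sign bit (two's complement).
    return x - 0x100000000 if x >= 0x80000000 else x


def mulmod_int32(a: int, b: int, c: int) -> int:
    """(a * b) % c where the product is truncated to int32 first (assumed bug)."""
    # Note: Python's % returns a non-negative result for positive c, which
    # differs from C-style remainder when the wrapped product is negative.
    return wrap_int32(a * b) % c


def mulmod_wide(a: int, b: int, c: int) -> int:
    """(a * b) % c computed without truncation, as a 64-bit product would allow."""
    return (a * b) % c


if __name__ == "__main__":
    a, b, c = 100_000, 100_000, 7  # a * b = 10**10, larger than 2**31 - 1
    print("truncated to int32:", mulmod_int32(a, b, c))
    print("wide product:      ", mulmod_wide(a, b, c))
```

The two functions agree whenever `a * b` fits in 32 bits and can disagree once it does not, which matches the behavior described in the issue.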

enhancement

Hi @ptillet, I started playing with Triton a few days ago. I have read through your tutorials on vec-add and matrix multiplication. I should say Triton is very convenient for...

enhancement

- int8 matmul is skipped because of precision problems
- If the kernel is in the warmup mode, we skip interpreter execution
- All interpreter-related tests have been moved under...

Should be 2 instead of 1, right?

Just curious about the perf comparison between this work and others that also adopt the `compile` interface to optimize the computation graph. See https://github.com/alibaba/BladeDISC/releases/tag/v0.3.0

Otherwise `pip install` will ignore Cython