Jason Ansel

Results 199 comments of Jason Ansel

Test failure is `RuntimeError: Found Tesla M60 which is too old to be supported by the triton GPU compiler, which is used as the backend. Triton only supports devices of...

Another example from: ```py def fn(a, b): x = a + b x_view = x.view(dtype=torch.int16) return x_view.mul(2) ``` Generates: ```py import halide as hl from math import inf, nan @hl.generator(name="kernel")...

Yeah, it finishes and runs correctly.

An example schedule for a reduction would be super useful. The best I was able to write was still 3.5x slower than the baseline. For a reduction (sum, prod, max,...

@krastogi-in can you fix the lint errors?

Heads up this will have some conflicts with https://github.com/pytorch/pytorch/pull/137756