Jason Ansel
Jason Ansel
Test failure is `RuntimeError: Found Tesla M60 which is too old to be supported by the triton GPU compiler, which is used as the backend. Triton only supports devices of...
Another example from: ```py def fn(a, b): x = a + b x_view = x.view(dtype=torch.int16) return x_view.mul(2) ``` Generates: ```py import halide as hl from math import inf, nan @hl.generator(name="kernel")...
Yeah, it finishes and runs correctly.
An example schedule for a reduction would be super useful. The best I was able to write was still 3.5x slower than the baseline. For a reduction (sum, prod, max,...
@pytorchbot merge
@pytorchbot merge
@krastogi-in can you fix the lint errors?
@pytorchbot merge
@pytorchbot merge
Heads up this will have some conflicts with https://github.com/pytorch/pytorch/pull/137756