qazal issues

Results 58 issues of


                                            qazal

multioutput kernels

This diff #4030 deletes the "can only have one output buffer" constraint - ```py # can only have one output buffer # can only reduce contiguous # max one reduceop...

bounty locked

proposal: multioutput JIT spec

I think JIT should use Tensor.corealize to let the scheduler fuse outputs, the user can .realize() to opt-out of the fusion. will enable the tests once multioutput is merged.

Fuzz all permutations of schedule

`FUZZ_SCHEDULE=1 DEBUG=2 python3 test/test_multitensor.py TestMultiTensor.test_simple_add_X` need to take more time with: - [x] What is the right abstraction? fuzzer.py could merge with realize.py -> moved to /tests - [x] Seedless`.randn`...

minimal diff for multioutput reduce pairs

Trains beautiful_mnist with ~25% less kernels. this diff: https://tiny-tools-client.vercel.app/?id=2edc5abc69f74cc1ae5eb3b25a9ac292 master: https://tiny-tools-client.vercel.app/?id=0c8240da3a9b4eef8759987ef3df4708

Deterministic schedule order

Tested by fuzzing stdout of `for si in schedule: print(si.outputs[0])` in `test/test_multitensor.py TestMultiTensor.test_simple_add_X `https://gist.github.com/Qazalin/cd7d88ba1b221ed58b46ea4a091f3a89 There can be multiple valid topological sorts of a DAG. In this graph https://tiny-tools-client.vercel.app/?id=785de65a9e5246ae9c6f651a3f96d453 Any ordering...

qazal

multioutput kernels

proposal: multioutput JIT spec

Fuzz all permutations of schedule

minimal diff for multioutput reduce pairs

Deterministic schedule order

process replay benchmarks

UOpGraph fuzzer

Fix unsafe pads fusing with shrink

Fuse double expands

generic double expand fusion