chenyu issues

Results 80 issues of


                                            chenyu

Update eval loss in modern.py.

I would like to update the loss numbers in the comment section too, but I could not reproduce your numbers. For repro.py, I am getting 23 eval: split train. loss...

Simplify symbolic.SumNode.floordiv logic

Simplify the logic, also fix an inconsistency that `(a*6+b*6)//16 -> (((a*6)+(b*6))//16)` but `(a*6+b*6+16)//16 -> ((((a*3)+(b*3))//8)+1)`. With the change now `(a*6+b*6)//16 -> (((a*3)+(b*3))//8)` too.

Fix constant folding for Tensor([3])

#1178

Enable JIT test for METAL. Add METAL_NO_FAST_MATH flag.

Metal by default enables fast-math, which may violate IEEE 754 standard (https://developer.apple.com/documentation/metal/mtlcompileoptions/1515914-fastmathenabled). With fast-math, `rand * 2 * pi` (used in randn implementation) will be different from numpy about 30%...

diverse test value in test_dtype DATA based on dtype

added negative int for int case TODO: test this on real GPU. test passed on tinybox

WINO=1 python examples/beautiful_mnist.py has lower test_accuracy

I can hit `RuntimeError: Error Domain=MTLCommandBufferErrorDomain Code=1 "Discarded (victim of GPU error/recovery)` on M1 Max pretty consistently with `WINO=1 PYTHONPATH=. python examples/beautiful_mnist.py`. Seems fine with `JIT=2`. Also with `WINO=1` when...

chenyu

Update eval loss in modern.py.

Simplify symbolic.SumNode.floordiv logic

Fix constant folding for Tensor([3])

Enable JIT test for METAL. Add METAL_NO_FAST_MATH flag.

diverse test value in test_dtype DATA based on dtype

WINO=1 python examples/beautiful_mnist.py has lower test_accuracy

use true half for onnx

testing and reliability improvements

add real beam search to regular ci

beautiful_mnist does not train