reddyn12 comments

Results 32 comments of


                                            reddyn12

Upat bounty

> So this is unreviewable by reading. How have you tested that that's 0 functional changes? I tested against https://github.com/reddyn12/tinygrad/blob/upat_speed/test/test_pattern_matcher.py Unable to get it merged to master, but they pass.

[MLPERF] Retinanet

Beam 3 BS 52 OOM Beam 3 BS 32 1.3s but 37, 28, 28, 28 gb used in gpu This is still proportional to BEAM 2 BS 52 step 2s

[MLPERF] Retinanet

School is on a training run rn, no available compute. Can you do a run on a tiny tn for atleast 1 epoch? Expected scores: Epoch 1: 0.23 Epoch 2:...

[MLPERF] Retinanet

@chenyuxyz can you run to get a step-time for me to compare to?

[MLPERF] Retinanet

After the merge, getting NAN. Doing a run on a pre merge branch: https://wandb.ai/schlimeszn/RetinaNet/runs/0xl12h6e?nw=nwuserschlime. It's on epoch 3/4 and the first 2 epochs match Nvidia's submission. This is fp32, batch...

Should this symbolic test fail?

It errors: ``` python3 test/test_tensor_variable.py TestTensorVariable.test_symbolic_mean_2d_add F ====================================================================== FAIL: test_symbolic_mean_2d_add (__main__.TestTensorVariable.test_symbolic_mean_2d_add) ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/nikhil/Code/Fun/tinygrad/test/test_tensor_variable.py", line 68, in test_symbolic_mean_2d_add t = Tensor.ones(2, 2).contiguous().reshape(vv2+add_term, vv+add_term) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File...

reddyn12

Upat bounty

[MLPERF] Retinanet

[MLPERF] Retinanet

[MLPERF] Retinanet

[MLPERF] Retinanet

Should this symbolic test fail?

Add simple meshgrid impl

CUDA Jit Error

CUDA Jit Error

Mamba Implementation