reddyn12 comments

Results 32 comments of


                                            reddyn12

copyin() broken in ops_cuda

CUDA=1 python3 examples/mamba.py With async - out of memory error within the load dict call. Also, weird your getting False for output matched on CLANG. The contiguous fixed that issue....

copyin() broken in ops_cuda

I'm dumb. Didn't see the prompt arg, ignore the output match part. If async works on tiny15, could it be a method that works for multi gpu setups? Cuz I...

copyin() broken in ops_cuda

I have 3080. I'll be home tmr and can check again then. I'm p sure I had cuda 12.4 when I nvidia-smi.

https://github.com/reddyn12/tinygrad/tree/mamba_new is the fresh branch @nimlgen. Still getting the same error: ``` reddyn@Nikhil-3080:/mnt/d/Code/tinygrad$ CUDA=1 python3 examples/mamba.py Special tokens have been added in the vocabulary, make sure the associated word embeddings...

copyin() broken in ops_cuda

same error with ``` CUDA=1 python3 examples/gpt2.py ```

copyin() broken in ops_cuda

mamba and gpt2 work when I use school's compute cluster. Have a strong feeling its single gpu system related

Error with Test Script

Btw, I had to comment out the gym import cuase there was a problem with that too.

Upat bounty

This branch timed: ```PYTHONPATH=. python3 test/test_pattern_matcher.py 0.84s user 0.31s system 251% cpu 0.459 total``` https://github.com/reddyn12/tinygrad/tree/upat_speed timed: ```PYTHONPATH=. python3 test/test_pattern_matcher.py 0.80s user 0.27s system 187% cpu 0.569 total``` Going to write...

Upat bounty

When #4744 gets merged, I'll convert tests to match UPat spec

Upat bounty

I realized testing pre UPat and post would be inconvenient if I'm writing the tests in UPat form. That's why I added dict form tests in #4744. Was the closing...