gauravjain14
gauravjain14
I tried the following `python3 example_1.py --cuda_idx=0` This run successfully and I could see that it was using the GPUs. When I tried the following `nvprof --print-gpu-trace python3 example_1.py --cuda_idx=0`...
I am looking for some pointers to get started with leveraging Triton to generate kernels for a custom hardware backend. I see there have been efforts made that support lowering...
I checked the codah dataset and I don't see the `res` key in there. Also sharing a snapshot of the dataset. ``` KeyError: 'res' 'res' ``` Source Code Path -...