hirayaku

Results: 2 issues by hirayaku

### Description

When I test strided loads in Pallas kernels with the CUDA backend, `pallas.load` seems to ignore the `step` of the slice argument. For example, the following code should return [0,...

bug
pallas

I am unable to reproduce the accuracy results reported in the paper for the IGB-large+SAGE model. I got 60-61% validation and test accuracy after 3 epochs, compared to 64.89% in the paper....

question