Rohan Yadav comments

Results 50 comments of


                                            Rohan Yadav

*: trying out fixing the code generation

cc @RawnH i pushed some more code

How to generate cuda kernel with python bindings?

I don't think that the Python API is currently able to generate CUDA kernels. You will have to use the C++ API or the web tool to do so.

Release of taco and availability of PyPi

@fredrikbk do we have any plans for a new distribution of PyTaco? I don't think we have anyone working on this currently.

qcd.mul1 fails on Apple M1

I'm not sure anyone on the development team right now has an M1 mac to reproduce this issue. However, it looks like a small precision error that seems ignorable?

Compilation error for TTM with CSF,CSC as input formats and CSF output

Can you share the link of the web interface that led to the error (it includes the schedules and formats). Trying it myself, it looks like this particular case works.

Compilation error for TTM with CSF,CSC as input formats and CSF output

Thanks, I see the problem now.

scheduling: inefficient code generated for CPU spmv with pos split

I don't think this code is quite right yet (still working on it), but the idea seems fine -- we want to buffer changes to the same output location `i`...

scheduling: inefficient code generated for CPU spmv with pos split

I don't see an easy way to do this when trying to use multiple threads. It looks like you need some way of having a check after each parallel block...

lower: properly fix #355

I'm not sure what's the best way to test this since I need to compile with `simplify = false` for the invalid output to show up. Figuring out how to...

Uncaught exception when fusing sparse dimensions

Yes, I believe fusing without iterating over the position space is supported for only dense dimensions currently. I'm not sure what the best way of fusing co-iteration loops looks like.