
Inefficient Matrix Multiplication on GPU

Open fredrikbk opened this issue 8 years ago • 0 comments

The matrix multiplication code we emit is not tailored to GPU execution: it exposes no parallelism.

This will be fixed by the sparse tensor compilation theory, so we should consider leaving it until then. If someone really needs it sooner, we can add a custom solution.
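
For illustration only, a custom stopgap could look something like the sketch below. It is not the code Simit emits, and it assumes dense, row-major matrices, whereas Simit's matrices are generally sparse, so a real solution would need to follow the compiler's storage format. The names `matmulKernel`, `matmulGpu`, and `check` are hypothetical. The point is only the kind of parallelism currently missing: one GPU thread per output element.

```cuda
#include <cuda_runtime.h>
#include <stdexcept>
#include <vector>

// Hypothetical kernel (not Simit's emitted code): each thread computes one
// element C[row][col] of C = A * B. Assumes dense row-major storage, with
// A of size n x k, B of size k x m, and C of size n x m.
__global__ void matmulKernel(const float* A, const float* B, float* C,
                             int n, int k, int m) {
  int row = blockIdx.y * blockDim.y + threadIdx.y;
  int col = blockIdx.x * blockDim.x + threadIdx.x;
  if (row >= n || col >= m) return;  // guard threads past the matrix edge
  float sum = 0.0f;
  for (int i = 0; i < k; ++i) {
    sum += A[row * k + i] * B[i * m + col];
  }
  C[row * m + col] = sum;
}

// Turn a CUDA error code into an exception (sketch-level error handling;
// device buffers are not released if this throws).
static void check(cudaError_t err) {
  if (err != cudaSuccess) throw std::runtime_error(cudaGetErrorString(err));
}

// Hypothetical host wrapper: copies inputs to the device, launches one
// thread per output element, and copies the result back.
std::vector<float> matmulGpu(const std::vector<float>& A,
                             const std::vector<float>& B,
                             int n, int k, int m) {
  std::vector<float> C(static_cast<size_t>(n) * m);
  float *dA = nullptr, *dB = nullptr, *dC = nullptr;
  check(cudaMalloc(&dA, A.size() * sizeof(float)));
  check(cudaMalloc(&dB, B.size() * sizeof(float)));
  check(cudaMalloc(&dC, C.size() * sizeof(float)));
  check(cudaMemcpy(dA, A.data(), A.size() * sizeof(float),
                   cudaMemcpyHostToDevice));
  check(cudaMemcpy(dB, B.data(), B.size() * sizeof(float),
                   cudaMemcpyHostToDevice));

  dim3 block(16, 16);  // 256 threads per block; an arbitrary choice
  dim3 grid((m + block.x - 1) / block.x, (n + block.y - 1) / block.y);
  matmulKernel<<<grid, block>>>(dA, dB, dC, n, k, m);
  check(cudaGetLastError());

  // This blocking copy also waits for the kernel to finish.
  check(cudaMemcpy(C.data(), dC, C.size() * sizeof(float),
                   cudaMemcpyDeviceToHost));
  cudaFree(dA);
  cudaFree(dB);
  cudaFree(dC);
  return C;
}
```

Calling `matmulGpu(A, B, n, k, m)` with `A` and `B` as flattened row-major vectors returns the flattened product. A sparse version would instead assign threads to rows or nonzero blocks, which is the part the sparse tensor compilation work is expected to handle properly.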

fredrikbk · Aug 25 '16 19:08