simit Inefficient Matrix Multiplication on GPU

Inefficient Matrix Multiplication on GPU

Open fredrikbk opened this issue 8 years ago • 0 comments

The matrix multiplication code we emit is not tailored to GPU execution (no parallelism).

This will be fixed with the sparse tensor compilation theory, so we should consider leaving it to then. If someone really needs it we can add a custom solution.

Aug 25 '16 19:08 fredrikbk

simit simit copied to clipboard

Inefficient Matrix Multiplication on GPU

simit
simit copied to clipboard