Add Support for Efficient Inference

Open elvircrn opened this issue 1 year ago • 0 comments

This PR adds support for the following:

Efficient SPQR CUDA-based matvec kernel implementation for a subset of paramaters
Integration of said kernel for end-to-end inference
Kernel benchmarks
End-to-end inference demo and benchmarks

Sep 11 '24 12:09 elvircrn