SpQR icon indicating copy to clipboard operation
SpQR copied to clipboard

Add Support for Efficient Inference

Open elvircrn opened this issue 1 year ago • 0 comments

This PR adds support for the following:

  • Efficient SPQR CUDA-based matvec kernel implementation for a subset of paramaters
  • Integration of said kernel for end-to-end inference
  • Kernel benchmarks
  • End-to-end inference demo and benchmarks

elvircrn avatar Sep 11 '24 12:09 elvircrn