ao icon indicating copy to clipboard operation
ao copied to clipboard

gemlite integration in torchao

Open HDCharles opened this issue 4 months ago • 2 comments

Summary:

  1. integrated gemlite kernels from https://github.com/mobiusml/gemlite
  2. updated kernel wrappers to work with compile (needed to change acc_dtype to torch.dtype equivalent and added basic op/fake_op registration)
  3. added bs>1 support to llama benchmark

on A100 results are not great for the kernel, likely needs further tuning before further integration makes sense.

Screenshot 2024-10-08 at 7 29 03 PM

Test Plan: benchmarks.sh

Reviewers:

Subscribers:

Tasks:

Tags:

HDCharles avatar Oct 08 '24 16:10 HDCharles