neural-compressor icon indicating copy to clipboard operation
neural-compressor copied to clipboard

Smoothquant refactor for 3.x API

Open violetch24 opened this issue 1 year ago • 0 comments

Type of Change

Smoothquant refactor for 3.x API API changed

Description

  • [x] refactor new API - prepare/convert
  • [ ] add ut for new API, remove unnecessary old ones
  • [x] fix eager model/prepared model issue for old quantize API
  • [ ] support calib_fun rather than dataloader for auto-tuning
  • [ ] add ut for auto-tuning
  • [ ] modify 3.x sq example

Expected Behavior & Potential Risk

ut pass

How has this PR been tested?

Dependency Change?

violetch24 avatar May 13 '24 10:05 violetch24