ao
ao copied to clipboard
Create report for Autoquant
Summary
Autoquant will iterate through a user module and identify all linear dtype + shapes as well as execution time for different quantization routines. This information is baked into the final model output but it is not easily viewable.
We want to add an api for exposing this information.
We should add the option to generate human readable reports for a given autoquant run on a model. This will be useful for identifying strange autoquant behavior - why was quantX chosen over quantY.