optimum
optimum copied to clipboard
Rework ORTQuantizer & ORTOptimizer to easier work with already converted `onnx` checkpoints and add support for seq2seq models (RFC)
ORTQuantizer
- [x] remove hard transformers dependency
- [x] rework
from_pretrainedto includefrom_transformers,ORTModelForXXXor a path to amodel.onnxfile - [x] add
file_nametofrom_pretrained - [x] add better docstrings with example
- [x] align parameters for method, e.g.
export - [x] add more tests for creating
ORTQuantizer