DeepSpeed
DeepSpeed copied to clipboard
Does deepspeed inference support PTQ?
I search in the doc and find setting for inference for model of QAT. Is there a function for inference of PTQ model?
has the same question ? any example?
Interested in this discussion - Any updates on this ?