DeepSpeed
modify the quantize.py file for efficiency
- Update the min/max calculation from torch.split to torch.amin/torch.amax for faster computation (see the first sketch below)
- Update the stochastic rounding computation logic (faster and cleaner); see the second sketch below:
  a. support both symmetric and asymmetric stochastic rounding at the PyTorch level
  b. reduce new tensor creation from 2 to 1
  c. support CPU tensors as well
- Change fp16 to fp32 to avoid overflow issues (see the last sketch below)
- Simplify some other logic for easier understanding
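
A minimal sketch of the first change, assuming values are grouped contiguously for per-group quantization; `group_min_max` and its arguments are illustrative names, not the actual quantize.py API:

```python
import torch

def group_min_max(x: torch.Tensor, num_groups: int):
    # Illustrative helper, not the real quantize.py API: reshape into
    # (num_groups, group_size) and reduce each row in a single kernel call,
    # replacing torch.split plus per-chunk .min()/.max() calls.
    groups = x.reshape(num_groups, -1)
    return torch.amin(groups, dim=1), torch.amax(groups, dim=1)
```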
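A sketch of the stochastic rounding change under the same caveat: `stochastic_round` is a hypothetical helper, and "2 to 1 tensor creations" is read here as drawing a single uniform noise tensor via torch.rand_like, which works on CPU as well as CUDA tensors:

```python
import torch

def stochastic_round(x: torch.Tensor, scale: torch.Tensor,
                     zero_point: torch.Tensor = None) -> torch.Tensor:
    # Hypothetical helper: symmetric SR passes zero_point=None, asymmetric
    # SR shifts by a zero point; both are handled at the PyTorch level.
    q = x * scale if zero_point is None else x * scale + zero_point
    # Adding uniform noise in [0, 1) before flooring rounds each value up
    # with probability equal to its fractional part; torch.rand_like is the
    # single new tensor allocated and supports CPU inputs.
    return torch.floor(q + torch.rand_like(q))
```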
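One way the fp16 overflow can appear, as a hedged example: the dynamic range max - min can exceed the fp16 limit (~65504) even when every element is individually representable, which is why intermediates are computed in fp32:

```python
import torch

x = torch.tensor([-60000.0, 60000.0], dtype=torch.float16)
print(x.max() - x.min())    # tensor(inf, dtype=torch.float16): 120000 overflows fp16
xf = x.float()              # cast to fp32 before computing the range
print(xf.max() - xf.min())  # tensor(120000.)
```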
Can one of the admins verify this patch?
Stale PR. quantize.py is quite different now and these changes are no longer relevant, so closing the PR.