sparseml [WIP] emualte bias QAT on linear forward pass

[WIP] emualte bias QAT on linear forward pass

Open bfineran opened this issue 3 years ago • 0 comments

in collaboration with @anmarques

goal of this PR is to add a pass to emulate the INT32 quantization of a FC layer's bias add to accurately match what happens during inference

TODO: testing, any necessary backwards pass changes

May 06 '22 17:05 bfineran