sparseml
[WIP] emulate bias QAT on linear forward pass
in collaboration with @anmarques
The goal of this PR is to add a pass that emulates the INT32 quantization of an FC layer's bias add during QAT, so that training accurately matches what happens during inference.
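
To make the idea concrete, here is a minimal sketch of what emulating the INT32 bias quantization could look like. It is not the code in this PR. The function names (`fake_quantize_bias`, `linear_forward`) are hypothetical, and it assumes per-tensor scales with a bias scale of `input_scale * weight_scale` and a zero point of 0, which is the usual convention for integer FC kernels. Plain Python lists are used instead of tensors to keep it self-contained.

```python
INT32_MIN = -(2**31)
INT32_MAX = 2**31 - 1


def fake_quantize_bias(bias, input_scale, weight_scale):
    """Round-trip a float bias through a symmetric INT32 grid.

    Assumption (not taken from the PR): the bias scale is
    input_scale * weight_scale and the zero point is 0.
    """
    bias_scale = input_scale * weight_scale
    emulated = []
    for b in bias:
        q = round(b / bias_scale)              # quantize (ties round to even)
        q = max(INT32_MIN, min(INT32_MAX, q))  # clamp to the INT32 range
        emulated.append(q * bias_scale)        # dequantize back to float
    return emulated


def linear_forward(x, weight, bias, input_scale, weight_scale):
    """Compute x @ W^T + b for one input vector, with the bias fake-quantized."""
    b_q = fake_quantize_bias(bias, input_scale, weight_scale)
    return [
        sum(xi * wi for xi, wi in zip(x, row)) + b
        for row, b in zip(weight, b_q)
    ]
```

With `input_scale=0.5` and `weight_scale=0.25`, the bias grid has a step of 0.125, so a bias of 0.3 is seen by the layer as 0.25. That rounding error is what the emulation exposes to training.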
TODO: testing, any necessary backwards pass changes