Benjamin Fineran
tested against:
* CPU, GPU, FP32, FP16
* Zoo and local models
* base and layer-dropped models
supports loading a recipe and model from sparsezoo, applying that recipe to the model, and then optionally converting it to a quantized torch model to run on CPU. `torch.quantization.convert` has...
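a minimal sketch of that flow, assuming SparseML's `ScheduledModifierManager` API; the paths here are placeholders:

```python
import torch
from sparseml.pytorch.optim import ScheduledModifierManager

# hypothetical paths; the model and recipe could also be pulled from sparsezoo
model = torch.load("model.pth")

# apply the recipe's modifiers to the model at their end state
manager = ScheduledModifierManager.from_yaml("recipe.yaml")
manager.apply(model)

# convert the fake-quantized modules into true quantized ops for CPU inference
quantized_model = torch.quantization.convert(model.eval(), inplace=False)
```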
default saved epoch for `one_shot` in the IC (image classification) flows is `-1` due to `Trainer` initialization. This will cause issues on model load, since the checkpoint recipe will be initialized to...
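a hedged sketch of one possible guard on load; the function and the "end state" convention below are illustrative assumptions, not the implemented fix:

```python
def resolve_checkpoint_epoch(saved_epoch: int) -> float:
    # one_shot never steps the Trainer, so the saved epoch stays -1;
    # treating -1 as "before epoch 0" would re-initialize the checkpoint
    # recipe from scratch instead of at its applied end state
    if saved_epoch < 0:
        return float("inf")  # assumed convention for "recipe already applied"
    return float(saved_epoch)
```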
currently, in runs of composed staged recipes, modifier finalization only occurs at the end of the entire run (after all stages). This may cause issues because when a stage is...
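a minimal sketch of per-stage finalization, using hypothetical `Stage`/`Modifier` objects to illustrate the alternative:

```python
def run_staged_recipe(stages, model):
    for stage in stages:
        for modifier in stage.modifiers:
            modifier.initialize(model)
        stage.run(model)
        # finalize this stage's modifiers now, so later stages see
        # cleaned-up hooks/state instead of waiting for the entire run
        for modifier in stage.modifiers:
            modifier.finalize(model)
```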
in collaboration with @anmarques. The goal of this PR is to add a pass that emulates the INT32 quantization of an FC layer's bias add, to accurately match what happens during...
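a sketch of the emulation idea; variable names are illustrative, but the scale convention (bias scale = input scale * weight scale, zero point 0) is the standard one for INT32 bias quantization:

```python
import torch

def emulate_int32_bias(bias, input_scale, weight_scale):
    # quantized runtimes store the bias in INT32 with
    # bias_scale = input_scale * weight_scale and zero point 0
    bias_scale = input_scale * weight_scale
    bias_q = torch.clamp(
        torch.round(bias / bias_scale),
        torch.iinfo(torch.int32).min,
        torch.iinfo(torch.int32).max,
    )
    # dequantize so the FP32 graph observes the same rounding error
    # the quantized runtime would introduce
    return bias_q * bias_scale
```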
README for the `deepsparse.license` tool proposed in #630. @jeanniefinks and Rob G to complete the TODOs
a core feature of the QuantizationModifier refactor is the ability for users to have both simpler and more fine-grained control over how quantization is applied at large and...
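a purely illustrative sketch of that UX; the argument names below are assumptions about the refactored API, not its final form:

```python
# (import path omitted; QuantizationModifier is the class under refactor)

# simple: accept a default scheme applied model-wide
modifier = QuantizationModifier(start_epoch=0.0)

# fine grained: override schemes per submodule and exclude some modules
modifier = QuantizationModifier(
    start_epoch=0.0,
    scheme={"weights": {"num_bits": 8, "symmetric": True}},
    scheme_overrides={"classifier": {"weights": {"num_bits": 4}}},
    ignore=["input_embeddings"],
)
```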
first PR for the QuantizationModifier refactor. Moves the existing modifier and tests into a "legacy" file. Creates a template object for the new modifier. To maintain backwards compatibility, we add support...
scales and zero points were not accounting for the correct groups when iterating over the input channel dimension
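a minimal sketch of the corrected indexing; tensor names are illustrative. With grouping along the input channel dimension, channel `c` must read scale/zero-point index `c // group_size`:

```python
import torch

def dequantize_grouped(w_q, scales, zero_points, group_size):
    # w_q: (out_channels, in_channels) integer weights
    # scales, zero_points: (out_channels, in_channels // group_size)
    out_c, in_c = w_q.shape
    group_idx = torch.arange(in_c) // group_size  # (in_c,) group per channel
    s = scales[:, group_idx]                      # broadcast to (out_c, in_c)
    z = zero_points[:, group_idx]
    return (w_q.float() - z.float()) * s
```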