sparseml issues

[MOE Quantization] Update transformers version to 4.40.0

1

We need to update the transformers version to support the Qwen2-MoE model; see https://github.com/huggingface/transformers/releases/tag/v4.40.0 _(it also fits our goal of continually matching the latest release)_ ## Important changes ####...

dbogunowicz

[GHA] Add workflow files to run weekly and nightly tests/run llama-7b models

- Blocked on k8s runners being available. Only AWS runners currently work.

dsikka

update

dsikka

test

rahul-tuli

Performance Degradation in YOLOv8s Model Exported to ONNX via SparseML's Exporter

10

**Describe the bug** Exporting the YOLOv8s model (pruned50-quant, model.pt from SparseZoo) via the ONNX exporter (sparseml.ultralytics.export_onnx) yields a model whose performance is noticeably worse than that of the ONNX model available in SparseZoo. **Expected...

rsazizov

bug

[GHA] Add steps to publish nightly wheel and build nightly container

# Summary - Add a step to publish the nightly wheel using the nm-action: https://github.com/neuralmagic/nm-actions/blob/main/actions/publish-whl/action.yml - Once the wheel is built, add a step to build the nightly container using...

dsikka

[WIP] Update/expand finetune tests

dsikka

GPTQ UX config groups support

1

This PR enhances the user experience of the `GPTQModifier` by allowing it to directly accept quantization-related arguments, such as `config_groups`. This change simplifies the configuration process, enabling users to specify...

rahul-tuli

Split SparseGPT and GPTQ modifiers

This PR introduces a structural change by separating concerns between quantization and sparsification. A new `GPTQModifier` is extracted from the existing `SparseGPTModifier`. This ensures that each class now has a...

rahul-tuli

Bump jinja2 from 3.0.1 to 3.1.4 in /research/information_retrieval/doc2query

Bumps [jinja2](https://github.com/pallets/jinja) from 3.0.1 to 3.1.4. Release notes Sourced from jinja2's releases. 3.1.4 This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise...

dependabot[bot]

dependencies
