
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Results: 165 sparseml issues

This code updates an existing but unused model analyzer (`AnalyzerModule`) object that computes forward FLOPs, parameters, prunable parameters, and zeroed parameters model-wide. Note that this is somewhat redundant with the...

Code that I used to evaluate the Llama-2-7b model on the CNN/DailyMail dataset with the ROUGE score.
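The evaluation code itself is not shown in the listing; as a rough illustration of the metric involved, here is a minimal pure-Python sketch of ROUGE-1 F1 (unigram overlap between a generated summary and a reference). This is a simplification, not the `rouge_score` library or the code from the issue:

```python
from collections import Counter

def rouge1_f1(candidate: str, reference: str) -> float:
    """ROUGE-1 F1: harmonic mean of unigram precision and recall."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

score = rouge1_f1("the cat sat on the mat", "the cat lay on the mat")
# 5 of 6 unigrams overlap in both directions, so F1 = 5/6
```

Real evaluations would also report ROUGE-2 and ROUGE-L with stemming and bootstrap confidence intervals.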

## Short-term Work

Issues or PRs that the NM team is planning to tackle this quarter:

* SparseGPT one-shot pruning for Transformers
* [x] Support for text generation LLMs...

The PR adds support for utilizing `HistogramObserver` from PyTorch, which computes the min/max values for quantization by minimizing quantization error. The implementation has been tested on CodeLlama and Llama-2 models.
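Whatever the observer, the end product is a scale and zero-point for the quantized range. As a hedged illustration (not the PR's code, and not PyTorch's histogram-search logic, which additionally refines min/max to minimize quantization error), here is how affine uint8 quantization parameters are derived from an observed min/max:

```python
def calc_qparams(x_min: float, x_max: float, qmin: int = 0, qmax: int = 255):
    """Compute an affine (scale, zero_point) pair for uint8 quantization
    from an observed float range, as a min/max-based observer would."""
    # The range must contain zero so that 0.0 quantizes exactly.
    x_min, x_max = min(x_min, 0.0), max(x_max, 0.0)
    scale = (x_max - x_min) / (qmax - qmin)
    if scale == 0.0:
        return 1.0, qmin  # degenerate all-zero tensor
    zero_point = round(qmin - x_min / scale)
    zero_point = max(qmin, min(qmax, zero_point))  # clamp into range
    return scale, zero_point

scale, zp = calc_qparams(-1.0, 3.0)
# scale = 4/255; zero_point = round(1.0 / scale) = 64
```

A histogram-based observer improves on this by searching for a clipped (min, max) whose quantization error over the observed histogram is minimal, which matters for long-tailed LLM activation distributions.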

Add support for loading Transformers models without specifying task attributes. This is especially useful for exporting models for embedding extraction. This is currently accessed via the `"model"` or `"base"` task - I'm...

Bumps [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 0.24.2 to 1.0.1.

**Release notes** (sourced from scikit-learn's releases): scikit-learn 1.0.1 - "We're happy to announce the 1.0.1 release with several bugfixes." You can see the changelog here:...

dependencies

**Describe the bug**
`RecursionError: maximum recursion depth exceeded while getting the str of an object`

**Expected behavior**
I want to convert a LLaMA model into ONNX and then benchmark...

bug
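The issue does not show a fix, and the root cause may lie elsewhere (e.g. a `__repr__` that recurses into itself). A common first-line workaround for deep model graphs overflowing Python's default recursion limit is to raise it temporarily; this helper is a hypothetical sketch, not code from the issue:

```python
import sys

def with_recursion_limit(limit, fn, *args, **kwargs):
    """Run fn under a temporarily raised recursion limit, restoring
    the previous limit afterwards even if fn raises."""
    old = sys.getrecursionlimit()
    sys.setrecursionlimit(limit)
    try:
        return fn(*args, **kwargs)
    finally:
        sys.setrecursionlimit(old)

def depth(n):
    # Deeply recursive stand-in for a graph traversal during export.
    return 0 if n == 0 else 1 + depth(n - 1)

# depth(2000) would exceed CPython's default limit of 1000;
# under a limit of 5000 it completes.
result = with_recursion_limit(5000, depth, 2000)
```

If raising the limit merely delays the error, the recursion is likely unbounded and the fix belongs in the traversal itself.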

Additional tests to ensure Top-KAST is working as intended. Bugfix: when computing weight decay for the backwards-only weights (set B in the paper), the multiplier should be proportional to 1/(the...

Not ready for prime time, but it does make LLM export much more memory-efficient and faster.