[Roadmap] SparseML Roadmap Q1 2024
Short-term Work
Issues or PRs that the NM team is planning to tackle this quarter:
- SparseGPT one-shot pruning for Transformers
  - [x] Support for text generation LLMs (AutoModelForCausalLM): https://github.com/neuralmagic/sparseml/pull/2010
  - [ ] General model support (AutoModel): https://github.com/neuralmagic/sparseml/pull/2003
- Saving compressed models
  - [ ] Export sparse models w/ sparsity_config to Transformers
  - [ ] Export weight-quantized models w/ quant_config to Transformers
  - [ ] Export W8A8-quantized models w/ quant_config to Transformers