sparseml [Roadmap] SparseML Roadmap Q1 2024

[Roadmap] SparseML Roadmap Q1 2024

Open mgoin opened this issue 1 year ago • 0 comments

Issues or PRs that the NM team are planning to tackle for this quarter:

SparseGPT one-shot pruning for Transformers
- [x] Support for text generation LLMs AutoModelForCausalLM https://github.com/neuralmagic/sparseml/pull/2010
- [ ] General model support AutoModel https://github.com/neuralmagic/sparseml/pull/2003
Saving compressed models
- [ ] Export sparse models w/ sparsity_config to Transformers
- [ ] Export weight-quantized models w/ quant_config to Transformers
- [ ] Export W8A8-quantized models w/ quant_config to Transformers

Feb 15 '24 15:02 mgoin