deepsparse
Sparsity-aware deep learning inference runtime for CPUs
If creating the V2 pipeline fails, it's helpful to print the error.
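A minimal sketch of the idea: log the failure reason before falling back, rather than swallowing it. The factory names here (`create_v2_pipeline`, `create_v1_pipeline`) are illustrative stand-ins, not the actual deepsparse API:

```python
import logging

logger = logging.getLogger(__name__)


def create_v2_pipeline():
    # Hypothetical stand-in for the V2 pipeline constructor.
    raise RuntimeError("V2 pipeline unsupported for this task")


def create_v1_pipeline():
    # Hypothetical stand-in for the legacy pipeline constructor.
    return "v1-pipeline"


def create_pipeline():
    """Try the V2 pipeline first; on failure, log the error and fall back."""
    try:
        return create_v2_pipeline()
    except Exception as err:
        # Surface why the V2 path failed instead of silently falling back.
        logger.warning("Could not create V2 pipeline: %s", err)
        return create_v1_pipeline()
```

With this, a user who hits the fallback sees the underlying V2 error in their logs instead of a silent downgrade.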
# Evaluator Move

This PR moves a few modules from `deepsparse.evaluation` to `sparsezoo.evaluation`.

## Motivation and Context

The moved modules provide a common interface for evaluating models. This interface can...
This is currently a hack, but it would be great to get a version of this into production so that we can use debug_analysis on the pipeline and pass real...
**Describe the bug**

I downloaded and tested the [yolov8-s-coco-pruned70_quantized](https://sparsezoo.neuralmagic.com/models/yolov8-s-coco-pruned70_quantized?hardware=deepsparse-c6i.12xlarge&comparison=yolov8-s-coco-base&tab=4) model from the SparseZoo. When I simply infer the ONNX model with ONNX Runtime, I get an average of 1.92 seconds (over...
Add ultrachat200k for perplexity eval
Hi. The paper describes 8-bit quantization combined with pruning, which is fantastic. My question: has any research been done on 4-bit quantization? Since GPU memory is notoriously expensive, 4-bit quantization...
**Describe the bug**

When I try to run the [example](https://github.com/neuralmagic/deepsparse/blob/main/docs/llms/text-generation-pipeline.md) LLM TextGeneration code, I get an assertion error. (Sorry for any formatting errors; if you have tips to make...
## Description

Adds tests for `Pipeline.run_async()`.

## Problem

Testing `run_async()` currently requires some hacking in tests/server; the Pipeline function's tests should be isolated.

## Solution

A simple pipeline running `run_async()`.

## Usage

```python3
inference_state...
```
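The isolated-test idea above can be sketched with a plain asyncio test against a fake pipeline, with no server involved. `FakePipeline` and its `run_async` implementation are assumptions for illustration, not the actual deepsparse `Pipeline` class:

```python
import asyncio


class FakePipeline:
    """Hypothetical stand-in for a Pipeline exposing run_async()."""

    def run(self, data):
        # Synchronous inference path (trivial transform for the test).
        return data.upper()

    async def run_async(self, data):
        # Dispatch the synchronous path to the event loop's default executor.
        loop = asyncio.get_running_loop()
        return await loop.run_in_executor(None, self.run, data)


async def test_run_async():
    pipeline = FakePipeline()
    result = await pipeline.run_async("hello")
    assert result == pipeline.run("hello")


asyncio.run(test_run_async())
```

Because the fake pipeline has no server dependency, the async path can be exercised directly in a unit test.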
Show a warning when overriding batch_size from 0 to 1 https://app.asana.com/0/1201735099598270/1206262288703592
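A sketch of the intended behavior, using a hypothetical normalization helper (not the actual deepsparse code): an unsupported `batch_size=0` is overridden to 1, and the override is surfaced to the user rather than applied silently.

```python
import warnings


def normalize_batch_size(batch_size):
    """Override an unsupported batch_size of 0 to 1, warning the caller."""
    if batch_size == 0:
        warnings.warn(
            "batch_size=0 is not supported; overriding to batch_size=1",
            stacklevel=2,
        )
        return 1
    return batch_size
```

Any other value passes through unchanged, so existing callers are unaffected.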