deepsparse
Sparsity-aware deep learning inference runtime for CPUs
Tested against:
* CPU, GPU, FP32, FP16
* Zoo and local models
* base and layer-dropped models
Supports loading a recipe and model from SparseZoo, applying that recipe to the model, and then optionally converting it to a quantized torch model to run on CPU. `torch.quantization.convert` has...
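A minimal sketch of that flow, assuming the public sparseml and torch APIs; the `quantize_for_cpu` wrapper and the recipe path are hypothetical illustrations, not this PR's code:

```python
import torch
from sparseml.pytorch.optim import ScheduledModifierManager

def quantize_for_cpu(model: torch.nn.Module, recipe_path: str) -> torch.nn.Module:
    # Apply the recipe's modifiers (e.g. pruning / QAT fake-quant) to the model
    manager = ScheduledModifierManager.from_yaml(recipe_path)
    manager.apply(model)
    # Fold any fake-quant observers into real int8 modules for CPU execution
    model.eval()
    return torch.quantization.convert(model)
```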
Adding a Lambda deployment to the examples directory. It is very similar to the SageMaker deployment. The scope of this application encompasses automating: 1. Construction of a...
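As a rough illustration of what such a Lambda entry point could look like (the task name, model path, and event shape here are assumptions, not the example's actual code):

```python
import json
from deepsparse import Pipeline

# Compile once per container so warm Lambda invocations reuse the engine
# (task and model_path below are assumed placeholders)
pipeline = Pipeline.create(task="text-classification", model_path="/opt/model")

def handler(event, context):
    # Assumed event shape: {"body": "{\"sequences\": [...]}"}
    body = json.loads(event["body"])
    output = pipeline(sequences=body["sequences"])
    return {"statusCode": 200, "body": json.dumps({"labels": output.labels})}
```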
Hello, I am keen to convert my quantized trained ONNX model into a blob file. OpenVINO, which is what I've been using so far, does not currently support this. Is...
Note: Not integrated into the server yet. The main hook is `start_file_watcher`, which the server calls to kick off a watcher process. Everything else is just helpers for that. The file...
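A self-contained sketch of what a `start_file_watcher` helper might look like, using stdlib polling in a background process; the queue-based notification and poll interval are assumptions, and the PR's actual helpers may differ:

```python
import multiprocessing
import os
import time

def _watch(path: str, interval: float, changes: multiprocessing.Queue) -> None:
    # Poll the file's mtime and report any change onto the queue
    last_mtime = os.path.getmtime(path) if os.path.exists(path) else None
    while True:
        time.sleep(interval)
        mtime = os.path.getmtime(path) if os.path.exists(path) else None
        if mtime != last_mtime:
            last_mtime = mtime
            changes.put(path)

def start_file_watcher(path: str, interval: float = 1.0):
    """Kick off a watcher process; returns (process, queue of change events)."""
    changes: multiprocessing.Queue = multiprocessing.Queue()
    proc = multiprocessing.Process(
        target=_watch, args=(path, interval, changes), daemon=True
    )
    proc.start()
    return proc, changes
```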
**Describe the bug** As in the title, running with `-s async` uses more cores than requested via `-ncores`. For example, with `deepsparse.benchmark oBERT-MobileBERT_14layer_50sparse_block4_qat.onnx -e onnxruntime -ncores 8 -s async`...
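For contrast, a sketch of the Python-API equivalent of the core cap on the default deepsparse engine (not the onnxruntime backend the report exercises); `compile_model`'s signature here is based on the public deepsparse API and may differ across versions:

```python
from deepsparse import compile_model

# num_cores is expected to cap the engine's worker threads;
# the model filename is taken from the report above
engine = compile_model(
    "oBERT-MobileBERT_14layer_50sparse_block4_qat.onnx",
    batch_size=1,
    num_cores=8,
)
print(engine)  # inspect the compiled engine's settings
```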
**Is your feature request related to a problem? Please describe.** Usage under Windows 10. **Describe the solution you'd like** Support for Windows 10.
README for the `deepsparse.license` tool proposed in #630. @jeanniefinks and Rob G to complete TODOs.
This shows users how to use the `/deployment` directory of a model inside Docker. Test plan: run the example docker build command from the README.