inference issues

Results 200 inference issues

Sort by recently updated

Update install onnxruntime in GettingStarted.ipynb

According to the requirement [Matrix](https://onnxruntime.ai/), for gpu enable should install onnxruntime-gpu

main.py: error: argument --profile: expected one argument

root@0824378d443e:/inference/vision/classification_and_detection# ./run_local.sh pytorch ssd-mobilenet cpu usage: main.py [-h] [--dataset {imagenet,imagenet_mobilenet,imagenet_pytorch,coco-300,coco-300-pt,coco-1200,coco-1200-onnx,coco-1200-pt,coco-1200-tf}] --dataset-path DATASET_PATH [--dataset-list DATASET_LIST] [--data-format {NCHW,NHWC}] [--profile {defaults,resnet50-tf,resnet50-pytorch,resnet50-onnxruntime,mobilenet-tf,mobilenet-onnxruntime,ssd-mobilenet-tf,ssd-mobilenet-pytorch,ssd-mobilenet-onnxruntime,ssd-resnet34-tf,ssd-resnet34-pytorch,ssd-resnet34-onnxruntime,ssd-resnet34-onnxruntime-tf}] [--scenario SCENARIO] [--max-batchsize MAX_BATCHSIZE] --model MODEL [--output OUTPUT] [--inputs INPUTS] [--outputs OUTPUTS]...

chenyuxin1999

TEST04 should have reasonable runtime

Though the reasoning for TEST04 is very good the methodology of testing looks quite bad. As I understand the maximum input sample one can use is the dataset size which...

arjunsuresh

inference v2.1 and backlog

Early Stopping and test pass/fail

From Early Stopping proposal: > Practical Impact > The proposed change affects three types of systems. First, systems with overlatency percentiles less than the target tail latency will be able...

nv-jinhosuh

Early Stopping implementation might have a bug

Early Stopping implemented in LoadGen seems to have a bug: overlatency count collapses to 1 as the total query count increases. It is likely that a very fast accelerator can...

nv-jinhosuh

Trained DLRM models are not useful for measuring accuracy

The trained DLRM models require the corresponding feature dictionaries if they are used to measure accuracy. These dictionaries are small, but take a long time and much memory to prepare...

diyessi

can't find model "tokenization"

# language models -> bert -> accuracy-squad.py ``` import numpy as np import six import tokenization from transformers import BertTokenizer from create_squad_data import read_squad_examples, convert_examples_to_features ``` I follow the steps...

mexiQQ

Resnet50-v1.5 pytorch model

Hi, we can see that there are ONNX (`.onnx`) and TensorFlow (`.pb`) versions of the Resnet50-v1.5 model but a version for PyTorch (`.pt`) seems to be missing in the supported...

corehalt

inference v1.0

inference
inference copied to clipboard

Metadata

Xiyon/add cb for targetqps

Update install onnxruntime in GettingStarted.ipynb

Fix broken link

main.py: error: argument --profile: expected one argument

TEST04 should have reasonable runtime

Early Stopping and test pass/fail

Early Stopping implementation might have a bug

Trained DLRM models are not useful for measuring accuracy

can't find model "tokenization"

Resnet50-v1.5 pytorch model

← Metadata

Owner

Metadata

inference inference copied to clipboard

Metadata

← Metadata

Owner

Metadata

inference
inference copied to clipboard