
Reference implementations of MLPerf™ inference benchmarks

Results 200 inference issues

According to the requirements [matrix](https://onnxruntime.ai/), enabling GPU support requires installing onnxruntime-gpu.
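A quick way to see whether the installed package can use the GPU is to ask ONNX Runtime which execution providers it exposes. The sketch below is illustrative only and is not taken from the reference implementation: `choose_providers` and `available_providers` are hypothetical helper names, and it assumes the CUDA provider is the one of interest. With the CPU-only `onnxruntime` package, `CUDAExecutionProvider` is normally absent from the list.

```python
from typing import List, Sequence

# Provider names as reported by ONNX Runtime's Python API.
CUDA = "CUDAExecutionProvider"
CPU = "CPUExecutionProvider"


def choose_providers(available: Sequence[str], want_gpu: bool = True) -> List[str]:
    """Hypothetical helper: order providers, preferring CUDA when it is listed."""
    providers = []
    if want_gpu and CUDA in available:
        providers.append(CUDA)
    # Always keep the CPU provider as the fallback.
    providers.append(CPU)
    return providers


def available_providers() -> List[str]:
    """Hypothetical helper: list the providers of the installed onnxruntime build."""
    try:
        import onnxruntime as ort
    except ImportError:
        # Neither onnxruntime nor onnxruntime-gpu is installed.
        return []
    return list(ort.get_available_providers())


if __name__ == "__main__":
    found = available_providers()
    print("available:", found)
    print("selected:", choose_providers(found))
```

If `CUDAExecutionProvider` is missing from the printed list, the likely cause is that the CPU-only package is installed instead of onnxruntime-gpu, or that the CUDA libraries it needs cannot be loaded.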

root@0824378d443e:/inference/vision/classification_and_detection# ./run_local.sh pytorch ssd-mobilenet cpu
usage: main.py [-h] [--dataset {imagenet,imagenet_mobilenet,imagenet_pytorch,coco-300,coco-300-pt,coco-1200,coco-1200-onnx,coco-1200-pt,coco-1200-tf}] --dataset-path DATASET_PATH [--dataset-list DATASET_LIST] [--data-format {NCHW,NHWC}] [--profile {defaults,resnet50-tf,resnet50-pytorch,resnet50-onnxruntime,mobilenet-tf,mobilenet-onnxruntime,ssd-mobilenet-tf,ssd-mobilenet-pytorch,ssd-mobilenet-onnxruntime,ssd-resnet34-tf,ssd-resnet34-pytorch,ssd-resnet34-onnxruntime,ssd-resnet34-onnxruntime-tf}] [--scenario SCENARIO] [--max-batchsize MAX_BATCHSIZE] --model MODEL [--output OUTPUT] [--inputs INPUTS] [--outputs OUTPUTS]...

Though the reasoning for TEST04 is very good, the testing methodology looks quite bad. As I understand it, the maximum number of input samples one can use is the dataset size which...

inference v2.1 and backlog

From the Early Stopping proposal:

> Practical Impact
>
> The proposed change affects three types of systems. First, systems with overlatency percentiles less than the target tail latency will be able...

Early Stopping as implemented in LoadGen seems to have a bug: the overlatency count collapses to 1 as the total query count increases. It is likely that a very fast accelerator can...

The trained DLRM models require the corresponding feature dictionaries if they are used to measure accuracy. These dictionaries are small, but take a long time and much memory to prepare...

# language models -> bert -> accuracy-squad.py

```
import numpy as np
import six
import tokenization
from transformers import BertTokenizer
from create_squad_data import read_squad_examples, convert_examples_to_features
```

I follow the steps...

Hi, we can see that there are ONNX (`.onnx`) and TensorFlow (`.pb`) versions of the Resnet50-v1.5 model but a version for PyTorch (`.pt`) seems to be missing in the supported...

inference v1.0