
Reference implementations of MLPerf™ inference benchmarks

Results: 200 inference issues, sorted by recently updated

@attafosu @pgmpablo157321 please review and merge this.

In the v4.0 submission, we found in the **server** log that "result_token_throughput" is not reported properly, and most of the values are at the e-09 scale (@pgmpablo157321 feel free to check...
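
A quick way to confirm this is to scan the loadgen detail log for the key in question. Below is a minimal sketch, assuming the usual `mlperf_log_detail.txt` format where each record is a `:::MLLOG`-prefixed JSON object; the script itself is illustrative and not part of the reference implementation.

```python
import json
import sys

def token_throughputs(detail_log_path):
    """Yield every result_token_throughput value reported in a loadgen detail log."""
    with open(detail_log_path) as f:
        for line in f:
            # Detail-log records look like ':::MLLOG {"key": ..., "value": ..., ...}'
            if not line.startswith(":::MLLOG"):
                continue
            record = json.loads(line[len(":::MLLOG"):].strip())
            if record.get("key") == "result_token_throughput":
                yield record["value"]

if __name__ == "__main__":
    for value in token_throughputs(sys.argv[1]):
        # Values on the order of 1e-09 would reproduce the problem described above.
        print(f"result_token_throughput = {value:.3e}")
```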

As presented in https://docs.google.com/presentation/d/1Y_AKEJ6h1g5k3ntrL7nTazWw3xVDzJ_tjOGkLQ6VDMI/edit?usp=sharing, completed samples per second is a better representation of throughput than scheduled QPS. @pgmpablo157321 to help implement after the conclusion of v4.0
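
For reference, the two metrics differ only in what they count and over which window. A hedged sketch of the distinction (field names and helpers are made up for illustration, not the actual loadgen or checker code):

```python
def scheduled_qps(num_scheduled_queries, run_duration_s):
    """Scheduled QPS: queries loadgen issued per second, whether or not they finished."""
    return num_scheduled_queries / run_duration_s

def completed_samples_per_second(completion_times_s, samples_per_query=1):
    """Throughput based on work actually completed within the measured window."""
    window_s = max(completion_times_s) - min(completion_times_s)
    return len(completion_times_s) * samples_per_query / window_s
```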

[compliance_checker_log.txt](https://github.com/mlcommons/policies/blob/master/submission_rules.adoc#563-inference) inside the results directory is mentioned as a requirement by the submission rules but is not enforced by the submission UI.
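
One way to enforce this would be a simple existence check over the submission tree; a minimal sketch, assuming the file is expected directly inside each results directory as the issue describes (the helper name is hypothetical):

```python
from pathlib import Path

def dirs_missing_compliance_log(submission_root):
    """Return every 'results' directory under the submission tree that lacks
    the compliance_checker_log.txt the submission rules ask for."""
    return [
        results_dir
        for results_dir in Path(submission_root).rglob("results")
        if results_dir.is_dir()
        and not (results_dir / "compliance_checker_log.txt").exists()
    ]
```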

Hi @arjunsuresh and @gfursin, I am facing errors in the "run benchmark" step of the text-to-image benchmark.
```
user@AIMLPerf-NVMe:~/CM/repos/local/cache/57064143a0ce4ff2/inference/text_to_image/model$ cd $SD_FOLDER
user@AIMLPerf-NVMe:~/CM/repos/local/cache/57064143a0ce4ff2/inference/text_to_image$ python3 main.py --dataset "coco-1024" --dataset-path coco2014 --profile stable-diffusion-xl-pytorch --model-path...
```

For llama benchmarks, the submission checker uses tokens per second for Offline, but samples per second for Server. https://github.com/mlcommons/inference/blob/master/tools/submission/submission_checker.py#L1385 However, the summary.csv still [uses](https://github.com/mlcommons/inference/blob/master/tools/submission/submission_checker.py#L2543-L2544) samples/second as the header to report...
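
A consistent fix would derive the summary.csv header from the same model/scenario logic used to pick the metric; a hedged sketch of that idea (the function and mapping below are illustrative, not the actual submission_checker code):

```python
def throughput_unit(model, scenario):
    """Pick the unit to print in summary.csv so it matches the metric
    the submission checker actually evaluates for this model/scenario."""
    if model.startswith("llama") and scenario == "Offline":
        return "Tokens/s"   # checker scores llama Offline on token throughput
    if scenario in ("Offline", "Server"):
        return "Samples/s"  # other Offline/Server results, including llama Server
    return "Latency (ms)"   # single-stream style scenarios report latency instead
```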

There was a discussion on how to make Early Stopping more user-friendly in https://github.com/mlcommons/inference/issues/1095. That issue was closed, however, without the proposal making it into actual policy or implementation. And in...

**Command:**
```
cmr "run mlperf inference generate-run-cmds _submission" --quiet --submitter="MLCommons" --hw_name=default --model=bert-99 --implementation=reference --backend=pytorch --device=cuda --scenario=Offline --adr.compiler.tags=gcc --target_qps=1 --category=edge --division=open --env.CM_VERIFY_SSL=false
```
**OS Version:** Ubuntu 22.04 with kernel 6.5.0 **CUDA Version:** 12.0...

It would be good to fix the compilation warnings seen when building loadgen.
```
-- The C compiler identification is GNU 11.4.0
-- The CXX compiler identification is GNU 11.4.0
-- Detecting...
```