inference issues

Automated command for llama2-70b: Cannot take a larger sample than population

5

Hello mlcommons team, I want to run the "Automated command to run the benchmark via MLCommons CM" (from the example: https://github.com/mlcommons/inference/tree/master/language/llama2-70b), but I am getting the following error: ``` /root/mambaforge/bin/python3...

philross

Mixtral mlperf log file is having a new line which fails when loaded with the mlperf logger

1

When the Mixtral server latency constraints are not met, the submission checker is breaking with the below error. ``` File "/home/arjun/CM/repos/local/cache/f2ac2b26439f49be/inference/tools/submission/log_parser.py", line 44, in __init__ raise RuntimeError("Encountered invalid line: {:}".format(line))...

arjunsuresh
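
To illustrate the failure mode reported above, here is a minimal sketch of a tolerant log reader. It is not the submission checker's actual code. The `:::MLLOG` line prefix, the `parse_log_lines` name, and the `strict` flag are assumptions made for illustration; only the error message text comes from the traceback in the issue.

```python
import json

# Assumed log-line prefix; the issue text does not confirm it.
MARKER = ":::MLLOG"


def parse_log_lines(lines, strict=True):
    """Parse marker-prefixed JSON log lines (hypothetical helper).

    Returns (messages, invalid_lines). In strict mode, a marker line
    whose payload is not valid JSON raises RuntimeError, mirroring the
    error quoted in the issue. In non-strict mode it is collected.
    """
    messages = []
    invalid = []
    for raw in lines:
        line = raw.rstrip()
        if not line.startswith(MARKER):
            # Blank lines and unrelated output are skipped.
            continue
        payload = line[len(MARKER):]
        try:
            messages.append(json.loads(payload))
        except json.JSONDecodeError:
            if strict:
                raise RuntimeError(
                    "Encountered invalid line: {:}".format(line))
            invalid.append(line)
    return messages, invalid
```

With `strict=False`, a record that was split by a stray newline is reported in the second return value instead of aborting the whole parse, which makes it easier to locate the offending line in the log.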

Enable TEST04 and TEST05 for SDXL

6

We have already enabled TEST01 for SDXL - wasn't mandatory for v4.0 (because the proposal came late), but mandatory for v4.1. https://github.com/mlcommons/inference/pull/1574 NVIDIA has checked internally and SDXL can be...

nv-ananjappa

CM running failed when cloning from https://github.com/GATEOverflow/inference_results_v4.0.git

5

I installed CM successfully by following the guide at https://docs.mlcommons.org/ck/install/ and then followed https://docs.mlcommons.org/inference/benchmarks/language/bert/ to run the script below: cm run script --tags=run-mlperf,inference,_find-performance,_full \ --model=bert-99 \ --implementation=nvidia \ --framework=tensorrt...

Bob123Yang

Running Mixtral is producing the below warning - I guess the evaluation of the accuracy logs is now completed

``` WARNING:Mixtral-8x7B-Instruct-v0.1-MAIN:Accuracy run will generate the accuracy logs, but the evaluation of the log is not completed yet ```

arjunsuresh

Running Automated command for llama2-70b without downloading the model

4

Hello mlcommons team, I want to run the "Automated command to run the benchmark via MLCommons CM" (from the example: https://github.com/mlcommons/inference/tree/master/language/llama2-70b), but I do not want to download llama2-70b, since...

philross

Add GNN checkpoint link

1

pgmpablo157321

Get error message "unrecognized arguments: rocm" when running mlperf inference on ubuntu with rocm

1

The command used to run mlperf inference for resnet50 model on ubuntu with rocm is below: cm run script --tags=run-mlperf,inference \ --model=resnet50 \ --implementation=reference \ --framework=tensorflow \ --category=edge \ --scenario=Offline...

jerryzhaoc

Update index.md

1

Clarified the steps to follow. The prerequisite step was not clear, since it points to an external page.

PurushGupta

Tokens per sample upper limit for GPTJ

Is there any reason why we have an [accuracy upper limit for LLAMA2 Tokens per sample](https://github.com/mlcommons/inference/blob/master/tools/submission/submission_checker.py#L109) but not for GPT-J? It would be good to document the reason for users.

arjunsuresh
