
Reference implementations of MLPerf™ inference benchmarks

200 inference issues, sorted by recently updated

Change model name to a more explicit name "Stable Diffusion XL 1.0"

Issue Description: I manually downloaded model.pytorch and vocab.txt to the designated folder and ran `/home/user/cm/bin/python3 run.py --backend=pytorch --scenario=Offline --max_examples 10 --mlperf_conf '/home/user/CM/repos/local/cache/b737554800c84148/inference/mlperf.conf' --user_conf '/home/user/CM/repos/mlcommons@ck/cm-mlops/script/generate-mlperf-inference-user-conf/tmp/c8eb2a31fd70402a93daee688bf391fa.conf' --accuracy 2>&1 | tee /home/user/CM/repos/local/cache/454869b45fbf4f67/test_results/default-reference-gpu-pytorch-v2.2.1-default_config/bert-99/offline/accuracy/console.out`...

The `Loadgen built with uncommitted changes` error should be thrown only when loadgen-related files are modified. Currently it is thrown when any file in the inference repository is modified.
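One way the check could be scoped is to filter the dirty files down to those under the loadgen directory before deciding whether to warn. This is an illustrative sketch, not the actual loadgen build code; the `loadgen/` path prefix and function name are assumptions.

```python
def loadgen_files_modified(modified_paths, loadgen_prefix="loadgen/"):
    """Return only the modified files that live under the loadgen directory.

    The 'uncommitted changes' error would then fire only when this list
    is non-empty, instead of whenever any file in the repository is dirty.
    """
    return [p for p in modified_paths if p.startswith(loadgen_prefix)]


# A dirty working tree where only non-loadgen files changed: no warning.
print(loadgen_files_modified(["vision/classification/README.md", "mlperf.conf"]))  # []

# A change under loadgen/ would still trigger the warning.
print(loadgen_files_modified(["loadgen/loadgen.cc", "mlperf.conf"]))  # ['loadgen/loadgen.cc']
```

In a real build script, `modified_paths` could come from `git status --porcelain` run at the repository root.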

Hi there, I just saw the AI Engineer video. Inference looks really awesome. I've had this mini project in mind for years now: getting a model to tell how...

I believe there is no need to run the LLAMA2-70B model for 24576 samples. GPT-J-6B is run for 13368 samples, Stable Diffusion is run for 5000 samples, 3d-unet is run for...

Llama2-70b has a run_accuracy.py script whose last step calls another script, consolidate_results.py. That last step does not work: it expects to read output pkl files, but those...

Currently, loadgen over the network connects only to the same host. This fix makes it connect to external SUTs on the network.
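The change described above amounts to making the SUT address configurable instead of assuming the local host. A minimal sketch under that assumption; the `--sut-server` flag name and default port are hypothetical, not taken from the actual PR:

```python
import argparse


def parse_sut_address(argv=None):
    """Parse the SUT server address for a loadgen-over-the-network client.

    Previously the client effectively targeted localhost only; accepting
    an explicit address lets it reach an external SUT on the network.
    """
    parser = argparse.ArgumentParser()
    # Hypothetical flag; the real implementation may name it differently.
    parser.add_argument(
        "--sut-server",
        default="http://localhost:8000",
        help="Address of the SUT serving inference requests",
    )
    args = parser.parse_args(argv)
    return args.sut_server


print(parse_sut_address([]))  # falls back to the same-host default
print(parse_sut_address(["--sut-server", "http://192.168.1.50:8000"]))  # external SUT
```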