inference
inference copied to clipboard
Reference implementations of MLPerf™ inference benchmarks
Hi @arjunsuresh **I am running the Resnet50 benchmark with the command:** cm run script --tags=run-mlperf,inference,_find-performance,_full,_r4.1-dev \ --model=resnet50 \ --implementation=nvidia \ --framework=tensorrt \ --category=edge \ --scenario=Offline \ --execution_mode=test \ --device=cuda \...
We need to update the [system_requirements](https://github.com/mlcommons/inference/blob/docs/docs/system_requirements.yml) file for all the reference implementations and then use the same in the [inference docs](https://docs.mlcommons.org/inference/)
 please help me out with this error.
https://github.com/mlcommons/inference/blob/2d2eb3081dcda64e766a28dc1b6fb9c40fb9eefa/tools/submission/submission_checker.py#L432C28-L432C32 should this be 8313 as per length of 405b official dataset?
The following issues appear when running the LLM reference implementation. Multiple GPUs issue: ``` (VllmWorkerProcess pid=1795) ERROR 12-03 18:49:03 multiproc_worker_utils.py:231] Exception in worker VllmWorkerProcess while processing method init_device: Cannot re-initialize...
The following issues appear when running the LLM reference implementation Dependencies in the docker container: ``` Collecting mistral-common>=1.4.4 (from mistral-common[opencv]>=1.4.4->vllm==0.6.3->-r requirements.txt (line 8)) Downloading mistral_common-1.5.0-py3-none-any.whl.metadata (4.6 kB) Downloading mistral_common-1.4.4-py3-none-any.whl.metadata (4.6...
GNN/R-GAT Inference task force agreed that the sorting of the nodes is disallowed. Sorting of the nodes reduces number of unique nodes and artificially increases the throughput up to 6x....
Hi, I tried to reproduce your baseline on mixtral-8x7b, but the accuracy on gsm8k is 72.00 instead of 73.66. Can you reproduce it? Also, what is the version of transformers?
Updated 1 pull request submission from previous pull requests (#1934, #1933, #1932, #1931), involving functionality of multi-gpu & multi-node for pytorch & migraphx backend. Submitted by SCC24 UCSD Team Zixian...
Hi, I'm facing some issues when i tried running the benchmark for 3d-unet. When i ran **make run RUN_ARGS="--benchmarks=3d-unet --scenarios=offline,server""** Got the errors, which is also the system i'm working...