Anton Lokhmotov
I can think of a situation in which an implementer refactors or integrates a reference script into their own script. For example, the reference script may hardcode using `/usr/bin/python3`, while they may want...
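To make this concrete, a minimal sketch of the kind of change an implementer might make: instead of a hardcoded interpreter path, reuse whichever interpreter is actually running (the script name below is illustrative, not from the reference code):

```python
import subprocess
import sys

# Instead of hardcoding the interpreter, e.g.
#   subprocess.run(["/usr/bin/python3", "accuracy.py", ...])
# reuse whichever interpreter is running this script, so virtualenv/conda
# environments are respected. "accuracy.py" is an illustrative name.
subprocess.run(
    [sys.executable, "accuracy.py", "--mlperf-accuracy-file", "mlperf_log_accuracy.json"],
    check=True,
)
```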
@arjunsuresh But you admit that in some cases it may not be straightforward:

> Yes, running the reference accuracy script standalone is fine, I believe. But this is not...
Yes, it is mandatory for the Closed division. However, for the reasons that you outlined, GPTJ might be dropped from MLPerf Inference too. (Normally, a benchmark needs to survive 4...
Hi @surbanqq! Reference code often supports only a single accelerator, but vendors optimize their submissions, including by scaling to multiple accelerators. In the case of NVIDIA, please take a look...
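To illustrate the scaling point, here is a minimal sketch of fanning queries out across several accelerators with PyTorch. This is not NVIDIA's actual harness; the replication and round-robin scheduling are placeholders for the much more sophisticated scheduling real submissions use:

```python
import copy
import torch

def replicate_model(model: torch.nn.Module):
    """Place one copy of the model on each visible GPU (CPU fallback for testing)."""
    n = torch.cuda.device_count()
    devices = [torch.device(f"cuda:{i}") for i in range(n)] or [torch.device("cpu")]
    return [(d, copy.deepcopy(model).to(d)) for d in devices]

@torch.no_grad()
def run(replicas, batches):
    """Round-robin batches across replicas; a real harness would pipeline and overlap."""
    results = []
    for i, batch in enumerate(batches):
        device, model = replicas[i % len(replicas)]
        results.append(model(batch.to(device)).cpu())
    return results
```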
# Benchmarking LoRA against baseline (no LoRA) throughput

We use NVIDIA's [GenAI-Perf](https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/perf_analyzer/genai-perf/README.html) tool to force fixed-length inputs and outputs to produce "heatmap" plots as below. On TPU-v6e and H100 instances,...
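As a rough sketch of how such a grid can be collected, the loop below sweeps fixed input/output lengths with GenAI-Perf. The model name, concurrency, and export file name are assumptions for illustration, and flag names may differ by version; check `genai-perf profile --help` for your installation:

```python
import itertools
import subprocess

MODEL = "llama-3-8b"            # illustrative model name
INPUT_LENS = [128, 512, 2048]   # prompt lengths (tokens)
OUTPUT_LENS = [128, 512, 2048]  # generation lengths (tokens)

for isl, osl in itertools.product(INPUT_LENS, OUTPUT_LENS):
    subprocess.run([
        "genai-perf", "profile", "-m", MODEL,
        # Zero stddev forces (approximately) fixed-length inputs and outputs.
        "--synthetic-input-tokens-mean", str(isl),
        "--synthetic-input-tokens-stddev", "0",
        "--output-tokens-mean", str(osl),
        "--output-tokens-stddev", "0",
        "--concurrency", "8",
        "--profile-export-file", f"profile_{isl}x{osl}.json",
    ], check=True)
```

Each run's throughput then fills one cell of the (input length, output length) heatmap.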
MlPref -> MLPerf?

> Actually we are using the reference scripts in the CM workflow itself and they work...
> Reference implementations are not practically usable. While it is not practical to support all the hardware, ideally we should have an object-oriented device where a new submitter should...
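A minimal sketch of the kind of object-oriented device abstraction being suggested, where a new submitter only implements a small interface. The names here are illustrative, not an actual MLPerf or KILT API:

```python
from abc import ABC, abstractmethod

class Device(ABC):
    """Illustrative device abstraction: a new submitter implements only this."""

    @abstractmethod
    def load(self, model_path: str) -> None:
        """Load and prepare the model for this hardware."""

    @abstractmethod
    def infer(self, batch):
        """Run one batch of queries and return predictions."""

class EchoDevice(Device):
    """Trivial stand-in backend so the harness can be exercised anywhere."""
    def load(self, model_path: str) -> None:
        self.model_path = model_path  # nothing to load for this stub

    def infer(self, batch):
        return [0 for _ in batch]     # dummy predictions

def run_benchmark(device: Device, model_path: str, batches):
    """Hardware-agnostic harness: the same loop works for any Device subclass."""
    device.load(model_path)
    return [device.infer(b) for b in batches]
```

A new submitter would then subclass `Device` for their hardware and reuse the same harness unchanged.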
> I think KILT will be useful, particularly if it supports more hardware backends other than Qualcomm.

We are planning to release more backends after the v3.1 round. Some...