jinsong-mao
@xuzhao9 I tried to use 4x A100-40G to avoid the OOM issue, but it looks like torchbench.py only uses one GPU's memory. I tried options like --device-index and --multiprocess; both failed. Do you have...
> We have tried this script on AMD GPUs and it works for LoRA and full fine-tuning. We have not tried bits-and-bytes. Can you share your flow/script on...
It's strange to get this feedback when running directly with the command: `docker run -it --gpus all ghcr.io/pytorch/torchbench:latest`. Thanks.
@xuzhao9 Thanks. With this workflow file, I can modify the Dockerfile so the volume mounts use my user ID and runner, and the flow now runs smoothly. There is...
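For anyone hitting the same permission issue, a minimal sketch of such an invocation (the `/workspace` mount target and the image tag are illustrative assumptions, not the image's documented layout):

```shell
# Run the TorchBench image with all GPUs visible, mounting the
# current directory into the container and matching the container
# user to the host user so files written to the mount stay owned
# by the runner instead of root.
docker run -it --gpus all \
  -v "$PWD:/workspace" \
  -u "$(id -u):$(id -g)" \
  ghcr.io/pytorch/torchbench:latest
```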
@xuzhao9 Hi xuzhao, if I want to run test_bench.py in fp16, how should I add the configs? Thanks.
Hi @xuzhao9, after running the tests and checking the output JSON file, it's very strange that the GPU clock frequency is only 765 MHz. I am running the test on...
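One way to check whether the GPU is throttled or just idling at a low clock is to query and optionally pin the SM clocks with `nvidia-smi` (a sketch; the 1410 MHz value is an illustrative A100 boost clock, not a measured one, and locking requires root on recent drivers):

```shell
# Show the current and maximum SM clocks to see how far below
# the ceiling the GPU is actually running.
nvidia-smi --query-gpu=clocks.sm,clocks.max.sm --format=csv

# Optionally lock the GPU clocks to a fixed value for stable,
# reproducible benchmark numbers (value is device-specific).
sudo nvidia-smi --lock-gpu-clocks=1410,1410

# Restore default clock behavior after benchmarking.
sudo nvidia-smi --reset-gpu-clocks
```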
How about running test_bench.py on ROCm + AMD GPUs?