GLEN BERTULFO
Hi Yang, is there a workaround to get 6 or 8 GPUs working?
Hi Yang, going back to 8 GPUs on Flex with a 32 attention head count: I reran on the same platform and verified this info when I did print(model) -- 32...
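(Side note: a common constraint in tensor-parallel sharding is that the attention head count must divide evenly by the GPU count, which is why 6 GPUs can fail where 8 succeed with 32 heads. A minimal sketch of that check, with illustrative config values matching Llama-2-7B's published shape rather than anything loaded from the actual checkpoint:)

```python
import json

# Illustrative config snippet; the real values live in the checkpoint's
# config.json (Llama-2-7B reports 32 attention heads, as print(model) confirmed).
config = json.loads('{"num_attention_heads": 32, "hidden_size": 4096}')

heads = config["num_attention_heads"]
for gpus in (6, 8):
    # Tensor parallelism shards attention heads across devices, so the
    # head count must be evenly divisible by the number of GPUs.
    divisible = heads % gpus == 0
    print(f"{gpus} GPUs: {heads} heads divisible -> {divisible}")
```

With 32 heads this reports that 6 GPUs do not divide evenly while 8 do, which would explain a 6-GPU failure on this model.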
Hi Yang, please see the attached output text file -- 8GPUs_llama2_7B.txt [8GPUs_llama2_7B.txt](https://github.com/intel-analytics/ipex-llm/files/14910582/8GPUs_llama2_7B.txt)
Hi Yang, an update: CPU memory does hit maximum utilization when running 8 GPUs with the Vicuna 33B model on the DUT. Please see the attached Vicuna 33B full text output log...
Hi Yang, from our debug sync you indicated that on the same machine your fellow team members were not seeing issues on the 8-GPU config. May I kindly ask for the...
Issue is resolved. Closing this ticket. Thank you team for your help.