
Benchmark latency differs between oneAPI 2024.0 and 2024.1

Open • kevin-t-tang opened this issue 1 year ago • 1 comment

Platform: Ubuntu 22.04 with Arc A770
Model: Meta-Llama-3-8B-Instruct

Config ① image source oneapi/2024.0

Config ② image source oneapi/2024.1

Config ①
0,meta-llama/Meta-Llama-3-8B-Instruct,476.44,14.67,0.0,1024-512,1,1024-512,1,sym_int4,True,118.7,4.849609375,N/A,N/A
1,meta-llama/Meta-Llama-3-8B-Instruct,1044.11,15.46,0.0,2048-512,1,2038-512,1,sym_int4,True,118.7,5.7109375,N/A,N/A
Config ②
,meta-llama/Meta-Llama-3-8B-Instruct,455.34,21.77,0.0,1024-512,1,1024-512,1,sym_int4,,10.28,5.896484375
,meta-llama/Meta-Llama-3-8B-Instruct,2656.5,23.43,0.0,2048-512,1,2038-512,1,sym_int4,,10.28,6.6171875
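To put a number on the gap between the two configs, here is a minimal Python sketch that divides the 2024.1 values by the 2024.0 values for each input-output setting. It assumes (this is not stated in the rows themselves) that the third and fourth comma-separated fields are the first-token latency and the per-token latency for subsequent tokens, which is how the ipex-llm all-in-one benchmark CSV is usually laid out. The names `ROWS`, `parse`, and `compare` are made up for illustration.

```python
# Rows copied verbatim from the results above. Fields are read by position
# because the Config 2 rows have no index value and fewer trailing columns.
ROWS = {
    "oneapi/2024.0": [
        "0,meta-llama/Meta-Llama-3-8B-Instruct,476.44,14.67,0.0,1024-512,1,1024-512,1,sym_int4,True,118.7,4.849609375,N/A,N/A",
        "1,meta-llama/Meta-Llama-3-8B-Instruct,1044.11,15.46,0.0,2048-512,1,2038-512,1,sym_int4,True,118.7,5.7109375,N/A,N/A",
    ],
    "oneapi/2024.1": [
        ",meta-llama/Meta-Llama-3-8B-Instruct,455.34,21.77,0.0,1024-512,1,1024-512,1,sym_int4,,10.28,5.896484375",
        ",meta-llama/Meta-Llama-3-8B-Instruct,2656.5,23.43,0.0,2048-512,1,2038-512,1,sym_int4,,10.28,6.6171875",
    ],
}


def parse(row):
    """Return (in-out tokens, first latency column, second latency column)."""
    fields = row.split(",")
    # ASSUMPTION: fields[2] is first-token latency and fields[3] is the
    # latency of each subsequent token; the issue does not label the columns.
    return fields[5], float(fields[2]), float(fields[3])


def compare(rows, old="oneapi/2024.0", new="oneapi/2024.1"):
    """Map each in-out setting to (first ratio, second ratio), new / old."""
    old_vals = {key: (a, b) for key, a, b in map(parse, rows[old])}
    new_vals = {key: (a, b) for key, a, b in map(parse, rows[new])}
    return {
        key: (new_vals[key][0] / old_vals[key][0],
              new_vals[key][1] / old_vals[key][1])
        for key in old_vals
        if key in new_vals
    }


if __name__ == "__main__":
    for key, (first, rest) in compare(ROWS).items():
        print(f"{key}: first column x{first:.2f}, second column x{rest:.2f}")
```

On these four rows the second latency column is roughly 1.5x higher under 2024.1 for both settings, and the first latency column is roughly 2.5x higher for the 2048-512 case while staying about the same for 1024-512.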

Could you please help double-check the difference in results between oneAPI 2024.0 and 2024.1? Thanks!

kevin-t-tang, Jun 18 '24 09:06

We don't officially support oneAPI 2024.1 yet and haven't benchmarked it. We will keep you updated when it is ready :)

hkvision, Jun 21 '24 03:06