Benchmarks
Benchmarks copied to clipboard
Tdey/intel xpu
Added callbacks to measure performance throughput more precisely. By default, it shows the throughput for each epoch. Updated the optimizer API for keras==2.12 and keras==3.13 version
LGTM
Is there anything in the performance callback that is Intel-specific? I may lift it into candle_lib unless there are objections (and after testing)
Is there anything in the performance callback that is Intel-specific? I may lift it into candle_lib unless there are objections (and after testing)
tested a few and it seems to work on my local machine.
OK. This branch is still using /common instead of candle_lib so I will not try to merge it. We should try to fix the offload syntax so that xpu, gpu etc are supported in the same code
LGTM
Thanks Venkat. I added you as a reviewer. I need approval to merge the patch.
Is there anything in the performance callback that is Intel-specific? I may lift it into candle_lib unless there are objections (and after testing)
No, nothing Intel-specific. This is more precise way for measuring performance. The same callbacks work on A100 too.