Results 2 comments of River

changing the version does solve the problem, thx~

Solved by "set SYCL_CACHE_PERSISTENT=1". By doing so, only the first time of inference will take a long time and following running will be much faster.