numba-dpex Lower than expected performance in blackscholes numpy implementation

Open adarshyoga opened this issue 1 year ago • 1 comments

The blackscholes numpy implementation in dpbench is ~26X slower than the corresponding kernel and prange implementations.

How to reproduce:

Follow the instructions to set up dpbench.
Run blackscholes: python -c "import dpbench; dpbench.run_benchmark(\"black_scholes\")"

Mar 20 '23 20:03 adarshyoga

The slowdown may be related to kernel launch overhead in the JitKernel custom dispatcher class. The overhead is especially noticeable with small problem sizes. The experimental.dispatcher.KernelDispatcher fixes the launch overhead.
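
To illustrate why a fixed launch cost matters most at small problem sizes, here is a rough sketch of a simple additive cost model. The function name and both timing constants are hypothetical and chosen only for illustration; they are not measurements from dpbench or numba-dpex.

```python
def launch_overhead_fraction(n_launches, overhead_per_launch_s,
                             compute_s_per_element, n_elements):
    """Share of total run time spent on launch overhead.

    Assumes a simple additive model: total = launches * overhead + compute.
    """
    overhead = n_launches * overhead_per_launch_s
    compute = compute_s_per_element * n_elements
    return overhead / (overhead + compute)


# Hypothetical numbers, for illustration only (not measured).
OVERHEAD_S = 100e-6          # assumed fixed cost per kernel launch
COMPUTE_S_PER_ELEM = 1e-9    # assumed compute cost per element

for n in (10**3, 10**5, 10**7):
    frac = launch_overhead_fraction(1, OVERHEAD_S, COMPUTE_S_PER_ELEM, n)
    print(f"n={n:>10,d}  overhead share = {frac:.1%}")
```

Under these assumed constants, the overhead share shrinks as the problem size grows, so comparing timings at several problem sizes can help tell whether launch overhead is the dominant factor.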

Can you please reevaluate with the new dispatcher?

Dec 20 '23 02:12 diptorupd
