oneDPL
oneDPL copied to clipboard
Specialize work-group sizes for scan-based algorithms
This PR allows us to change the work-group size and iterations per work-item for the scan family of algorithms. We choose values for these parameters that optimize performance for devices of interest.