iree icon indicating copy to clipboard operation
iree copied to clipboard

GPUMaterializeEncoding: tune for narrow cases

Open bjacob opened this issue 4 months ago • 0 comments

The tile size selection heuristic in GPUMaterializeEncoding is focused on the generic case of non-narrow shapes; then at the end, a fix-up is applied to adjust to narrow shapes.

This isn't optimal, as this misses the possibility that when one dimension is narrow, we may want to go wider in another dimension.

bjacob avatar Oct 21 '24 14:10 bjacob