iree
iree copied to clipboard
GPUMaterializeEncoding: tune for narrow cases
The tile size selection heuristic in GPUMaterializeEncoding is focused on the generic case of non-narrow shapes; then at the end, a fix-up is applied to adjust to narrow shapes.
This isn't optimal, as this misses the possibility that when one dimension is narrow, we may want to go wider in another dimension.