iree icon indicating copy to clipboard operation
iree copied to clipboard

[LLVMGPU] Fix prefetching pass for nested loops

Open Groverkss opened this issue 4 months ago • 0 comments

The prefetch pass assumes that shared memory can be reused in the prologue. This may not be true when nested loops are involved, so we need to explicitly insert a barrier to ensure we can reuse this memory.

Groverkss avatar Oct 02 '24 13:10 Groverkss