tvm
tvm copied to clipboard
[TIR] Add pass ManifestSharedMemoryLocalStage
Added a pass to insert local (cache) stage for the shared memory. It's similar to cache read but bypasses the limitation of int set analysis for compacting buffer region by inferring the buffer shape from the loop extents.
Ping @spectrometerHBH for a second review