Kunwar Grover
Kunwar Grover
This is looking much nicer with the vector.contract distribution. Can you move that to a seperate patch if possible?
AFAIU my patch should also take care of SPIRV
Sure, will do tomorrow
> > AFAIU my patch should also take care of SPIRV > > Checked out the #16927 PR and still see the same error @Groverkss I'll have a look and...
@kuhar I looked into this more. Currently, the cooperative matrix path is reusing the MMA heuristics deduction which my patch adds the shared memory check on. I will send a...
> I'm thinking of adding this part to the description of `gpu.subgroup_reduce`: > > > If a `cluster_size` is provided, the subgroup is divided into clusters of `cluster_size` lanes each,...
Please only review the last commit. Other commits are the patches this patch depends on.
Closing in favour of https://github.com/iree-org/iree/pull/17773
> Is there a way you can avoid maybe decomposing attention operation until vector distribution and handle the layout distribution for attention directly? Probably not… if we do that, we...
There are some tests that exceed shared memory, so i'm going to wait for https://github.com/iree-org/iree/pull/18415 to land before i land this.