[EPIC][CPU] Enable predictable performance on mixed-types GEMM using data-tiling
This EPIC tracks all the related work.
Tasks related to core functionality
Tasks
- [x] https://github.com/openxla/iree/issues/15680
- [x] https://github.com/openxla/iree/issues/15214
- [x] https://github.com/openxla/iree/issues/15613
- [x] https://github.com/openxla/iree/issues/14195
- [x] https://github.com/openxla/iree/pull/15558
- [ ] https://github.com/openxla/iree/issues/15681
Tasks related to performance improvements
- [x] https://github.com/openxla/iree/pull/15585
- [ ] https://github.com/openxla/iree/issues/15554
- [x] https://github.com/openxla/iree/pull/15651
- [ ] https://github.com/openxla/iree/issues/15441
- [x] linalg.matmul_transpose_b is not data-tiled
- [x] https://github.com/openxla/iree/issues/15600
- [ ] https://github.com/openxla/iree/issues/15611
llama2 specific tasks
Tasks
- [ ] https://github.com/openxla/iree/issues/15566
- [ ] https://github.com/openxla/iree/issues/15195
- [ ] Enable e2e integration tests
@Max191 I think you have some local patches and ideas that are required for mixed-types data-tiling work, could you add them to tasklist accordingly?
@bjacob please help update this if there are on-going/TODO tasks in your mind.
@MaheshRavishankar I created an epic to help us understand better what needs to be done for mixed-types data-tiling, and the work we've been working on.
Thank you all for all the awesome work!
For small tasks, adding a brief description to tasklist is good enough. For large tasks, it would be good if you can create an issue/epic. It's not necessary to do it now, but please help add a brief description. Thank you!