iree icon indicating copy to clipboard operation
iree copied to clipboard

[EPIC][CPU] Enable predictable performance on mixed-types GEMM using data-tiling

Open hanhanW opened this issue 2 years ago • 2 comments

This EPIC tracks all the related work.

Tasks related to core functionality

Tasks

  • [x] https://github.com/openxla/iree/issues/15680
  • [x] https://github.com/openxla/iree/issues/15214
  • [x] https://github.com/openxla/iree/issues/15613
  • [x] https://github.com/openxla/iree/issues/14195
  • [x] https://github.com/openxla/iree/pull/15558
  • [ ] https://github.com/openxla/iree/issues/15681

Tasks related to performance improvements

  • [x] https://github.com/openxla/iree/pull/15585
  • [ ] https://github.com/openxla/iree/issues/15554
  • [x] https://github.com/openxla/iree/pull/15651
  • [ ] https://github.com/openxla/iree/issues/15441
  • [x] linalg.matmul_transpose_b is not data-tiled
  • [x] https://github.com/openxla/iree/issues/15600
  • [ ] https://github.com/openxla/iree/issues/15611

llama2 specific tasks

Tasks

  • [ ] https://github.com/openxla/iree/issues/15566
  • [ ] https://github.com/openxla/iree/issues/15195
  • [ ] Enable e2e integration tests

hanhanW avatar Nov 17 '23 02:11 hanhanW

@Max191 I think you have some local patches and ideas that are required for mixed-types data-tiling work, could you add them to tasklist accordingly?

@bjacob please help update this if there are on-going/TODO tasks in your mind.

@MaheshRavishankar I created an epic to help us understand better what needs to be done for mixed-types data-tiling, and the work we've been working on.

Thank you all for all the awesome work!

hanhanW avatar Nov 17 '23 02:11 hanhanW

For small tasks, adding a brief description to tasklist is good enough. For large tasks, it would be good if you can create an issue/epic. It's not necessary to do it now, but please help add a brief description. Thank you!

hanhanW avatar Nov 17 '23 02:11 hanhanW