[GPU] Add initial SDPA implementation
Details:
- Add an initial GPU implementation of SDPA (scaled dot-product attention)
Remaining tasks:
- Input/Output Transpose fusion support - https://github.com/openvinotoolkit/openvino/pull/24475
- Indirect inputs support
- GQA-related optimization (Broadcast fusion)
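
For reference, the computation that SDPA performs can be sketched in plain Python. This is only an illustration of the math, `softmax(Q·K^T · scale)·V` with the usual default `scale = 1/sqrt(d)`, and not the GPU kernel added in this PR. The attention mask is omitted for brevity, and the names `sdpa` and `broadcast_kv_heads` are hypothetical helpers introduced here. The second helper shows, under the same assumptions, the kind of K/V head broadcast that GQA models need before attention and that a Broadcast fusion would presumably fold into the kernel.

```python
import math


def softmax(row):
    # Numerically stable softmax over one row of attention scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def sdpa(q, k, v, scale=None):
    """Single-head attention on nested lists.

    q: [L][d], k: [S][d], v: [S][dv]. Returns [L][dv].
    Hypothetical reference helper; no mask, no batching.
    """
    d = len(q[0])
    if scale is None:
        scale = 1.0 / math.sqrt(d)  # assumed default scaling
    out = []
    for q_row in q:
        # Scaled dot product of this query with every key.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        weights = softmax(scores)
        # Weighted sum of the value rows.
        out.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v))
            for j in range(len(v[0]))
        ])
    return out


def broadcast_kv_heads(kv_heads, num_q_heads):
    """GQA sketch: repeat each K/V head so every query head has a partner."""
    group = num_q_heads // len(kv_heads)
    if group * len(kv_heads) != num_q_heads:
        raise ValueError("num_q_heads must be a multiple of the K/V head count")
    return [head for head in kv_heads for _ in range(group)]
```

With 4 query heads and 2 K/V heads, `broadcast_kv_heads` repeats each K/V head twice, so query heads 0 and 1 share the first K/V head and query heads 2 and 3 share the second.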