Ivan Tikhonov

14 comments by Ivan Tikhonov

@amyeroberts @Rocketknight1 Hi! Any updates on this? We are working on improving SDPA operation support on the OpenVINO side and are using these models to test our changes: phi3-vision - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct/blob/main/modeling_phi3_v.py#L52 orion-14b...

@praasz @mitruska Do we plan to introduce some "internal" specification for OV internal ops? Maybe not as formal as the one for the official opset. PagedAttention is covered by some presentation, for Rope...

@CuriousPanCake could you take a look?

This PR significantly increases load time (+50% for some models). The solution needs to be refined.