Make key optional in ipex.llm.functional.rotary_embedding
Context: vLLM has a PR to make key arg in rotary embedding optional (https://github.com/vllm-project/vllm/pull/17566). Currently, this is not supported in ipex.llm.functional.rotary_embedding_batched that vLLM uses. I couldn't find rotary_embedding_batched here, so I'm assuming this is generated from rotary_embedding.
I wasn't able to build and test as I don't have access to intel cpu or xpu, looking for review/feedback. thanks
Note: this only modifies ipex.llm.functional.rotary_embedding API
format, also wondering how do we test here?
@jingxu10 @ZailiWang could you take a look?
cc @zhuyuhua-v
There is no CI in public repo. @jingxu10 @ZailiWang could you please cherry pick the PR and test it in internal repo?
There is no CI in public repo. @jingxu10 @ZailiWang could you please cherry pick the PR and test it in internal repo?
OK, let me do it and feedback the result.
Hi @sarckk , thanks a lot for the contribution as well as the notification of recent updates in vLLM, we are implementing some other updating requests for vLLM integration, and this PR, along with some other changes needed at the frontend, will be merged then.
@ZailiWang sounds good! thanks for closing the loop on this