aphrodite-engine icon indicating copy to clipboard operation
aphrodite-engine copied to clipboard

[New Model]: Phi3ForCausalLM

Open sparsh35 opened this issue 1 year ago • 3 comments

The model to consider.

https://huggingface.co/microsoft/Phi-3-medium-128k-instruct

I was trying to run the exl2 quants for these models , but getting error at rotatry embedding these models use two rope scaling factors as long_factor and short_factor. Model is good and the vllm , huggingface have a merge which does support this but they don't support exl2.

The closest model Aphrodite already supports.

No response

What's your difficulty of supporting the model you want?

relevant git merges :

https://github.com/vllm-project/vllm/pull/4298

sparsh35 avatar May 24 '24 02:05 sparsh35

Bump

localbarrage avatar Jun 24 '24 21:06 localbarrage

Bump

murtaza-nasir avatar Jul 04 '24 01:07 murtaza-nasir

It's currently in the rc_054 branch, you can test it, please note that some quantizations are broken atm.

sgsdxzy avatar Jul 04 '24 06:07 sgsdxzy

Added as of v0.6.0

AlpinDale avatar Sep 03 '24 13:09 AlpinDale