Orion vLLM support?

Open lhl opened this issue 1 year ago • 2 comments

The docs mention that you used vLLM for inference, but it looks like Orion support hasn't been upstreamed yet: https://github.com/vllm-project/vllm/tree/main/vllm/model_executor/models

Can you share the model file, or do you have an ETA for upstreaming the code? HF transformers inference is slow enough to make Orion pretty much unusable, even for running evals.

Jan 23 '24 16:01 lhl

I've been using the Orion branch from https://github.com/dachengai/vllm and it's running, but there might be issues with outputs in different languages

Jan 25 '24 01:01 ZeroYuJie

> I've been using the Orion branch from https://github.com/dachengai/vllm and it's running, but there might be issues with outputs in different languages

Yep, I am trying to translate from Chinese to English, but the output still contains Chinese characters. 😭

Feb 09 '24 07:02 shuiqingliu
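
For anyone comparing the fork against HF transformers, one way to measure the language issue reported above is to flag translated outputs that still contain Chinese characters. This is only a rough sketch: the helper names `contains_cjk` and `cjk_ratio` are hypothetical, they are not part of vLLM or Orion, and treating the two CJK ideograph blocks below as "Chinese characters" is an assumption.

```python
import re

# Assumption: "Chinese characters" means code points in the CJK Unified
# Ideographs block or its Extension A block. Other scripts are ignored.
_CJK_RE = re.compile(r"[\u3400-\u4dbf\u4e00-\u9fff]")


def contains_cjk(text: str) -> bool:
    """Return True if the text contains at least one CJK ideograph."""
    return _CJK_RE.search(text) is not None


def cjk_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that are CJK ideographs."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    cjk_count = sum(1 for c in chars if _CJK_RE.match(c))
    return cjk_count / len(chars)


if __name__ == "__main__":
    # Hypothetical model outputs for a Chinese-to-English translation prompt.
    outputs = ["The weather is nice today.", "The weather 今天 is nice."]
    for out in outputs:
        print(contains_cjk(out), round(cjk_ratio(out), 3), out)
```

Running the same prompts through both backends and comparing how many outputs get flagged would show whether the leftover Chinese text comes from the fork or from the model itself.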
