[Usage]: How to use the OpenAI-compatible API to run a GGUF model?

Open · weiminw opened this issue 5 months ago • 1 comment

Your current environment

The output of `python collect_env.py`

How would you like to use vllm

I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.

Before submitting a new issue...

  • [X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

weiminw, Sep 12 '24 06:09