vllm
[Usage]: How to use the OpenAI-compatible API to run a GGUF model?
Your current environment
The output of `python collect_env.py`
How would you like to use vllm
I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.
Before submitting a new issue...
- [X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.