opencompass
opencompass copied to clipboard
[Feature] Improve the inference speech with vLLM batch API
Describe the feature
Improve the inference speech with vLLM batch API
Will you implement it?
- [ ] I would like to implement this feature and create a PR!