vllm
vllm copied to clipboard
Add tests for models
We need tests for the models we support. The tests should ensure that the outputs of our models when using greedy sampling are equivalent to those of HF models.