Nadav Shmayovits

Results: 2 issues by Nadav Shmayovits

This pull request adds support for logits processor plugins. This makes custom logits processors easy to implement and removes the need to modify vLLM itself to add one. For example...

documentation
frontend
needs-rebase
unstale

### 🚀 The feature, motivation and pitch

I am trying to run a 70B model on a node with 3x A100-80GB GPUs. 2x A100-80GB does not provide enough VRAM to run the model,...

feature request