Max Zabarka

Results 2 comments of Max Zabarka

I'd like to help implement this aswell

> In case of direct vLLM calls (from Python) we could let the user to pass a callback to process the logits before the token is chosen, so the probability...