Max Zabarka
Results
2
comments of
Max Zabarka
I'd like to help implement this aswell
> In case of direct vLLM calls (from Python) we could let the user to pass a callback to process the logits before the token is chosen, so the probability...