djl-serving
Add multi-token support
Description
Per-token text is not supported yet. This may need some changes on the vllm/lmi_dist side. Per-token cum_logprob is also not supported, but should be easy to add.
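As a rough illustration of why per-token cum_logprob should be easy to add: assuming each generated token's log-probability is already available, the cumulative value at each position is just a running sum. The function name and the input list below are hypothetical and are not part of djl-serving, vllm, or lmi_dist.

```python
from itertools import accumulate
from typing import List


def per_token_cum_logprobs(token_logprobs: List[float]) -> List[float]:
    """Return the running sum of log-probabilities, one value per token.

    `token_logprobs` is a hypothetical input: the log-probability of each
    generated token, in generation order.
    """
    return list(accumulate(token_logprobs))


# Example with made-up log-probabilities for three generated tokens.
step_logprobs = [-0.5, -0.25, -1.0]
cum = per_token_cum_logprobs(step_logprobs)
print(cum)  # [-0.5, -0.75, -1.75]
```

The last entry equals the sequence-level cumulative log-probability, so a per-token field could be filled in without any extra work from the backend.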