speculative-decoding

About batch size > 1

Open · Nero0113 opened this issue 5 months ago · 1 comment

First of all, thank you for open-sourcing an implementation of speculative decoding for batch size > 1. Is it possible to use it directly with models downloaded from Hugging Face, without customizing their framework code? I tried it with CodeGen, but the generated output was garbled. I would appreciate any guidance when you have time.

Nero0113 · Sep 10 '24 08:09