CTranslate2
Support Speculative Decoding
This could be used for LLMs and, hopefully, for encoder-decoder models as well, for example by using a smaller NLLB model as the draft model coupled with a bigger NLLB model that verifies its output.
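To make the request concrete, here is a minimal sketch of the greedy variant of speculative decoding. It does not use any CTranslate2 API: `toy_target`, `toy_draft`, and `speculative_greedy_decode` are hypothetical names, and the two "models" are stand-in functions that map a token sequence to the next token. It also assumes both models share the same vocabulary. The draft model proposes up to `k` tokens, the target model checks them, and the longest matching prefix is kept, so the output is identical to plain greedy decoding with the target model.

```python
from typing import Callable, List

# A "model" here is just a function: token history -> next token (greedy).
NextToken = Callable[[List[int]], int]


def toy_target(tokens: List[int]) -> int:
    """Hypothetical stand-in for the large, slow model."""
    return (tokens[-1] * 3 + 1) % 7


def toy_draft(tokens: List[int]) -> int:
    """Hypothetical stand-in for the small, fast model (sometimes wrong)."""
    return 0 if tokens[-1] == 5 else (tokens[-1] * 3 + 1) % 7


def greedy_decode(model: NextToken, prefix: List[int], max_new_tokens: int) -> List[int]:
    """Plain greedy decoding, one model call per generated token."""
    tokens = list(prefix)
    for _ in range(max_new_tokens):
        tokens.append(model(tokens))
    return tokens


def speculative_greedy_decode(
    target: NextToken,
    draft: NextToken,
    prefix: List[int],
    max_new_tokens: int,
    k: int = 4,
) -> List[int]:
    """Greedy speculative decoding: draft proposes, target verifies."""
    tokens = list(prefix)
    limit = len(prefix) + max_new_tokens
    while len(tokens) < limit:
        # 1. The draft model proposes up to k tokens autoregressively.
        n = min(k, limit - len(tokens))
        proposal: List[int] = []
        for _ in range(n):
            proposal.append(draft(tokens + proposal))

        # 2. The target model checks each proposed position. A real
        #    implementation would score all positions in one batched
        #    forward pass; this loop only mimics the accept/reject logic.
        accepted: List[int] = []
        for tok in proposal:
            expected = target(tokens + accepted)
            if expected != tok:
                accepted.append(expected)  # replace the first mismatch
                break
            accepted.append(tok)
        else:
            # Every proposal matched: take one extra target token if room.
            if len(tokens) + len(accepted) < limit:
                accepted.append(target(tokens + accepted))

        tokens.extend(accepted)
    return tokens


if __name__ == "__main__":
    print(speculative_greedy_decode(toy_target, toy_draft, [1], 12))
```

The speedup in practice would come from step 2 being a single forward pass of the large model over several positions, which is the part that would need support inside the library. For an encoder-decoder pair, the source sentence would presumably be encoded once by each model, with only the decoder side running this loop.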