vllm
vllm copied to clipboard
CTranslate2
Hello,
Thanks for the great framework for deploying LLM.
Would it be possible to use a LLM model compiled with the CTranslate2 library?
Thanks for bringing this up. We will investigate the CTranslate2 library and evaluate the difficulty and the potential benefit of adding it into vLLM.
Would love to see this, ct2 would be a great integration! It would give us easy access to fast 8 bit inference and plays nice with HF Transformers. Thank you for the library so far!!
Hi,
Any news regarding this integration? Ctranslate2 has already proven its speed within the TitanML framework for local LLM serving.
hi,
any news on this?
+1
+11
@zhuohan123 do you see any benefit of adding this to vLLM?