lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Results 125 lightllm issues

Hello, I want to deploy the VILA model for serving VILA1.5-3B-AWQ (https://github.com/NVlabs/VILA). Could you please guide me on how to get started? Are there any specific instructions or tools I...

Hello, are you considering support for encoder-only models, such as the bge-m3 embedding model?

bug

Can lightllm run inference using only the CPU?