TensorRT-LLM
[Model Requests] Add support for CogVLM
https://github.com/THUDM/CogVLM CogVLM is one of the best models for describing images, and much better than Qwen-VL in my experience. Faster image captioning would be a huge gain, and being able to run it in 4-bit would be a big advantage.