self-operating-computer icon indicating copy to clipboard operation
self-operating-computer copied to clipboard

CogVLM Support - A better LLaVa

Open AMEND09 opened this issue 1 year ago • 0 comments

CogVLM is a Python based open source multimodal model. It is significantly better LLaVa, especially at identifying elements on a screen in fact it excels at that part. CogVLM is not too difficult to run and it would improve the experience of running locally. While CogVLM is not supported through ollama please consider adding support for it.

AMEND09 avatar Feb 23 '24 23:02 AMEND09