llava topic
LLamaSharp
A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) efficiently on your local device.
xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...
flowgen
AutoGen Visualized - Visual Tools for Multi-Agent Development.
vision-core-ai
Demo Python script app for interacting with a llama.cpp server using the Whisper API, a microphone, and a webcam.
captain
Give your computer an AI Brain
mlx-vlm
MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.
LLaVA-CLI-with-multiple-images
LLaVA inference with multiple images at once for cross-image analysis.
LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
captcha-solver
Basic Google reCAPTCHA solver using llava-v1.6-7b