paligemma topic

List paligemma repositories

notebooks

9.0k
Stars
1.4k
Forks
9.0k
Watchers

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2,...

maestro

2.7k
Stars
220
Forks
2.7k
Watchers

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL.

ms-swift

11.9k
Stars
1.1k
Forks
11.9k
Watchers

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

mlx-vlm

1.9k
Stars
212
Forks
1.9k
Watchers

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

YoloGemma

84
Stars
6
Forks
84
Watchers

Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.

gemma-cookbook

2.4k
Stars
371
Forks
2.4k
Watchers

A collection of guides and examples for the Gemma open models from Google.

MLLM-Finetuning-Demo

54
Stars
2
Forks
54
Watchers

Demo code for fine-tuning multimodal LLMs with LLaMA-Factory.

Vision-language-models-VLM

59
Stars
11
Forks
59
Watchers

Vision language model fine-tuning notebooks and use cases (MedGemma, PaliGemma, Florence, ...)