unsloth topic

List unsloth repositories

unsloth

50.4k
Stars
4.2k
Forks
50.4k
Watchers

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

ms-swift

11.9k
Stars
1.1k
Forks
11.9k
Watchers

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

oreilly-pytorch-dl

45
Stars
33
Forks
45
Watchers

Code for Deep Learning for Modern AI

notebooks

3.8k
Stars
550
Forks
3.8k
Watchers

100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.

unsloth-llama3-alpaca-lora

31
Stars
0
Forks
31
Watchers

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-e...

Astor-AI

21
Stars
3
Forks
21
Watchers

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced Llama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...

Make-AI-Clone-of-Yourself

21
Stars
4
Forks
21
Watchers

Clone yourself by training a model on your WhatsApp chat history.

vlm-grpo

78
Stars
7
Forks
78
Watchers

An implementation of GRPO for training Unsloth's VLMs

MedCoT-7B

38
Stars
6
Forks
38
Watchers

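This project fine-tunes Deepseek-R1-Distill-Qwen-7B on medical-domain CoT data, using QLoRA quantization and Unsloth-accelerated training to significantly improve the model's slow-thinking ability on complex medical reasoning tasks. Knowledge distillation lets the lightweight model gain the reasoning advantages of large models, ...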

Qing-Digital-Self

35
Stars
6
Forks
35
Watchers

Qing's digital self: a digital-twin project that includes a setup (reproduction) tutorial.