unsloth topic

List unsloth repositories

unsloth

48.6k
Stars
4.0k
Forks
48.6k
Watchers

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

ms-swift

3.6k
Stars
310
Forks
Watchers

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...

oreilly-pytorch-dl

45
Stars
33
Forks
45
Watchers

Code for Deep Learning for Modern AI

notebooks

3.8k
Stars
550
Forks
3.8k
Watchers

100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.

unsloth-llama3-alpaca-lora

31
Stars
0
Forks
31
Watchers

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-e...

Astor-AI

21
Stars
3
Forks
21
Watchers

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...

Make-AI-Clone-of-Yourself

19
Stars
3
Forks
19
Watchers

Cloning Yourself using your whatsapp chat history and training a model on it.

vlm-grpo

78
Stars
7
Forks
78
Watchers

An implementation of GRPO for Unsloth's VLMs training

MedCoT-7B

35
Stars
6
Forks
35
Watchers

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实...

Qing-Digital-Self

35
Stars
6
Forks
35
Watchers

数字分身项目,并且包含了搭建(复现)教程 Qing's digital self, including setup tutorial