unsloth topic
unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...
oreilly-pytorch-dl
Code for Deep Learning for Modern AI
notebooks
100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.
unsloth-llama3-alpaca-lora
Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-e...
Astor-AI
AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...
Make-AI-Clone-of-Yourself
Cloning Yourself using your whatsapp chat history and training a model on it.
vlm-grpo
An implementation of GRPO for Unsloth's VLMs training
MedCoT-7B
本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实...
Qing-Digital-Self
数字分身项目,并且包含了搭建(复现)教程 Qing's digital self, including setup tutorial