unsloth topics

unsloth

50.4k

Stars

4.2k

Forks

50.4k

Watchers

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

unslothai

ai

fine-tuning

finetuning

gemma

Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi...

modelscope

agent

aigc

baichuan

chatglm

oreilly-pytorch-dl

45

Stars

33

Forks

45

Watchers

Code for Deep Learning for Modern AI

sinanuozdemir

bert

clip

deep-learning

diffusion

notebooks

3.8k

Stars

550

Forks

3.8k

Watchers

100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.

unslothai

unsloth

unsloth-llama3-alpaca-lora

31

Stars

0

Forks

31

Watchers

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-e...

Cre4T3Tiv3

4bit

alpaca

colab

finetuning

Astor-AI

21

Stars

3

Forks

21

Watchers

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced Llama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...

SrikarVeluvali

flask

huggingface

llama3

llm