
100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.


The two cases provided in the document involve different dataset formats. Which one should we choose when fine-tuning without CoT? ![Image](https://github.com/user-attachments/assets/c015e74c-9b7b-4001-82eb-2203bd1ee798) **1. [Qwen3 (14B) Reasoning + Conversational notebook](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(14B)-Reasoning-Conversational.ipynb) (recommended)**...

`tokenizer` is undefined; we should use `processor` from that cell instead:
```python
model, processor = FastModel.from_pretrained(
    model_name = "unsloth/csm-1b",
    max_seq_length = 2048, # Choose any for long context!
    dtype = None,
    ...
```

I used `Gemma 3 (1B)` and picked some questions from `openai/gsm8k`; the results are pretty bad compared to `Gemma 3 (1B)` without fine-tuning. I ran it with `llama-cli -m ./gemma-3-finetune.Q8_0.gguf`,...

Set `tokenize=False` in `tokenizer.apply_chat_template`. It won't run otherwise, since tokenization happens twice. I faced this minor issue when running today.
```python
def apply_chat_template(examples):
    texts = tokenizer.apply_chat_template(examples["conversations"], tokenize =...
```
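The reason `tokenize=False` matters: `SFTTrainer` tokenizes the dataset's text field itself, so the mapping function must return plain strings. With the default `tokenize=True`, `apply_chat_template` returns token ids, and those ids would then get tokenized a second time. A toy sketch of the two return types (`FakeTokenizer` and its rendering are invented for illustration, not the real `transformers` API):

```python
class FakeTokenizer:
    """Stand-in for a HF tokenizer; invented for illustration only."""

    def apply_chat_template(self, conversation, tokenize=True):
        # Render the conversation to a flat prompt string.
        text = "".join(f"<|{m['role']}|>{m['content']}" for m in conversation)
        if tokenize:
            return [ord(c) for c in text]  # toy "token ids"
        return text  # tokenize=False -> plain string

tok = FakeTokenizer()
conv = [{"role": "user", "content": "hi"}]

# Default tokenize=True returns ids; a trainer that tokenizes the text
# field would then tokenize this list again (double tokenization).
assert isinstance(tok.apply_chat_template(conv), list)

# tokenize=False returns the string the trainer expects in the text field.
assert isinstance(tok.apply_chat_template(conv, tokenize=False), str)
```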

```python
from trl import SFTTrainer, SFTConfig

trainer = SFTTrainer(
    model = model,
    tokenizer = tokenizer,
    train_dataset = dataset,
    eval_dataset = None, # Can set up evaluation!
    args = SFTConfig(
        dataset_text_field =...
```

Vanilla Llama3.1_(8B)-Alpaca.ipynb throws in the second code block:
```
:1: UserWarning: WARNING: Unsloth should be imported before trl, transformers, peft to ensure all optimizations are applied. Your code may run slower...
```

The original regex didn’t correctly match newlines inside the `reasoning` and `answer` tags. As a result, `soft_format_reward_func` always returned 0, since `strict_format_reward_func` required exactly two newlines inside each tag.
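The fix can be sketched in plain Python: by default `.` does not match newlines, so a pattern spanning multi-line tag contents fails unless `re.DOTALL` is passed. The tag names mirror the notebook's `<reasoning>`/`<answer>` format, but the exact patterns below are illustrative assumptions, not the notebook's code:

```python
import re

# Hypothetical completion in the <reasoning>/<answer> format.
text = "<reasoning>\nstep 1\nstep 2\n</reasoning>\n<answer>\n42\n</answer>"

# Without re.DOTALL, '.' stops at newlines, so the match fails.
no_dotall = re.match(r"<reasoning>(.*)</reasoning>\s*<answer>(.*)</answer>", text)
print(no_dotall)  # None

# With re.DOTALL, '.' also matches '\n', so multi-line tag contents match.
fixed = re.match(
    r"<reasoning>(.*?)</reasoning>\s*<answer>(.*?)</answer>",
    text,
    flags=re.DOTALL,
)
print(fixed.group(2).strip())  # 42
```

A reward function built on the first pattern would score every multi-line completion as 0, which matches the symptom described above.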

Add tool calling notebooks for the following models: - [Qwen-2.5-Instruct example](https://colab.research.google.com/github/oliveirabruno01/unsloth-challenge/blob/main/Qwen2_5_1_5B_Tool_Calling.ipynb) - [Llama-3.1-Instruct example](https://colab.research.google.com/github/oliveirabruno01/unsloth-challenge/blob/main/Llama3_1_%288B%29_Tool_Calling.ipynb)

When running for more than 3 chunks:
```python
import time

# Process 3 chunks for now -> can increase but slower!
for filename in filenames[:5]:
    !synthetic-data-kit \
        -c synthetic_data_kit_config.yaml \
        create...
```

About instruction & question padding: I saw that in some SFT notebook demos, e.g. Gemma3_(4B).ipynb, we mask these parts with -100, while in others like Qwen2.5_(7B)-Alpaca.ipynb we do not. Is it necessary or...