Nurul Fhakri
Results
1
issues of
Nurul Fhakri
Hi, I tested Qwen 2.5 (3B) with GRPO on Kaggle, and after merging using 16-bit, it seems like the LoRA adaptations are not applied properly. The output lacks reasoning compared...