Nurul Fhakri

Results: 1 issue by Nurul Fhakri

Hi, I tested Qwen 2.5 (3B) with GRPO on Kaggle, and after merging in 16-bit, the LoRA adapters do not seem to be applied properly. The output lacks reasoning compared...