南栖
Models: https://huggingface.co/Minami-su/Amara-o2-7B-Qwen and https://huggingface.co/Minami-su/Amara-o1-7B-Qwen
I'm releasing an open-source framework that combines GRPO, QLoRA, and DeepSpeed ZeRO-3: https://github.com/Minami-su/deepspeed-grpo-qlora-vllm