南栖

Results 53 comments of 南栖

model: https://huggingface.co/Minami-su/Amara-o2-7B-Qwen https://huggingface.co/Minami-su/Amara-o1-7B-Qwen

I'm releasing an open-source framework By combining GRPO + QLoRA + DeepSpeed ZeRO-3,https://github.com/Minami-su/deepspeed-grpo-qlora-vllm