南栖 comments

Results 53 comments of


                                            南栖

模型训练loss变化是什么样的？

我目前也在复现这篇论文的模型架构和训练但是目前遇到一定的问题，能否大家建立一个微信群讨论一下？

请问，会开源训练和微调的方法吗

等我开源

GRPO + QLora + DS3 fails while merging model with adapters

@dhruvmullick I'm releasing an open-source framework By combining GRPO + QLoRA + DeepSpeed ZeRO-3,https://github.com/Minami-su/deepspeed-grpo-qlora-vllm

Add Semantic memory

Oh, I made a mistake. For the one above, the ingestion uses gpt-4o-mini and the response uses gpt-4.1-mini. For this one, both the ingestion and response use gpt-4.1-mini: === Evaluation...

Add Semantic memory

@nanxingw Added. evaluationV2

Add Semantic memory

Hi @nanxingw, just wanted to check in on the status of this pull request. I've pushed the evaluation scripts you asked for last week. Is there any feedback or anything...

Add Semantic memory

Hi @nanxingw, Thank you so much for the update! I am definitely interested in discussing future work on this. Thank you for sharing your email, I will reach out to...

Add Semantic memory

@nanxingw 81 / 81 files viewed

Add Amara-o1-7B-Qwen Amara-o2-7B-Qwen to AlpacaEval

datasets: https://huggingface.co/datasets/Minami-su/Amara-o1-dataset https://huggingface.co/datasets/Minami-su/Amara-o2-dataset