zhangfan-algo

Results 26 issues of zhangfan-algo

对于比较长的长下文微调帮助挺大的

enhancement

**Describe the bug** 2024-05-16 14:19:20 [W socket.cpp:697] [c10d] The IPv6 network addresses of (zf-yi1-5-34b-sft-0516-02-master-0, 23456) cannot be retrieved (gai error: -2 - Name or service not known). 2024-05-16 14:19:35 Traceback...

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction torchrun --nproc_per_node ${num_gpu_per_node} --master_port $MASTER_PORT --master_addr $MASTER_ADDR --node_rank $RANK --nnodes $WORLD_SIZE src/train.py \ --stage...

pending

like llama Series and qwen Series

已经运行了5分钟,一条数据也没有跑出来 ![image](https://github.com/modelscope/swift/assets/47747764/758af88a-d8f5-4a0a-a8b5-9146f70fcbeb)

这个模型效果非常不错,数学上面接近gpt4o了

enhancement