zhangfan-algo

Results 32 issues of zhangfan-algo

This would be quite helpful for fine-tuning on longer contexts.

enhancement

**Describe the bug** 2024-05-16 14:19:20 [W socket.cpp:697] [c10d] The IPv6 network addresses of (zf-yi1-5-34b-sft-0516-02-master-0, 23456) cannot be retrieved (gai error: -2 - Name or service not known). 2024-05-16 14:19:35 Traceback...

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction torchrun --nproc_per_node ${num_gpu_per_node} --master_port $MASTER_PORT --master_addr $MASTER_ADDR --node_rank $RANK --nnodes $WORLD_SIZE src/train.py \ --stage...

pending

For example, the Llama series and the Qwen series

It has been running for 5 minutes and not a single sample has been processed. ![image](https://github.com/modelscope/swift/assets/47747764/758af88a-d8f5-4a0a-a8b5-9146f70fcbeb)

This model performs very well; on math it comes close to GPT-4o.

enhancement

**Describe the bug** Traceback (most recent call last): File "/apps1/zhangfan/anaconda3/envs/new_swift/lib/python3.10/site-packages/swift/cli/rlhf.py", line 5, in rlhf_main() File "/apps1/zhangfan/anaconda3/envs/new_swift/lib/python3.10/site-packages/swift/utils/run_utils.py", line 22, in x_main args, remaining_argv = parse_args(args_class, argv) File "/apps1/zhangfan/anaconda3/envs/new_swift/lib/python3.10/site-packages/swift/utils/utils.py", line 131, in...

### Your current environment pip install+https://github.com/vllm-project/vllm.git ### Model Input Dumps _No response_ ### 🐛 Describe the bug (VllmWorkerProcess pid=544861) ERROR 09-13 18:22:53 multiproc_worker_utils.py:226] File "/apps1/zhangfan/anaconda3/envs/new_swift/lib/python3.10/site-packages/vllm/executor/multiproc_worker_utils.py", line 223, in _run_worker_process (VllmWorkerProcess...

bug

**Describe the bug** ![image](https://github.com/user-attachments/assets/bc125f23-b4e3-4786-a062-684944e42140) **Additional context** SIZE_FACTOR=8 MAX_PIXELS=602112 torchrun --nproc_per_node ${num_gpu_per_node} --master_port $MASTER_PORT --master_addr $MASTER_ADDR --node_rank $RANK --nnodes $WORLD_SIZE examples/pytorch/llm/llm_rlhf.py \ --model_cache_dir Qwen2-Audio-7B-Instruct \ --model_type qwen2-audio-7b-instruct \ --rlhf_type dpo \...