qingzhong1
qingzhong1
If I currently have data without a reasoning process, but I want to use this data to fine-tune Qwen3, should I simply add /no_think after the prompt and prefix the...
According to the official documentation, when installing MS-Swift, single-machine multi-GPU training will report the following error.   I'd like to know what solutions are available to fix this. Thank...