zhangfan-algo

Results 32 issues of zhangfan-algo

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Which parameter can be used to set this? ### Expected behavior _No response_ ### System Info _No response_ ### Others _No...

solved

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Error message: raise ValueError("Layer-wise BAdam does not yet support distributed training, use ratio-wise BAdam.") ### Expected...

fixed - pending confirmation

The context of my current test data is fairly long and cannot be shortened. Is there any way to support a longer context? ![image](https://github.com/modelscope/swift/assets/47747764/8391f136-2b8c-4e96-a00a-89f0e56dc6fd)

enhancement

Environment: CUDA 12.3, PyTorch 2.2.2. Failed to import transformers.models.qwen2.modeling_qwen2 because of the following error (look up to see its traceback): /mnt/pfs/zhangfan/system/anaconda/envs/swift/lib/python3.10/site-packages/flash_attn-2.5.5-py3.10-linux-x86_64.egg/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: ZN2at4_ops15sum_IntList_out4callERKNS_6TensorEN3c1016OptionalArrayRefIlEEbSt8optionalINS5_10ScalarTypeEERS2 RuntimeError raise RuntimeError(: Failed to import...

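I mainly don't understand which parameters need to be configured. Could someone please share an example script for running the train_with_qlora fine-tuning code?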

![image](https://github.com/TigerResearch/TigerBot/assets/47747764/c733211f-740a-49fa-9d44-4bdbcbd0eb0b) ![image](https://github.com/TigerResearch/TigerBot/assets/47747764/a8bdaaef-3ca4-4da6-a58c-35fc0342aaaf)

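I'd like to see how the model performs across its capabilities, for example evaluations of visual reasoning and OCR. Also, could the training code and data be open-sourced, including those for both pre-training and instruction fine-tuning?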

like Chinese and more.