zhangfan-algo issues

Results 32 issues of


                                            zhangfan-algo

可以建一个官方的微信交流群吗

博主,隔壁ChatGLM有官方的交流群,一些技术问题和模型使用中的问题大家可以讨论交流,咱们方便建一个不

微调13B模型,官方推荐的cuda和pytorch版本是那个呀

预训练是否支持pretrain中文数据,扩充词表

想问下博主 pt代码是否支持pretrain中文数据集,以及如果预训练中文的时候是否支持扩充词表呢,因为原生llama对中文不是很友好,中文几乎找到在原有词表中

pending

AttributeError: can't set attribute 'eos_token' 在跑评估代码出现了报错

![image](https://github.com/hiyouga/ChatGLM-Efficient-Tuning/assets/47747764/64e982c8-9bfa-4677-b949-90f2465e5b73)

pending

InternVL可以支持一下微调不

https://github.com/OpenGVLab/InternVL 最新的模型效果接近qianwen-vl-max 可以支持一下微调不

more models

![image](https://github.com/modelscope/swift/assets/47747764/5a391d29-3e63-4c4e-a7b5-a031800a1a25) 使用的是8卡A800 运行脚本 RAY_memory_monitor_refresh_ms=0 CUDA_VISIBLE_DEVICES=0 python examples/pytorch/llm/llm_infer.py \ --infer_backend vllm \ --ckpt_dir /mnt/pfs/zhangfan/study_info/LLaMA-Factory_0308/output/merge_sft_prompt_0319_qwen1half_4B_sft_0319/checkpoint-5890 \ --custom_val_dataset_path data/merge_sft_prompt_0319_test.jsonl \ --max_length -1 \ --use_flash_attn true \ --max_new_tokens 2300 \ --temperature 0.01 \ --top_p...

question

zhangfan-algo

可以建一个官方的微信交流群吗

微调13B模型,官方推荐的cuda和pytorch版本是那个呀

预训练是否支持pretrain中文数据,扩充词表

AttributeError: can't set attribute 'eos_token' 在跑评估代码出现了报错

InternVL可以支持一下微调不

qwen1.5-4B-chat 多卡推理报错

可以支持一下SPIN自我博弈微调的方法不

如何微调InternVL-Chat-V1.2-Plus

请教一下InternVL-Chat-V1.5如何进行lora微调呢

单张80G卡编辑7B模型报显存不足想请教一下如何单机多卡去run