Yi
Yi copied to clipboard
A series of large language models trained from scratch by developers @01-ai
根据README运行sft脚本: ``` cd finetune/scripts bash run_sft_Yi_6b.sh ``` **报错信息** ``` [2024-01-02 10:43:01,920] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2024-01-02 10:43:04,373] [WARNING] [runner.py:203:fetch_hostfile] Unable to find hostfile, will proceed with...
``` sampling params: SamplingParams(n=1, best_of=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.7, top_p=1.0, top_k=-1, min_p=0.0, use_beam_search=False, length_penalty=1.0, early_stopping=False, stop=[], stop_token_ids=[7], include_stop_str_in_output=False, ignore_eos=False, max_tokens=3736, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True) ``` 在v100 32GB上运行,启动脚本为 ```sh CUDA_VISIBLE_DEVICES=2,3 python...
微调 YI-6B 一直出现 loss scale overflow 然后 reduce 到 min_loss_scale 报错, YI-6B-Chat 则没问题,chat 模型训练参数设置有什么不同吗
data:image/s3,"s3://crabby-images/08c2b/08c2b95d7cec5de08838bf85ccf5552b400d0d82" alt="image" data:image/s3,"s3://crabby-images/f12d0/f12d03763df564ae573e95f7c6587c38e9ef5110" alt="image" 环境: V100 * 8 deepspeed: 0.10.3 torch: 2.1.0+cu118
您好,请问Yi是否通过了大模型备案。
You shouldn't move a model when it is dispatched on multiple devices. Traceback (most recent call last): File "/home/yi/workspace/Yi/finetune/sft/main.py", line 416, in main() File "/home/yi/workspace/Yi/finetune/sft/main.py", line 345, in main model,...
我使用34B-chat和langchain 配合作为一个agent使用。 prompt如下: ``` ou are a friendly Chatbot that having a conversation with a human. You NEVER answer that you are an AI language model. If you don't know...
就不能在官网挂一个体验的链接吗?找了半天,官网,公众号,开源社区等等,连一个体验链接都找不到。你们官方部署一个体验的环境没那么难吗? 我这边正好有客户需要处理长上下文本的需求,总不能每个要体验的人都自己去部署吧
## Bug Description In instruct mode, the model do not stop after completing the first valid assistant response (). It then generates garbage, even "user"... ``` system system 你是一个AI助手。 user...