我伤你不懂

Results 7 comments of 我伤你不懂

> 类似的特变标记,是套用template上添加的,如果直接在数据上加上,是不是会有问题 > > ``` > { > "instruction": "{{query}}\nthought\n{{thought}}", > "input": "", > "output": "{{ans}}", > "history": [], > "system": "You are a helpful assistant." > } > ```...

> > > 类似的特变标记,是套用template上添加的,如果直接在数据上加上,是不是会有问题 > > > ``` > > > { > > > "instruction": "{{query}}\nthought\n{{thought}}", > > > "input": "", > > > "output": "{{ans}}", > > >...

> 请问下你全参数微调的服务器配置是什么情况呢? 8机8卡 都是A100 80G deepspeed zero 3

> 用官方代码全参微调成功了 请问是使用多机多卡微调的吗?

> i have the same question with deepseek-coder-33b-instruct sir, I have solved the issue by setting the end token id and the end token ,when generate

> i have the same question with deepseek-coder-33b-instruct if u use the instruct model,I think u should generate with the method : ``` from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer =...

> 单机 A100 是几张卡?打开 CUDA_LAUNCH_BLOCKING=1 试试呢,报错在哪里? 请问有vllm部署的教程吗?或者文件分享下文件