Results: 78 comments of suc16

I have the same issue with fastchat 0.2.1. I have tried updating huggingface transformers and restarting the workers, but it still doesn't work. vicuna v0 and vicuna v1.1 both have the...

> New models and v0.1.10 work for me, so:
>
> - fschat-v0.1.10 + vicuna-7b-v0: works
> - fschat-v0.1.10 + vicuna-7b-v1.1: works
> - fschat-v0.2.1 + vicuna-7b-v0: doesn't work
> - fschat-v0.2.1 + vicuna-7b-v1.1: not...

Thanks. My environment:

- python 3.9
- transformers 4.28.1
- fschat 0.2.2

After applying the delta with the latest fastchat, I still get blank EOS/BOS tokens in special_tokens_map.json:

```
python3 -m fastchat.model.apply_delta --base /data/models/llama-7b-hf --target /data/models/vicuna-7b --delta...
```
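To confirm the blank special tokens after the merge, here is a quick diagnostic sketch (assuming the merged weights landed at the `--target` path above):

```python
import json
from transformers import AutoTokenizer

# Inspect the file the issue is about.
with open("/data/models/vicuna-7b/special_tokens_map.json") as f:
    print(json.load(f))  # shows e.g. "bos_token": "" / "eos_token": "" when blank

# Cross-check what the tokenizer actually loads.
tok = AutoTokenizer.from_pretrained("/data/models/vicuna-7b", use_fast=False)
print(repr(tok.bos_token), repr(tok.eos_token))
```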

> > > Is my training speed normal? 180k samples, batch_size=64, 2 epochs, and it takes a full 32 hours on a V100 80GB GPU. That works out to only ~5 texts per second?

Only the A100 comes in 80GB... That speed seems about normal.

> > > > My mistake, it's an A100. Still, this is really slow; I've found that increasing the batch size doesn't speed things up at all.

In your code, when the batch size gets larger, max_steps is scaled down proportionally, right?
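As a rough sanity check on the numbers in this exchange (reading "18w" as 180k samples; how max_steps is derived from the batch size is my assumption about the script):

```python
samples, epochs, hours = 180_000, 2, 32

# Implied throughput for the reported run.
throughput = samples * epochs / (hours * 3600)
print(f"{throughput:.1f} samples/s")  # ~3.1 samples/s

# If max_steps is computed from the dataset size, a larger batch size just
# shrinks the step count instead of cutting wall-clock time per epoch.
for batch_size in (32, 64, 128):
    print(batch_size, samples * epochs // batch_size)
```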

> The resources aren't being fully utilized, and many of the parameters you've defined actually have no effect. I recommend this blog post: [Efficient Training on a Single GPU](https://huggingface.co/docs/transformers/perf_train_gpu_one#efficient-training-on-a-single-gpu)
>
> Other possible improvements include, but are not limited to:
>
> ```
> bf16 = True
> tf32 = True
> optim = "adamw_torch_fused"  # or "adamw_apex_fused" after installing apex
> ...
> ```
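For context, a minimal sketch of where these flags live, assuming the training script uses Hugging Face `TrainingArguments` (the output path and hyperparameter values below are placeholders drawn from this thread, not the poster's actual script):

```python
from transformers import TrainingArguments

# Placeholder values; bf16/tf32 require an Ampere-or-newer GPU such as the A100.
training_args = TrainingArguments(
    output_dir="./output",             # hypothetical path
    per_device_train_batch_size=64,    # batch size discussed above
    num_train_epochs=2,
    bf16=True,                         # bfloat16 mixed-precision training
    tf32=True,                         # allow TensorFloat-32 matmuls
    optim="adamw_torch_fused",         # fused AdamW; "adamw_apex_fused" if apex is installed
)
```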

Concatenate the history following the official repo's prompt format:

```python
prompt = ""
for i, (old_query, response) in enumerate(history):
    prompt += "[Round {}]\n问:{}\n答:{}\n".format(i, old_query, response)
prompt += "[Round {}]\n问:{}\n答:".format(len(history), query)
```
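Wrapped as a function with a worked example (the function name and the sample turns are mine for illustration; the concatenation itself is unchanged):

```python
def build_prompt(history, query):
    # Same concatenation as the snippet above.
    prompt = ""
    for i, (old_query, response) in enumerate(history):
        prompt += "[Round {}]\n问:{}\n答:{}\n".format(i, old_query, response)
    prompt += "[Round {}]\n问:{}\n答:".format(len(history), query)
    return prompt

print(build_prompt([("你好", "你好!有什么可以帮你?")], "今天天气怎么样?"))
# [Round 0]
# 问:你好
# 答:你好!有什么可以帮你?
# [Round 1]
# 问:今天天气怎么样?
# 答:
```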

> ```python
> from fastapi import FastAPI, Request
> from transformers import AutoTokenizer, AutoModel
> import uvicorn
> import json
> import datetime
> import torch
> from peft import...
> ```
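For reference, a minimal self-contained sketch of this kind of FastAPI inference endpoint (the ChatGLM-style `model.chat` API, model path, route, and payload fields are my assumptions; the quoted original additionally loads a peft adapter):

```python
from fastapi import FastAPI, Request
from transformers import AutoTokenizer, AutoModel
import uvicorn

app = FastAPI()

# Hypothetical model path; ChatGLM weights ship their own chat() via trust_remote_code.
tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).half().cuda()
model.eval()

@app.post("/")
async def create_item(request: Request):
    payload = await request.json()  # assumed fields: "prompt", "history"
    response, history = model.chat(
        tokenizer,
        payload.get("prompt", ""),
        history=payload.get("history", []),
    )
    return {"response": response, "history": history}

if __name__ == "__main__":
    uvicorn.run(app, host="0.0.0.0", port=8000)
```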

> @Ling-yunchi
> > Awesome work! A gradio frontend would make it even better.

For a frontend/backend separation like yours, take a look at fastchat.

> @suc16 fastchat???

I should have said: take fastchat as a reference. This fastchat server is a good example of frontend/backend separation: https://github.com/lm-sys/FastChat/blob/main/fastchat/serve/gradio_web_server.py