gctian issues

Results: 7 issues by gctian

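Why does the vocab_size of the logit vector differ from the size of the vocab.json actually used?

vocab.json contains 106029 tokens, but the logit vector the model finally produces has dimension 107008. Why the mismatch? Wouldn't this mean some tokens cannot be decoded?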
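The gap described above can be made concrete with a small sketch. It only uses the two numbers quoted in the issue; whether the extra ids are padding rows or reserved tokens is not stated there, so the masking step is an assumption about how one might keep undecodable ids from being selected. The helper names `mask_unmapped_ids` and `greedy_pick` are hypothetical.

```python
import math

VOCAB_TOKENS = 106029  # tokens in vocab.json, as reported in the issue
LOGIT_DIM = 107008     # logit dimension, as reported in the issue

# Number of logit positions that have no entry in vocab.json.
extra_ids = LOGIT_DIM - VOCAB_TOKENS


def mask_unmapped_ids(logits, vocab_size):
    """Return a copy where ids >= vocab_size are set to -inf (never selected)."""
    return [x if i < vocab_size else -math.inf for i, x in enumerate(logits)]


def greedy_pick(logits):
    """Index of the largest logit."""
    return max(range(len(logits)), key=logits.__getitem__)


# Toy example: 5 logit positions, but only ids 0..2 exist in the vocabulary.
toy_logits = [0.1, 0.7, 0.2, 5.0, 9.0]
picked = greedy_pick(mask_unmapped_ids(toy_logits, vocab_size=3))
```

With the mask applied, the toy example picks the best id that can actually be decoded instead of an unmapped one.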

local build error: linking with `cc` failed

### System Info GPU: A100 Python:3.10.13 ### Information - [ ] Docker - [ ] The CLI directly ### Tasks - [ ] An officially supported command - [ ]...

connection failed when using docker-disque

Hi, I'm using docker-disque, with disq as the client. My URL is "disque://10.18.119.32:7711". The error is "redis.exceptions.ConnectionError: Error -2 connecting to b'':7711. Name or service not known." I debugged step by...
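The error message shows an empty host (`b''`) next to the expected port, which suggests the host part was lost somewhere before the connection attempt. The preview does not say where, so the following is only a diagnostic sketch using the standard library: it checks that the URL from the issue parses into a non-empty host and a port. The helper `split_disque_url` is hypothetical and is not part of disq or docker-disque.

```python
from urllib.parse import urlparse


def split_disque_url(url):
    """Parse a disque:// URL and return (host, port), failing loudly on an empty host."""
    parsed = urlparse(url)
    if not parsed.hostname:
        raise ValueError(f"no host found in URL: {url!r}")
    return parsed.hostname, parsed.port


host, port = split_disque_url("disque://10.18.119.32:7711")
```

If the host comes out correctly here but the client still connects to an empty host, the value is presumably being dropped inside the client's own URL handling rather than in the URL itself.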

probability tensor contains either `inf`, `nan` or element < 0

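Has anyone run into this error? I tried several torch versions and it still fails, so it should not be related to the version or environment. Any pointers would be appreciated. ![image](https://github.com/baichuan-inc/Baichuan-13B/assets/18107813/025fd62b-a8c8-4a4e-9ec9-6910c61f7437)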
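This message is typically raised when sampling from a distribution whose values are not valid probabilities. The cause in this particular report is not given, so the sketch below is only a generic diagnostic in plain Python: it locates `nan` or `+inf` logits and shows a softmax that subtracts the maximum to avoid overflow. The function names are hypothetical.

```python
import math


def find_bad_logits(logits):
    """Indices whose value is nan or +inf (-inf is left alone, it is a valid mask value)."""
    return [i for i, x in enumerate(logits) if math.isnan(x) or x == math.inf]


def stable_softmax(logits):
    """Softmax that subtracts the max first so exp() cannot overflow."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


bad = find_bad_logits([1.0, float("nan"), float("inf"), -2.0])
probs = stable_softmax([1000.0, 1000.0])  # naive exp(1000.0) would overflow
```

Running a check like `find_bad_logits` on the logits just before sampling can help tell whether the invalid values come from the model output itself or from later processing.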

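Checkpoint model cannot be loaded

### 🐛 Bug description The saved checkpoint directory seems to be missing files. Why does it contain only 3 files, while the complete model directory has 6? This is the complete model directory: ### Python Version None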

bug

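Questions about a few training details

The author's project is great, and I have a few questions: 1. Is the data for multiple characters trained together, or is a separate LoRA trained for each character? 2. For example, when training the Linghu Chong (令狐冲) character, this would be SFT QA fine-tuning where A = Linghu Chong's line and Q = the preceding line from the person talking to him. Is that the format? 3. Is the SFT single-turn or multi-turn fine-tuning? 4. How do you segment continuous context so that the QA pairs do not end up with answers that miss the question? 5. Does character background information also need instruction fine-tuning? For example, Linghu Chong's relationships and skills: a system prompt alone is probably not sufficient. Thanks.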

Input validation error: `inputs` must have less than 512 tokens. Given: 534

### System Info - text-embeddings-router 1.1.0 - python3.10 - centos - A800 ### Information - [ ] Docker - [X] The CLI directly ### Tasks - [ ] An officially...
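The error title reports an input of 534 tokens against a limit of 512. One generic way to stay under such a limit is to split the token ids into windows before sending them. The sketch below assumes the text is already tokenized and treats the limit as a parameter; `chunk_tokens` is a hypothetical helper, not part of text-embeddings-router.

```python
def chunk_tokens(token_ids, max_tokens):
    """Split a list of token ids into consecutive windows of at most max_tokens."""
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    return [token_ids[i:i + max_tokens] for i in range(0, len(token_ids), max_tokens)]


# Stand-in for the 534 token ids mentioned in the error message.
ids = list(range(534))
chunks = chunk_tokens(ids, max_tokens=512)
sizes = [len(c) for c in chunks]
```

Because the message says "less than 512", a stricter value such as `max_tokens=511` may be needed; the exact boundary is not clear from the preview.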