Cherrysaber comments

Results: 10 comments by Cherrysaber

[BUG/Help] ds_train_finetune.sh: how many resources does multi-GPU training need?

wsl2?

[Feature] Created a branch that supports multi-GPU deployment and automatically distributes GPU memory evenly.

> Loading the quantized int4 model raises an error: ![image](https://user-images.githubusercontent.com/46914203/227116839-efcae0ad-430a-4ca4-8fd1-630734da8ce6.png)

`model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4-qe", trust_remote_code=True)`
`model.save_pretrained("./multi_gpus", max_shard_size='2GB')`

Run the two lines above in Python first, then start the web UI and set the model path to _**"./multi_gpus"**_.
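The two lines in the comment above can be wrapped into a small reusable helper. This is only a sketch: the function name `reshard_checkpoint` and its parameter names are hypothetical, while the model ID, output path, and shard size are the ones given in the comment. It assumes `transformers` is installed and that the model can be downloaded or is already cached.

```python
def reshard_checkpoint(src="THUDM/chatglm-6b-int4-qe",
                       dst="./multi_gpus",
                       max_shard_size="2GB"):
    """Load a checkpoint and save it again as shards of at most max_shard_size.

    `reshard_checkpoint` is a hypothetical helper name; the defaults come
    from the comment above.
    """
    # Imported lazily so that merely defining the helper does not need transformers.
    from transformers import AutoModel

    # trust_remote_code=True is required because ChatGLM ships its own model code.
    model = AutoModel.from_pretrained(src, trust_remote_code=True)

    # Write the weights back out in smaller shards under `dst`.
    model.save_pretrained(dst, max_shard_size=max_shard_size)
    return dst

# Usage (downloads the model on first run):
#   path = reshard_checkpoint()
#   then point the web UI's model path at `path`.
```

Calling the helper once and then pointing the web UI at the returned path mirrors the procedure described in the comment.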

[Feature] Created a branch that supports multi-GPU deployment and automatically distributes GPU memory evenly.

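> > Loading the quantized int4 model raises an error: ![image](https://user-images.githubusercontent.com/46914203/227116839-efcae0ad-430a-4ca4-8fd1-630734da8ce6.png) > > Isn't that because the path is wrong? But once the model is quantized to int4, is multi-GPU still needed? I haven't tested it.

It is still very much needed. max_tokens is tied directly to the amount of GPU memory, and on the same hardware the int4 model can hold far more context than the regular model.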
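The link between GPU memory and context length can be made concrete with a rough back-of-the-envelope calculation. The sketch below assumes a ChatGLM-6B-like shape (28 layers, hidden size 4096, fp16 key/value cache) and ignores activations and framework overhead. The function names and the weight sizes in the usage lines are illustrative assumptions, not measured values.

```python
GIB = 1024 ** 3

def kv_cache_bytes_per_token(num_layers=28, hidden_size=4096, bytes_per_value=2):
    """Bytes of key/value cache stored per generated or prompt token.

    Each layer keeps one key vector and one value vector of `hidden_size`
    elements. The defaults are assumed ChatGLM-6B-like values.
    """
    return 2 * num_layers * hidden_size * bytes_per_value

def max_context_tokens(total_gib, weights_gib, per_token_bytes=None):
    """Rough upper bound on tokens that fit after the weights are loaded."""
    if per_token_bytes is None:
        per_token_bytes = kv_cache_bytes_per_token()
    free_bytes = (total_gib - weights_gib) * GIB
    return max(0, int(free_bytes // per_token_bytes))

# Illustrative comparison on a 16 GiB card (weight sizes are assumptions):
fp16_tokens = max_context_tokens(total_gib=16, weights_gib=12)
int4_tokens = max_context_tokens(total_gib=16, weights_gib=4)
print(fp16_tokens, int4_tokens)
```

Under these assumptions the int4 weights leave roughly three times as much memory for the cache, which is the effect the comment describes. Real limits will be lower once activations and allocator overhead are counted.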

[Feature] Created a branch that supports multi-GPU deployment and automatically distributes GPU memory evenly.

> > > Loading the quantized int4 model raises an error: ![image](https://user-images.githubusercontent.com/46914203/227116839-efcae0ad-430a-4ca4-8fd1-630734da8ce6.png) > > > > > > `model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4-qe", trust_remote_code=True)` `model.save_pretrained("./multi_gpus", max_shard_size='2GB')` Run the two lines above in Python first, then start the web UI and set the model path to _**"./multi_gpus"**_ > > That does get it running, but a new problem has appeared. It really is 4 GPUs > > Error message > > Code > >...

[Feature] Created a branch that supports multi-GPU deployment and automatically distributes GPU memory evenly.

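> I ran into the same error: Expected all tensors to be on the same device, but found at least two devices... > > Neither the model nor the code from the repository runs properly. The model was downloaded from https://cloud.tsinghua.edu.cn/d/fb9f16d6dc8f482596c2/.

Please post the full error stack trace.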

[Feature] Created a branch that supports multi-GPU deployment and automatically distributes GPU memory evenly.

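> > I ran into the same error: Expected all tensors to be on the same device, but found at least two devices... > > Neither the model nor the code from the repository runs properly. The model was downloaded from https://cloud.tsinghua.edu.cn/d/fb9f16d6dc8f482596c2/. > > Check whether the CLI demo runs properly. Everything works for me on Windows, but after switching to WSL Ubuntu I get the same error as they do. I hooked torch.embedding and found...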