HanWenbo comments

Results: 7 comments by HanWenbo

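Question: after quantization, how does the 72B model compare with the 7B/14B models?

Same question here.

Question: after quantization, how does the 72B model compare with the 7B/14B models?

Thank you for your reply. GPTQ quantization requires loading a calibration dataset. Do different calibration datasets have a large effect on the quantization result? Are there plans to open-source the calibration dataset that Qwen uses?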

No rcv1.tar.xz file

I have the same problem. Can anyone tell me how to fix it? Thank you very much.

Too many requests in 1 hour. Try again later

I get the error inside the chat dialog; how come yours shows up outside it?

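```python
import torch
from transformers import AutoModel

# Placeholder: path or hub ID of your own trained model.
pretrained = "your model path"

# Load in fp16 on the GPU; trust_remote_code pulls in the model's custom code.
model = AutoModel.from_pretrained(pretrained, torch_dtype=torch.float16, trust_remote_code=True).cuda()
print('Model loaded')

print('Starting quantization')
# quantize() comes from the model's remote code; 4 selects int4.
model.quantize(4)
print('Quantization finished')

# Path to save the model to
file_path = "your path"
model.save_pretrained(file_path)
```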

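After exporting the model, all returned content is unk

This happens because your GPU model is partly unsupported; I am currently running into the same unk problem.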

When using the Chat Assistant with an openai model, the model works properly but the conversation message is empty

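> > I solved this problem by setting the delimiter of the streaming-mode results to * /n, i.e. removing one /n.

> > Where did you set this? Did you modify the source code?

![image](https://github.com/user-attachments/assets/885e96b3-4ac3-49eb-b216-9e399d535751)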
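For anyone wondering why a delimiter setting could leave the message empty, here is a minimal sketch. It is not the application's actual code: `split_stream` is a hypothetical helper, and it assumes the quoted fix means switching from a double-newline delimiter to a single newline. The sample chunks are made up for illustration.

```python
from typing import Iterable, Iterator


def split_stream(chunks: Iterable[str], delimiter: str = "\n\n") -> Iterator[str]:
    """Yield complete messages from a text stream, split on `delimiter`.

    Hypothetical helper for illustration only.
    """
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        # Emit every complete message currently sitting in the buffer.
        while delimiter in buffer:
            message, buffer = buffer.split(delimiter, 1)
            if message:
                yield message
    # Whatever is left when the stream ends is a final, unterminated message.
    if buffer:
        yield buffer


# Made-up stream in which each message ends with a single newline.
chunks = ["data: Hel", "lo\ndata: wor", "ld\n"]

# Double-newline delimiter: nothing is split off until the stream ends.
print(list(split_stream(chunks, "\n\n")))
# Single-newline delimiter: each message is yielded as soon as it completes.
print(list(split_stream(chunks, "\n")))
```

If the client expects one delimiter and the server sends another, messages are never recognised as complete while streaming, which may be what shows up as an empty reply.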


[Help] How do I export my own trained model as an int4 version?
