Zhengxiao Du comments

Results 163 comments of


                                            Zhengxiao Du

[BUG/Help] <title>好多常识回答不对

欢迎针对 ChatGLM-6B 的 badcase 提交反馈：https://github.com/THUDM/ChatGLM-6B/tree/main/improve

[BUG/Help] <title> 8 or 128 ?

Thanks for pointing out this. Already fixed.

[BUG/Help] <我不得不说> 在配置不错的情况下，运行也慢，官方 demo respone 也是空的

应该是计算中出现了NaN。你的 CUDA 版本是多少？

[BUG/Help]开batch预测时，模型结果不一致。

有可能是显卡计算的精度误差导致的。只要生成结果都合理即可吧

RuntimeError: DefaultCPUAllocator: not enough memory: you tried to allocate 100663296 bytes.[BUG/Help] <title>

内存不够了，100663296 bytes只有95MB。可以检查一下计算机的空余内存，是否有别的程序在占用大量内存

添加了流式解码器，做到更好的控制台体验和更高的解码效率

很有价值的实现，我需要检验一下流式解码和原 tokenizer 是否是等价的。可能存在的问题是现在 `stream_generate` 会对模型的输出做一些后处理，https://huggingface.co/THUDM/chatglm-6b/blob/main/modeling_chatglm.py#L1251

[BUG/Help] <title> RuntimeError: CUBLAS error: CUBLAS_STATUS_NOT_INITIALIZED

The error is usually caused by running out of GPU memorg https://discuss.pytorch.org/t/cuda-error-cublas-status-not-initialized-when-calling-cublascreate-handle/125450

更新了模型后，报这样的错。

这是显存爆了，跟模型没关系吧

[BUG/Help] Windows环境下运行本地int4模型报错

编译并行 kernel 还需要 `openmp`，如果失败的话会 fallback 到非并行 kernel，你可以看一下后面还有什么报错

[Feature] <title>AttributeError: 'Seq2SeqTrainer' object has no attribute 'is_deepspeed_enabled'

请使用仓库中最新的代码