fryng
### Is there an existing issue for this?

- [X] I have searched the existing issues

### Current Behavior

`model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4-qe", trust_remote_code=True).float()`

Loading the int4-quantized model fails with the following error: AttributeError: 'NoneType' object has no attribute 'int4WeightExtractionFloat'...
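After multiple users have been accessing the service for a while, the server machine itself can no longer reach the network, although other users can still access the server normally.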
The `send` function in the autos script does not support history. How can history be added?
Suggestion: add fastllm support to speed up ChatGLM inference.
GPU: NVIDIA GeForce RTX 2080 Ti. Max memory: 22.0 GB. Platform: Windows. Torch: 2.6.0+cu124. CUDA: 7.5. CUDA Toolkit: 12.4. Triton: 3.1.0 Bfloat16 = FALSE. FA [Xformers = 0.0.29.post3. FA2 =...