chatglm.cpp icon indicating copy to clipboard operation
chatglm.cpp copied to clipboard

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

Results 148 chatglm.cpp issues
Sort by recently updated
recently updated
newest added

chatglm3-6b -t q4_0 -o chatglm3-ggml.bin ![1](https://github.com/li-plus/chatglm.cpp/assets/162705053/b0b680aa-8ab6-41ea-bbc6-a7e0779c6108)

(base) root@ai:~/chatglm.cpp/examples# python3 web_demo.py -m ../chatglm-ggml.bin Traceback (most recent call last): File "/root/chatglm.cpp/examples/web_demo.py", line 24, in pipeline = chatglm_cpp.Pipeline(args.model) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/root/anaconda3/lib/python3.11/site-packages/chatglm_cpp/__init__.py", line 33, in __init__ super().__init__(str(model_path)) RuntimeError: /tmp/pip-install-g7k57662/chatglm-cpp_f0d0b14b283d4034978c4aac7fe3f802/chatglm.cpp:323 check...

支持minicpm-2b模型

能分享一下Windows10 平台编译好的文件吗,我都电脑不知道啥情况,编译过程中总是蓝屏

![image](https://github.com/li-plus/chatglm.cpp/assets/47915665/b908a6d7-ee40-431e-9db9-7a0df5fa32e3)

![image](https://github.com/li-plus/chatglm.cpp/assets/47915665/6a7d1867-29d3-4904-9da5-bf474e5dea24) 出错了 Illegal instruction

In file included from /Users/a1021500667/Documents/gpt/chatglm.cpp/chatglm.cpp:1: /Users/a1021500667/Documents/gpt/chatglm.cpp/chatglm.h:546:13: error: no matching function for call to 'ggml_scale_inplace' ggml_scale_inplace(gctx, attn_scores, ggml_new_f32(gctx, 1.f / std::sqrt(head_size)))); ^~~~~~~~~~~~~~~~~~ /Users/a1021500667/Documents/gpt/chatglm.cpp/third_party/ggml/include/ggml/ggml.h:1158:35: note: candidate function not viable: no known conversion...

73.13 [ 97%] Built target chatglm 73.18 [ 98%] Building CXX object CMakeFiles/main.dir/main.cpp.o 76.29 [100%] Linking CXX executable bin/main 76.50 /usr/bin/ld: lib/libchatglm.a(chatglm.cpp.o): in function `chatglm::tensor_assign_buffers(ggml_tensor*)': 87.70 /chatglm.cpp/chatglm.cpp:113: undefined reference to...

一张显卡放不下单个模型的情况下,支持多卡推理吗?

我分别量化了chatglm3和chatglm3-32k两个模型,请问如何设置他们的context大小. 我看到很多文件都需要修改,请问能否出一个文档来说明一下