studyinglover
studyinglover
我分别量化了chatglm3和chatglm3-32k两个模型,请问如何设置他们的context大小. 我看到很多文件都需要修改,请问能否出一个文档来说明一下
I am a freshman about GGML, when I read the source code, I notice that the function `struct ggml_tensor * ggml_conv_2d` https://github.com/ggerganov/ggml/blob/10e83a412717c20d57ba19f025248e18e43addf3/src/ggml.c#L6889 from the comment, wo can know the input...
### Describe the Feature I have trained a PyTorch model, and I want to directly load this model and run inference in a browser. However, the current method for loading...