zydmtaichi
> @avivbrokman, can you try adding `torch_adam: true` into the optimizer section of your ds_config? As described [here](https://www.deepspeed.ai/docs/config-json/#optimizer-parameters), this will enable `torch.optim.Adam` instead of DeepSpeed's cpu_adam, and should avoid the...
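For reference, a minimal sketch of the kind of ds_config optimizer section being suggested, with illustrative values (per the linked DeepSpeed config docs, `torch_adam` lives under the optimizer's `params`):

```json
{
  "optimizer": {
    "type": "Adam",
    "params": {
      "lr": 1e-4,
      "torch_adam": true
    }
  }
}
```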
> > If it's a V100, it does not support bf16.
>
> Hello, but I used a V100 to fine-tune GLM-4 with zero3-offload in LoRA mode, and I didn't see any errors. What's going on here?
> I'd like to add a vote for @BradKML's suggestion of a re-ranker feature in the Ollama framework. It would help keep an entire RAG pipeline end-to-end within Ollama. +1
> This is because there are only 500 learned positional encodings and if you try to infer an image much higher than the default model resolution, then the number of...
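One common workaround for a fixed budget of learned positional encodings is to interpolate the embedding table up to the number of positions the larger input needs. A hypothetical sketch (the function name and shapes are illustrative, not from any specific model):

```python
import torch
import torch.nn.functional as F

def resize_pos_embed(pos_embed: torch.Tensor, new_len: int) -> torch.Tensor:
    """Linearly interpolate a (1, old_len, dim) learned positional
    embedding table to (1, new_len, dim) so a model trained with a
    fixed number of positions can handle longer inputs."""
    # F.interpolate's 1-D "linear" mode expects (batch, channels, length)
    resized = F.interpolate(
        pos_embed.permute(0, 2, 1),   # -> (1, dim, old_len)
        size=new_len,
        mode="linear",
        align_corners=False,
    )
    return resized.permute(0, 2, 1)   # -> (1, new_len, dim)

# e.g. stretch a 500-position table to cover 1024 positions
pe = torch.randn(1, 500, 64)
pe_big = resize_pos_embed(pe, 1024)
print(pe_big.shape)  # torch.Size([1, 1024, 64])
```

Whether this works well in practice depends on the model; 2-D (bicubic) interpolation over the patch grid is the more common variant for vision transformers.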