
[Question]: how to serve bge-reranker-v2-m3 and bge-large-zh-v1.5 in RAGFlow?

Open leachli opened this issue 10 months ago • 1 comments

Describe your problem

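I'm currently using Dify together with a RAGFlow knowledge base. RAGFlow ships with the bge-reranker-v2-m3 and bge-large-zh-v1.5 models. To make full use of them, I'd like to expose these two models as services so that they can also be used from Dify. How can I achieve this? @KevinHuSh, could you advise?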

leachli avatar Mar 11 '25 01:03 leachli

It's not supported yet. You could use Ollama, Xinference, etc. to serve the embedding model.
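
As a rough illustration of that suggestion, here is a minimal sketch of a client calling an externally served embedding model. It assumes the server exposes an OpenAI-compatible `/v1/embeddings` route, which both Ollama and Xinference offer. The base URL, the model name, and the helper names `build_embedding_request` and `embed` are placeholders for illustration, not anything taken from RAGFlow; the model must be registered on the server under whatever name you pass.

```python
import json
import urllib.request


def build_embedding_request(base_url, model, texts):
    """Build a POST request for an OpenAI-compatible /v1/embeddings endpoint.

    base_url and model are assumptions: use the address of your own
    Ollama/Xinference server and the name the model is registered under.
    """
    payload = {"model": model, "input": list(texts)}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(base_url, model, texts, timeout=30):
    """Send the request and return one embedding vector per input text."""
    req = build_embedding_request(base_url, model, texts)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        body = json.load(resp)
    # OpenAI-style responses list results under "data",
    # each entry carrying an "embedding" list of floats.
    return [item["embedding"] for item in body["data"]]
```

With a server running, a call such as `embed("http://localhost:11434", "<your-model-name>", ["hello"])` would return the vectors; the same server address could then be registered as a model provider in both RAGFlow and Dify so that the model is only loaded once.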

KevinHuSh avatar Mar 11 '25 03:03 KevinHuSh

Is there a switch in RAGFlow to turn off the built-in rerank model? I have deployed the model myself and want to plug it in through configuration, so that the same model is not started twice.

leachli avatar Mar 21 '25 08:03 leachli

You could deploy the slim version of the Docker image, which has no built-in embedding or rerank models.

KevinHuSh avatar Mar 24 '25 05:03 KevinHuSh