Embedding similarity scores stuck around 0.541 when using BGE-Code-V1 as retriever in RAG

Open OpEnD17 opened this issue 3 months ago • 0 comments

When I use BGE-Code-V1 (Qwen2.5-Coder-1.5B based) as retriever in my RAG pipeline, I find that query–chunk similarity scores are always around ~0.541, regardless of the query and document content.

Task: RAG retrieval (query: incomplete code; chunk: code blocks(almost function-level)) Similarity function: cosine similarity

Sep 03 '25 03:09 OpEnD17