magicpose comments

Results 3 comments of


                                            magicpose

回答的有点太慢了

collection_router.py ``` def invoke(self, query: str, **kwargs) -> Tuple[List[str], int]: consume_tokens = 0 collection_infos = self.vector_db.list_collections() vector_db_search_prompt = COLLECTION_ROUTE_PROMPT.format( question=query, collection_info=[ { "collection_name": collection_info.collection_name, "collection_description": collection_info.description, } for collection_info in...

这一套prompt是不是对中文的知识库不太友好？我感觉中间的判断语音相似性的模型输出不了东西。最终的summary结果也非常简陋。

@echodyn @Howe829 文档分块有什么更好的方案吗，替换这部分就行了吧。尝试让ds写了一套方案未验证：以下是一个兼容中英文的智能分块完整实现方案，结合了LLM实时决策、语义分块和混合回退策略 ``` import re import requests from typing import List, Dict, Optional from pydantic import BaseModel from langdetect import detect from tenacity import retry, stop_after_attempt, wait_exponential...

Different llm with diiferent errors

``` config.set_provider_config("embedding", "OpenAIEmbedding", {"model": "bge-m3", "base_url": "http://192.168.23.10/v1","dimension": 1024}) ``` 最好再确认一下你的embedding模型支持的维度，我也遇到这个情况，最后发现设定的Dimension如果超了模型支持的Dimension值，就会出现这个错误，并且embedding返回的结果维度翻倍。我的设置768就好了