magicpose
magicpose
collection_router.py ``` def invoke(self, query: str, **kwargs) -> Tuple[List[str], int]: consume_tokens = 0 collection_infos = self.vector_db.list_collections() vector_db_search_prompt = COLLECTION_ROUTE_PROMPT.format( question=query, collection_info=[ { "collection_name": collection_info.collection_name, "collection_description": collection_info.description, } for collection_info in...
@echodyn @Howe829 文档分块有什么更好的方案吗,替换这部分就行了吧。 尝试让ds写了一套方案未验证: 以下是一个兼容中英文的智能分块完整实现方案,结合了LLM实时决策、语义分块和混合回退策略 ``` import re import requests from typing import List, Dict, Optional from pydantic import BaseModel from langdetect import detect from tenacity import retry, stop_after_attempt, wait_exponential...
``` config.set_provider_config("embedding", "OpenAIEmbedding", {"model": "bge-m3", "base_url": "http://192.168.23.10/v1","dimension": 1024}) ``` 最好再确认一下你的embedding模型支持的维度,我也遇到这个情况,最后发现设定的Dimension如果超了模型支持的Dimension值,就会出现这个错误,并且embedding返回的结果维度翻倍。我的设置768就好了