Patrick6663
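+1. The chunk_text method just calls str(text) directly: `def chunk_text(self, text) -> Tuple[List[str], Dict[str, str]]: if self.dataset_name in self.datasets_no_chunk: chunks = [f"{text.get('title', '')} {text.get('text', '')}".strip() if isinstance(text, dict) else str(text)] else: chunks = [str(text)]` As far as I can tell, the chunksize and overlap set in baseconfig are never used: both branches return the whole text as a single chunk. The else branch here is where the actual chunking needs to happen.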
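For illustration, here is a minimal sketch of what the else branch could do instead. It assumes a simple character-based sliding window; the helper name `split_with_overlap` and its parameters `chunk_size` and `overlap` are hypothetical and not taken from the repository, and the project may well intend token-based or sentence-based chunking.

```python
from typing import List


def split_with_overlap(text: str, chunk_size: int, overlap: int) -> List[str]:
    """Split text into character windows of at most chunk_size characters.

    Hypothetical helper: consecutive windows share `overlap` characters.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    if not text:
        return []

    step = chunk_size - overlap  # how far each window advances
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window reaches the end, so no redundant tail chunk is emitted.
        if start + chunk_size >= len(text):
            break
    return chunks
```

In the else branch this would replace `chunks = [str(text)]` with something like `chunks = split_with_overlap(str(text), chunk_size, overlap)`, with the two values read from wherever the config actually stores them (I have not checked the attribute names).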