dosubot[bot]

Results 14330 comments of


                                            dosubot[bot]

`z.toJSONSchema` with `z.record(z.enum([...]), ...)` resulting in optional keys

Hi @StageDog! I'm [Dosu](https://go.dosu.dev/dosubot) and I’m helping the zod team. This is a known gap in Zod 4's JSON Schema generation: while `z.record(z.enum([...]), ...)` requires all enum keys at runtime...

`z.toJSONSchema` with `z.record(z.enum([...]), ...)` resulting in optional keys

You're correct—the omission of a `required` array for enum keys in the JSON Schema output of `z.record(z.enum([...]), ...)` is still present and was not addressed by the fix for #2623....

"requestBody" in custom tool's schema is not working, but shows errors

@a-taka Hello there! I'm here to help you with bug reports, answer your questions, and assist you while waiting for a human maintainer. Let's work together to solve the issue...

目前是否支持namespace类型从私有调整为公共类型？

目前 Apollo 并不支持将 namespace 类型从私有调整为公共类型。无论是官方文档还是代码实现，都没有提供将已创建的私有 namespace 直接转换为公共类型的功能或接口。创建 namespace 时类型已确定，后续无法修改。如果有需要，只能新建一个公共类型的 namespace，然后迁移配置数据。[参考](https://github.com/apolloconfig/apollo/blob/master/docs/zh/design/apollo-core-concept-namespace.md) *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).* --- How did I do? [Good](https://app.dosu.dev/response-feedback/fc27174a-3826-436c-94f5-54df3c04073a?feedback_type=great_response) | [Irrelevant](https://app.dosu.dev/response-feedback/fc27174a-3826-436c-94f5-54df3c04073a?feedback_type=irrelevant_answer) | [Incorrect](https://app.dosu.dev/response-feedback/fc27174a-3826-436c-94f5-54df3c04073a?feedback_type=incorrect_sources) | [Verbose](https://app.dosu.dev/response-feedback/fc27174a-3826-436c-94f5-54df3c04073a?feedback_type=too_verbose) | [Hallucination](https://app.dosu.dev/response-feedback/fc27174a-3826-436c-94f5-54df3c04073a?feedback_type=hallucination) |...

mineru对论文pdf的图像解析问题

Hi @Jarser77! I'm [Dosu](https://go.dosu.dev/dosubot) and I’m helping the MinerU team. 你反馈的“vlm-sglang-engine”在论文PDF图像和表格解析不准确，属于MinerU 2.0.x的已知问题，尤其是表格空白单元格丢失、复杂布局识别不全等。这些问题主要源于底层模型和解析逻辑的局限，2.1.1版本已修复大部分内容丢失、表格/图片匹配不准等问题，建议优先升级到2.1.1及以上版本以获得最佳效果（包括更好的内容还原和表格/图片识别）[参考1](https://github.com/opendatalab/MinerU/issues/3208)。如果暂时无法升级，可以尝试以下部分缓解措施，但无法彻底解决模型本身的识别盲区： - 启用“强制OCR”模式（force OCR），对复杂排版或中文PDF有一定帮助[参考2](https://github.com/opendatalab/MinerU/issues/3025)。 - 检查本地模型权重和环境（如PyTorch/CUDA版本）与官方Demo是否一致，避免因环境差异导致识别效果下降[参考3](https://github.com/opendatalab/MinerU/discussions/2959)。 - 调整OCR参数（如det_db_box_thresh、det_db_unclip_ratio）或切换表结构模型（如wired_table_v2、slanetplus），部分场景下能提升表格识别[参考4](https://github.com/opendatalab/MinerU/issues/3025)。 - 切换到“pipeline”后端，部分表格模型支持更细致的参数调优，但复杂表格依然有局限[参考5](https://github.com/opendatalab/MinerU/issues/3208)。注意：MinerU不会对PDF图片区域做OCR提取文字，只会直接裁剪图片区域。如需图片内文字，请用外部OCR工具处理[参考6](https://github.com/opendatalab/MinerU/issues/1111#issuecomment-2502992512)。如果需要进一步定位问题，建议用--log-level debug输出详细日志，或补充具体的模型路径、配置参数等信息。升级到2.1.1后大部分类似问题已解决[参考7](https://github.com/opendatalab/MinerU/issues/3113)。 *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).*...

mineru对论文pdf的图像解析问题

你反馈的图像和表格解析问题在MinerU 2.1.3下依然存在，目前社区和文档中没有发现完全解决此类科学论文PDF复杂布局的说明，部分问题属于模型本身的局限，尤其是表格空白单元格、复杂图文混排等场景[相关说明](https://github.com/opendatalab/MinerU/issues/3208#issuecomment-3134779903)。 vlm-sglang-engine后端主要支持采样参数（如temperature、top_p等），不提供专门针对图像/表格识别的调优项[代码参考](https://github.com/opendatalab/MinerU/blob/93a3bc776b9280a831aec4ed7f56e034bc1c2c6b/mineru/backend/vlm/sglang_engine_predictor.py)。pipeline后端可以通过--table/--formula参数控制表格和公式识别，并支持切换表结构模型（如slanetplus）、调整OCR参数（det_db_box_thresh、det_db_unclip_ratio）和批处理大小（MINERU_MIN_BATCH_INFERENCE_SIZE环境变量）等[代码参考](https://github.com/opendatalab/MinerU/blob/93a3bc776b9280a831aec4ed7f56e034bc1c2c6b/mineru/backend/pipeline/model_init.py)。建议你： 1. 确认本地模型权重和配置与官方Demo一致，必要时重新下载模型文件[相关讨论](https://github.com/opendatalab/MinerU/discussions/2959)。 2. pipeline后端可尝试--table-model slanetplus、调整OCR参数（如--det_db_box_thresh 0.3 --det_db_unclip_ratio 1.8），部分表格识别会有提升。 3. 如问题依旧，建议用--log-level debug输出详细日志，并补充mineru --version、模型路径、命令行参数等，便于进一步定位。部分复杂表格和图像布局目前仍有识别盲区，属于已知未完全解决的模型/算法限制。如需进一步分析，可上传debug日志和配置详情。 *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).* --- How did I do? [Good](https://app.dosu.dev/response-feedback/bc987011-f9ef-41ed-a6c1-c6ad8bf69d9b?feedback_type=great_response) | [Irrelevant](https://app.dosu.dev/response-feedback/bc987011-f9ef-41ed-a6c1-c6ad8bf69d9b?feedback_type=irrelevant_answer)...

Strikethrough formatting is actively stripped during PDF-to-markdown conversion

Hi @ozp! I'm [Dosu](https://go.dosu.dev/dosubot) and I’m helping the MinerU team. You are correct: MinerU currently escapes all tildes (`~`) globally during PDF-to-markdown conversion, which destroys valid strikethrough formatting (`~~text~~`) and...

[Question]: I want to select the knowledge base by passing variables. How can I pass multiple knowledge base IDs in one variable?

You can pass multiple knowledge base IDs in one variable by using a list (array) of IDs, not a comma-separated string. RAGFlow's backend logic supports this: if you set your...

[Question]: I want to select the knowledge base by passing variables. How can I pass multiple knowledge base IDs in one variable?

你可以直接在代理或工作流的变量配置中，把变量（比如 dataset_id）的 value 设置为一个原生 JSON 数组，例如： ```json { "variables": { "dataset_id": { "type": "array", "value": ["kb_id1", "kb_id2"] } } } ``` 在 Retrieval 组件（知识库检索工具）的参数里，引用这个变量即可： ```json { "component_name": "Retrieval", "params": {...

[Question]: I want to select the knowledge base by passing variables. How can I pass multiple knowledge base IDs in one variable?

你可以直接在 agent 的 API 调用里，把 `inputs.dataset_id.value` 设置为一个 JSON 数组（即原生列表），这样就能一次传递多个知识库 ID，不需要用逗号分隔字符串，也不用管 `type` 字段是不是 `line`。例如： ```json { "question": "文本知识库管理为管理非问答对的文档包括什么类型", "stream": false, "inputs": { "dataset_id": { "type": "line", // 这里 type 填什么都行，不影响 "value":...

‹
1
2
...
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
›