zongshuai zhang
zongshuai zhang
标题消失问题
### Description of the bug | 错误描述 解析完之后的标题为空,只有序号 [轮胎设计_制造技术和法规进展及民族轮胎企业技术创新战略_危银涛.pdf](https://github.com/user-attachments/files/16654753/_._.pdf) [我国子午线轮胎技术概况_王锋.pdf](https://github.com/user-attachments/files/16654748/_.pdf) ### How to reproduce the bug | 如何复现 magic-pdf -p test.pdf ### Operating system | 操作系统 Linux ### Python version...
### Self Checks - [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones. - [X] I confirm that I am using English to submit this...
### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have searched for existing...
### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have searched for existing...
using gradio to generate (masterpiece, best quality, highres:1),(1girl, solo:1),(eye blinks:1.8),(head wave:1.3) https://github.com/user-attachments/assets/cdf09d4c-5848-4b06-a6b0-caab67e34757
使用AED模型,长音频按照每1分钟切分进行识别,会概率出现重复的情况, 例如: “但是你是我一看以后,它应该是需要呃一个时间段也是这样考虑。但是考核。对他这个整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的整体的。” 包括大量的“嗯嗯嗯嗯”,“啊啊啊啊啊”,“哈哈哈哈哈哈哈”重复的情况。 模型推理参数: "use_gpu": 1, "beam_size": 1, "nbest": 1, "decode_max_len": 0, "softmax_smoothing": 1.25, "aed_length_penalty": 0.6, "eos_penalty": 1.0