2257396011 issues

Results 8 issues of

2257396011

如何获得每个字的坐标

在使用ocr的时候我看获取到的坐标一般都是一句话的坐标位置，怎么能够改为ocr按字来识别，这样就能够在json文件中得到整个pdf中全部字的坐标了。

enhancement

### Description of the bug | 错误描述使用表格识别功能后报错： Traceback (most recent call last): File "D:\wzh\MinerU-master\demo\magic_pdf_parse_main.py", line 136, in pdf_parse_main(pdf_path) │ └ 'D:/wzh/1.pdf' └ > File "D:\wzh\MinerU-master\demo\magic_pdf_parse_main.py", line 121, in pdf_parse_main...

bug

使用ocr方式提取文字的代码位置

我在magic_model.py中找到了提取ocr文字的代码，但是我看pdf_parse_union_core.py中只用了get_all_spans来获取ocr提取的文字，然后用txt方式的话会替换一下，ocr方式的话直接用这个的返回值不需要替换。但是一直没有用到get_ocr_text这个函数，所以想问一下使用ocr提取的代码是哪个。 ![屏幕截图 2024-08-07 105453](https://github.com/user-attachments/assets/eca33c72-3e0f-4e29-a40b-faaef6904168)

enhancement

公式检测时间cpu才3秒，换了gpu一下30秒了，其他的话都快了很多。然后运行一会之后会报错RuntimeError: CUDA error: unspecified launch failure

### Description of the bug | 错误描述 [08/09 08:35:21 d2.checkpoint.detection_checkpoint]: [DetectionCheckpointer] Loading from /home/founder/MinerU-master/MinerU-master/PDF-Extract-Kit/Layout/model_final.pth ... [08/09 08:35:21 fvcore.common.checkpoint]: [Checkpointer] Loading from /home/founder/MinerU-master/MinerU-master/PDF-Extract-Kit/Layout/model_final.pth ... 2024-08-09 08:35:23.885 | INFO | magic_pdf.model.pdf_extract_kit:__init__:132 -...

bug

ocr加速包下载的时候nvidia-nccl-cu11会与原库有冲突

### Description of the bug | 错误描述 (MinerU) founder@founder:~/MinerU/MinerU-master$ python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/ Looking in indexes: https://www.paddlepaddle.org.cn/packages/stable/cu118/ Collecting paddlepaddle-gpu==3.0.0b1 Downloading https://paddle-whl.bj.bcebos.com/stable/cu118/paddlepaddle-gpu/paddlepaddle_gpu-3.0.0b1-cp310-cp310-linux_x86_64.whl (845.8 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 845.8/845.8 MB 1.5...

bug

2257396011

如何获得每个字的坐标

表格识别报错

使用ocr方式提取文字的代码位置

公式检测时间cpu才3秒，换了gpu一下30秒了，其他的话都快了很多。然后运行一会之后会报错RuntimeError: CUDA error: unspecified launch failure

ocr加速包下载的时候nvidia-nccl-cu11会与原库有冲突

将模型加载和解析的内容分开

关于model.json文件

可不可以直接使用代码运行