AdvancedLiterateMachinery issues

DocXChain OCR 识别较慢，CPU 占用率过高

1

DocXChain OCR 识别较慢，CPU 占用率过高，但是 GPU 占用率确不高，请问是什么原因导致的？

lore-tsr训练时输出的这些参数都代表什么意义，哪位大佬可以帮忙解释下啊

8

lore-tsr训练时输出的这些参数都代表什么意义，哪位大佬可以帮忙解释下啊 ![lore-tsr](https://github.com/user-attachments/assets/c65fa9c8-639e-4431-a839-1fd167384b4d)

happybuby

采用推荐脚本训练，在3万步左右会出现Loss is nan, stopping training，又遇到的嘛？

3

Loss is nan, stopping training {'pt_loss': tensor(nan, device='cuda:0', grad_fn=), 'poly_loss': tensor(nan, device='cuda:0', grad_fn=), 'rec_loss': tensor(3.2648, device='cuda:0', grad_fn=)}

JinJiTongXue

有DocXLayout_231012.pth模型的测试结果吗？

Note-Liu

TextDetection每页开销1~4秒？

1

请问下，cv_resnet18_ocr-detection-line-level_damo，这么小的模型为啥开销这么大，是有什么config参数没改吗？除了ocr-detection, 其他模块模型开销都在1s以内我的环境是：cuda11.8, cudnn8.6.0, tensorflow2.11.0

SidneyRey

TypeError in create_grid_input.py: Mismatch in save_pkl_file arguments

1

# TypeError in create_grid_input.py: Mismatch in save_pkl_file arguments ## Issue Summary A TypeError is occurring in the `create_grid_input.py` script due to a mismatch between the number of arguments in the...

LUCIFERX92

Update create_grid_input.py

1

# TypeError in create_grid_input.py ## Description of the Issue When running the script, the following error occurs: ``` File "/Users/i_manav.gupta/VGT_V4/VGT/object_detection/create_grid_input.py", line 215, in save_pkl_file(grid, args.output, f"page_{page}", page, args.model) TypeError: save_pkl_file()...

LUCIFERX92

Access Denied Error 409 When Running VGT Inference Due to Failure in Downloading Model Weights

2

Running the VGT inference script fails because the model file at https://layoutlm.blob.core.windows.net/dit/dit-pts/dit-base-224-p16-500k-62d53a.pth cannot be publicly accessed due to an HTTP 409 error.

omidsrezai

Training LevOCR

Does anyone train LevOCR with long sentences ? NAR or Iterative Transformer is quite bad when they recognize long sequences, I want to know LevOCR can recognize long sentences like...

Holmes2002

Training hardware setting for Omniparser

2

Please let me know the training hardware setting(GPU) for Omniparser How long does it take to pretrain the model?

jypark92

AdvancedLiterateMachinery
AdvancedLiterateMachinery copied to clipboard

Metadata

DocXChain OCR 识别较慢，CPU 占用率过高

lore-tsr训练时输出的这些参数都代表什么意义，哪位大佬可以帮忙解释下啊

采用推荐脚本训练，在3万步左右会出现Loss is nan, stopping training，又遇到的嘛？

有DocXLayout_231012.pth模型的测试结果吗？

TextDetection每页开销1~4秒？

TypeError in create_grid_input.py: Mismatch in save_pkl_file arguments

Update create_grid_input.py

Access Denied Error 409 When Running VGT Inference Due to Failure in Downloading Model Weights

Training LevOCR

Training hardware setting for Omniparser

← Metadata

Owner

Metadata

AdvancedLiterateMachinery AdvancedLiterateMachinery copied to clipboard

Metadata

← Metadata

Owner

Metadata

AdvancedLiterateMachinery
AdvancedLiterateMachinery copied to clipboard