layout-parser icon indicating copy to clipboard operation
layout-parser copied to clipboard

A Unified Toolkit for Deep Learning Based Document Image Analysis

Results 118 layout-parser issues
Sort by recently updated
recently updated
newest added

**Describe the bug** Model instantiation throwing 500 error **To Reproduce** Steps to reproduce the behavior: 1. What command or script did you run? ```python model = lp.Detectron2LayoutModel( config_path ='lp://PrimaLayout/mask_rcnn_R_50_FPN_3x/config', #...

bug

Hi, I'm using layout parser to perform OCR on a research paper, but on almost every page of the pdf the text boxes are not properly aligned. For example I...

enhancement

**Describe the bug** The bounding boxes returned by, for example, `ocr_agent.gather_data(res, agg_level=lp.TesseractFeatureType.BLOCK)` don't reflect the block size in the initial data. Looking at the code, I think by removing elements...

bug

When I try to download a layout model, at some point of the download (usually around 20 %), the download speed goes from 5 mb/s to close to zero. Then...

bug

import cv2 try: from PIL import Image except ImportError: import Image import pytesseract import layoutparser as lp ocr_agent = lp.TesseractAgent() pytesseract.pytesseract.tesseract_cmd = "Tesseract-OCR/tesseract.exe" path = "images/a.jpg" img = cv2.imread(path) #...

bug

**Motivation** Weights & Biases is the machine learning platform for developers to build better models faster. Use W&B's lightweight, interoperable tools to quickly track experiments, version and iterate on datasets,...

Thanks for building this library. Used this code to detect if an image contains graphs and charts. ```python layout = 'lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config' model = lp.Detectron2LayoutModel(layout, extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.8], label_map={0: "Text", 1: "Title",...

**Describe the bug** In the example given [here](https://github.com/Layout-Parser/layout-parser/blob/master/docs/example/parse_ocr/index.rst) the function group_blocks_by_distance doesn't sort within row in the x direction. I came up with a simple fix to this as implemented...

bug

There are no steps to understand how to train a model on a custom dataset. I'm really looking to use this in my current workflow and kudos for developing this!

enhancement

I am working with a large set of historical tables and need to extract the rows/columns in them. I ran various layout models from the Model Zoo, but the only...

bug