Document_Layout_Analysis-MonkAI
Document_Layout_Analysis-MonkAI copied to clipboard

→

Metadata

DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

Readme
Issues

trafficstars

Document Layout Detection using MonkAI Object Detection Library

Deep learning models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

Choice of architecture

-Inspiration from the blog- https://medium.com/@Intellica.AI/a-comparative-study-of-custom-object-detection-algorithms-9e7ddf6e765e

Yolov3, FasterRCNN & SSD are broadly top 3 model architectures that are used for Object detection. So, for this task, prediction and confidence on inference images of these 3 architectures have been compared.

Tutorial Blog

https://medium.com/@swapnil.ahlawat/object-detection-document-layout-analysis-using-monk-object-detection-toolkit-6c57200bde5

About

DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

deep-learning

object-detection

faster-rcnn

yolov3

document-analysis

ssd512

26

Stars

6

Forks

Watchers

Owner

swapnil-ahlawat

← Metadata

26

Stars

6

Forks

Watchers

Owner

swapnil-ahlawat

Metadata

DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confidence scores.

Back

Document_Layout_Analysis-MonkAI Document_Layout_Analysis-MonkAI copied to clipboard

Metadata

Document Layout Detection using MonkAI Object Detection Library

Choice of architecture

Tutorial Blog

← Metadata

Owner

Metadata

Document_Layout_Analysis-MonkAI
Document_Layout_Analysis-MonkAI copied to clipboard