Document_Layout_Analysis-MonkAI
Document Layout Detection using MonkAI Object Detection Library
Deep learning models that take a document image as input and locate paragraphs, lines, images, and other layout elements, returning each region's position together with its label and confidence score.
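To make the shape of that output concrete, here is a minimal sketch of how detections could be represented and filtered by confidence. The `Detection` class, the `filter_detections` helper, the corner-coordinate box layout, and the sample values are all illustrative assumptions; they are not the Monk Object Detection API or real model output.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    """One detected layout region (hypothetical structure, not Monk's)."""
    label: str                               # e.g. "paragraph", "line", "image"
    score: float                             # confidence in [0, 1]
    box: Tuple[float, float, float, float]   # assumed (x_min, y_min, x_max, y_max) in pixels


def filter_detections(detections: List[Detection], min_score: float = 0.5) -> List[Detection]:
    """Keep detections at or above min_score, highest confidence first."""
    kept = [d for d in detections if d.score >= min_score]
    return sorted(kept, key=lambda d: d.score, reverse=True)


# Made-up sample values, for illustration only.
example = [
    Detection("paragraph", 0.91, (40, 120, 560, 310)),
    Detection("image", 0.34, (60, 340, 300, 520)),
    Detection("line", 0.77, (40, 90, 560, 110)),
]

for det in filter_detections(example, min_score=0.5):
    print(f"{det.label:<10} {det.score:.2f} {det.box}")
```

The threshold of 0.5 is an arbitrary starting point; in practice it would be tuned per class and per model.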
Choice of architecture
Inspired by the blog post: https://medium.com/@Intellica.AI/a-comparative-study-of-custom-object-detection-algorithms-9e7ddf6e765e
YOLOv3, Faster R-CNN, and SSD are three of the most widely used model architectures for object detection. For this task, the predictions and confidence scores of these three architectures on inference images were compared.
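One simple way to put such a comparison into numbers is to summarize, for each model, how many detections survive a confidence threshold and what their mean confidence is. The sketch below assumes the per-model confidence scores have already been collected into plain lists; the `summarize_confidences` helper and the scores shown are placeholders, not results from this project.

```python
from statistics import mean
from typing import Dict, List


def summarize_confidences(scores_by_model: Dict[str, List[float]],
                          min_score: float = 0.5) -> Dict[str, Dict[str, float]]:
    """Per model: number of detections kept and their mean confidence."""
    summary = {}
    for model, scores in scores_by_model.items():
        kept = [s for s in scores if s >= min_score]
        summary[model] = {
            "kept": len(kept),
            "mean_score": mean(kept) if kept else 0.0,
        }
    return summary


# Placeholder scores for one inference image; not measured values.
placeholder_scores = {
    "YOLOv3": [0.9, 0.7, 0.2],
    "Faster R-CNN": [0.8, 0.6],
    "SSD": [0.4, 0.3],
}

for model, stats in summarize_confidences(placeholder_scores).items():
    print(f"{model:<13} kept={stats['kept']} mean_score={stats['mean_score']:.2f}")
```

Confidence alone does not establish which model is better, since a model can be confidently wrong; a full comparison would also check the predicted boxes against labeled ground truth.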
Tutorial Blog
https://medium.com/@swapnil.ahlawat/object-detection-document-layout-analysis-using-monk-object-detection-toolkit-6c57200bde5