document-extraction topic
                        List
                        document-extraction repositories
                    
                Image-document-extract-and-correction
                            
                                69
                            
                            
                        
                        Stars
                    
                            
                                18
                            
                            
                        
                        Forks
                    Watchers
                    数字图像课程大作业,实现图片中文档提取与矫正。整体思路是通过hough变换检测出直线,进而得到角点,最后经过投影变换,进行矫正。整个项目只用到了opencv的IO操作(包括手写卷积,hough哈夫变换,投影变换等等)
ingest-file
                            
                                52
                            
                            
                        
                        Stars
                    
                            
                                26
                            
                            
                        
                        Forks
                    Watchers
                    Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
konfuzio-sdk
                            
                                58
                            
                            
                        
                        Stars
                    
                            
                                10
                            
                            
                        
                        Forks
                    Watchers
                    OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e. g....
pydoxtools
                            
                                59
                            
                            
                        
                        Stars
                    
                            
                                10
                            
                            
                        
                        Forks
                    Watchers
                    Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable pipelines and diverse sources for your projects.