hocr topic
                        List
                        hocr repositories
                    
                gImageReader
                            
                                1.5k
                            
                            
                        
                        Stars
                    
                            
                                184
                            
                            
                        
                        Forks
                    Watchers
                    A Gtk/Qt front-end to tesseract-ocr.
PdfPig
                            
                                1.5k
                            
                            
                        
                        Stars
                    
                            
                                220
                            
                            
                        
                        Forks
                    Watchers
                    Read and extract text and other content from PDFs in C# (port of PDFBox)
DocumentLayoutAnalysis
                            
                                530
                            
                            
                        
                        Stars
                    
                            
                                59
                            
                            
                        
                        Forks
                    Watchers
                    Document Layout Analysis resources repos for development with PdfPig.
ocr-fileformat
                            
                                175
                            
                            
                        
                        Stars
                    
                            
                                23
                            
                            
                        
                        Forks
                    Watchers
                    Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
kraken
                            
                                814
                            
                            
                        
                        Stars
                    
                            
                                142
                            
                            
                        
                        Forks
                    Watchers
                    OCR engine for all the languages
hOCR-to-ALTO
                            
                                51
                            
                            
                        
                        Stars
                    
                            
                                14
                            
                            
                        
                        Forks
                    Watchers
                    Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
mirador-textoverlay
                            
                                50
                            
                            
                        
                        Stars
                    
                            
                                13
                            
                            
                        
                        Forks
                    Watchers
                    Text Overlay plugin for Mirador 3
ocr-conversion
                            
                                71
                            
                            
                        
                        Stars
                    
                            
                                3
                            
                            
                        
                        Forks
                    Watchers
                    Conversions between various OCR formats
ocr-gt-tools
                            
                                47
                            
                            
                        
                        Stars
                    
                            
                                11
                            
                            
                        
                        Forks
                    Watchers
                    Ergonomic line-by-line transcription of scanned text.