kraken icon indicating copy to clipboard operation
kraken copied to clipboard

OCR engine for all the languages

Results 75 kraken issues
Sort by recently updated
recently updated
newest added

This is an implementation for unsupervised pretraining of recognition model weights based on an image inpainting surrogate task that aims to reconstruct randomly sampled masked patches from the initial convolutional...

kraken already shows a good message for the segmentation model, but the corresponding message for the recognition model just says 'default': Loading ANN /home/stweil/src/github/mittagessen/kraken/kraken/blla.mlmodel ✓ Loading ANN default ✓ The...

This problem occurs with 11 out of a set of ~646 PNGs, all of which plopped out of the exact same processing pipeline, scanned on exactly the same hardware. Both...

Hi @mittagessen Just pinpointing a little oddity there : Running the following on 4.1.2 and torch 1.11: (Note that `ketos_sample` actually contains the `Italian` subdir from the [BiblIA dataset](https://zenodo.org/record/5167263)) ```python...

Hello @mittagessen, The training docs mention being able to slice off the final layers of a previously-trained model to speed up inference when alphabets are mismatched between the previous model...

Latest releases only produce an ABBYY file without any OCR results: ``` ```

Since you mentioned some interest in this error, I’ll give full details here. ![grafik](https://user-images.githubusercontent.com/23742686/171066767-a9ea5038-41da-4caa-b13c-0d972329e3f1.png) This is the result of a baseline segmentation using a custom model. The image file is...

I have trained a kraken recognition model on 20 PAGE XML files transcribed manually with Transkribus. Up to the evaluation stage everything seems to work just fine and `ketos test...

PyTorch recently introduced support for Apple's [Metal Performance Shaders (MPS)](https://developer.apple.com/documentation/metalperformanceshaders), see https://pytorch.org/blog/introducing-accelerated-pytorch-training-on-mac/. Using the new mps backend for PyTorch should accelerate training on Apple M1 machines a lot and is...

This pull request ist at the moment just to synchronize the discussion on how to proceed. It introduces a new CLI command, `ketos pagegen` that takes as input * a...