A Tesseract OCR wrapper for Nim

Nim Tesseract 👑👁

Nim Tesseract is a Nim wrapper for the Tesseract OCR library that binds to its dynamic library.

Installation 👇

$ nimble install nimtesseract

Usage 🌷

  1. Install (lib)tesseract via your package manager, or put the tesseract so/dll/dylib file in the project directory. E.g. for Arch Linux:

$ pacman -Sy tesseract

for macOS:

$ brew install tesseract

  2. Download trained data from https://github.com/tesseract-ocr/tessdata or https://github.com/tesseract-ocr/tessdata_fast
  3. Done ✅
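
The trained-data step can be sketched as follows. This is a minimal sketch under stated assumptions, not the project's documented procedure: it assumes you want the English model (`eng.traineddata`) from the `main` branch of tessdata_fast, that `curl` is installed, and that the Tesseract library finds the data through the `TESSDATA_PREFIX` environment variable (the same variable used in the Development section). The `TESSDATA_DIR` name is hypothetical.

```shell
# Hypothetical target directory for trained data; override by setting TESSDATA_DIR.
TESSDATA_DIR="${TESSDATA_DIR:-$PWD/tessdata}"
mkdir -p "$TESSDATA_DIR"

# Assumption: English model from the main branch of tessdata_fast.
# -f fails on HTTP errors, -L follows GitHub's redirect to the raw file.
curl -fsSL -o "$TESSDATA_DIR/eng.traineddata" \
  "https://github.com/tesseract-ocr/tessdata_fast/raw/main/eng.traineddata"

# Point Tesseract at the directory that contains the .traineddata files.
export TESSDATA_PREFIX="$TESSDATA_DIR"
```

Run this in the same shell session as your program so the exported variable is visible to it. Swap `eng` for another language code if you need a different model.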

Example 🤔

import nimtesseract

echo imageToText("file.png")

More examples are in the examples folder.

Development 🔩

Download the trained data and put it into the src folder.

⚠️ Outdated, but still useful as a reference. Don't rely on this.

$ cd src
$ TESSDATA_PREFIX=$(pwd) nim r -d:pixieUseStb nimtesseract.nim

Run tests with nimble:

$ nimble test

capi.h reference: https://github.com/tesseract-ocr/tesseract/blob/main/include/tesseract/capi.h

Credits 👻

Inspired by https://github.com/Altabeh/tesseract-ocr-wrapper

License 📕

This project is released under the Unlicense. It is free and unencumbered software released into the public domain.