doctr
doctr copied to clipboard
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
### 🚀 The feature This issue is to be used to track the roadmap of docTR for release v1.1.0, and collect feedback from users & contributors. ### Breaking Change -...
Bumps the github-actions group with 4 updates in the / directory: [actions/checkout](https://github.com/actions/checkout), [JamesIves/github-pages-deploy-action](https://github.com/jamesives/github-pages-deploy-action), [actions/upload-artifact](https://github.com/actions/upload-artifact) and [actions/download-artifact](https://github.com/actions/download-artifact). Updates `actions/checkout` from 5 to 6 Release notes Sourced from actions/checkout's releases. v6.0.0 What's...
### Bug description There is a [bug](https://github.com/mindee/doctr/blob/07443a04502d3ef7bd4bf4f067305399a090fe29/doctr/transforms/modules/base.py#L195) in defining image shape at RandomCrop. An input image has shapes [C, H, W], which makes buggy further logic, as number of channels...
### 🚀 The feature I use docTR as OCR pre-processing before I send the text data into a LLM to extract data. However, a lot of information is encoded in...
Updates the requirements on [huggingface-hub](https://github.com/huggingface/huggingface_hub) to permit the latest version. Release notes Sourced from huggingface-hub's releases. [v1.0.1] Remove aiohttp from extra dependencies In huggingface_hub v1.0 release, we've removed our dependency...
Updates the requirements on [pypdfium2](https://github.com/pypdfium2-team/pypdfium2) to permit the latest version. Release notes Sourced from pypdfium2's releases. 5.0.0 Changes (Release 5.0.0) Summary (pypdfium2) API changes Rendering / Bitmap Removed PdfDocument.render() (see...
### Bug description Using Scentence Tranfromer with megablocks (recommended) causes error ``` /xxx/.pyenv/versions/3.11.14/lib/python3.11/site-packages/triton/runtime/driver.py:8: in _create_driver raise RuntimeError(f"{len(actives)} active drivers ({actives}). There should only be one.") E RuntimeError: 0 active drivers...
### Bug description I created the labes.json file and the images. I run the commad: ``` python references\detection\train.py db_resnet50 --epochs 20 --train_path C:\RBEE\DO\DetectionTrain --val_path C:\RBEE\DO\DetectionValidate --pretrained --name DtectDO --output_dir C:\RBEE\DO\DetectionTrain\models...
> ```import torch > from doctr.models import ocr_predictor, crnn_vgg16_bn, db_resnet50 > from doctr.io import DocumentFile > from doctr.datasets import VOCABS > > # Vocab copied from the indic-doctr repo >...