
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Results 139 doctr issues

### 🚀 The feature This issue is to be used to track the roadmap of docTR for release v1.1.0, and collect feedback from users & contributors. ### Breaking Change -...

critical

topic: documentation
topic: build
topic: ci
module: models
ext: scripts
ext: tests
type: breaking change
module: datasets
ext: references
module: transforms
topic: text detection
topic: text recognition

Bumps the github-actions group with 4 updates in the / directory: [actions/checkout](https://github.com/actions/checkout), [JamesIves/github-pages-deploy-action](https://github.com/jamesives/github-pages-deploy-action), [actions/upload-artifact](https://github.com/actions/upload-artifact) and [actions/download-artifact](https://github.com/actions/download-artifact). Updates `actions/checkout` from 5 to 6 Release notes Sourced from actions/checkout's releases. v6.0.0 What's...

topic: ci

### Bug description There is a [bug](https://github.com/mindee/doctr/blob/07443a04502d3ef7bd4bf4f067305399a090fe29/doctr/transforms/modules/base.py#L195) in how RandomCrop determines the image shape. The input image has shape [C, H, W], which breaks the subsequent logic, as number of channels...

type: bug
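
To illustrate the kind of shape handling the report above is about, here is a minimal sketch of computing a random crop box from a channels-first `[C, H, W]` shape. It is not docTR's implementation: the function name `random_crop_box`, its `scale` parameter, and the returned `(top, left, crop_h, crop_w)` layout are all hypothetical and chosen only for illustration.

```python
import random


def random_crop_box(shape, scale=(0.5, 1.0), rng=None):
    """Return (top, left, crop_h, crop_w) for a channels-first image shape.

    Hypothetical helper, not part of docTR. `shape` is assumed to be (C, H, W).
    """
    if len(shape) != 3:
        raise ValueError("expected a (C, H, W) shape")

    # Channels-first layout: height and width are the LAST two dimensions.
    # Reading them from shape[:2] would treat the channel count as the height.
    _, height, width = shape

    rng = rng or random.Random()
    factor = rng.uniform(*scale)

    # Clamp the crop size so it always fits inside the image.
    crop_h = min(height, max(1, round(height * factor)))
    crop_w = min(width, max(1, round(width * factor)))

    # randint is inclusive on both ends, so the crop never leaves the image.
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)
    return top, left, crop_h, crop_w


if __name__ == "__main__":
    print(random_crop_box((3, 100, 200), rng=random.Random(0)))
```

Unpacking the shape as `_, height, width` keeps the channel dimension out of the crop arithmetic, which is the distinction the report draws attention to.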

### 🚀 The feature I use docTR as OCR pre-processing before I send the text data into a LLM to extract data. However, a lot of information is encoded in...

type: enhancement

Updates the requirements on [huggingface-hub](https://github.com/huggingface/huggingface_hub) to permit the latest version. Release notes Sourced from huggingface-hub's releases. [v1.0.1] Remove aiohttp from extra dependencies In huggingface_hub v1.0 release, we've removed our dependency...

topic: build

Updates the requirements on [pypdfium2](https://github.com/pypdfium2-team/pypdfium2) to permit the latest version. Release notes Sourced from pypdfium2's releases. 5.0.0 Changes (Release 5.0.0) Summary (pypdfium2) API changes Rendering / Bitmap Removed PdfDocument.render() (see...

topic: build

### Bug description Using Sentence Transformer with megablocks (recommended) causes an error ``` /xxx/.pyenv/versions/3.11.14/lib/python3.11/site-packages/triton/runtime/driver.py:8: in _create_driver raise RuntimeError(f"{len(actives)} active drivers ({actives}). There should only be one.") E RuntimeError: 0 active drivers...

type: bug

### Bug description I created the labels.json file and the images. I ran the command: ``` python references\detection\train.py db_resnet50 --epochs 20 --train_path C:\RBEE\DO\DetectionTrain --val_path C:\RBEE\DO\DetectionValidate --pretrained --name DtectDO --output_dir C:\RBEE\DO\DetectionTrain\models...

type: bug

> ```
> import torch
> from doctr.models import ocr_predictor, crnn_vgg16_bn, db_resnet50
> from doctr.io import DocumentFile
> from doctr.datasets import VOCABS
>
> # Vocab copied from the indic-doctr repo
> ...