pdf2dataset issues

Add return_json option

icaropires

feature-request

Page counting just once

Save the page counting and don't perform it again when resuming the processing

icaropires

feature-request

Create .rpm package

icaropires

enhancement

good first issue

Create .deb package

icaropires

enhancement

good first issue

Start processing before page counting ends

When the IO is too slow, probably it's a good a idea to start the processing before the page counting ends: - [ ] When `chunksize` is provided, start processing...

icaropires

enhancement

Test CLI

icaropires

enhancement

Create docker files

As we have many non python dependencies, having a ready to use `Dockerfile` would be very handy.

icaropires

enhancement

good first issue

Optimize extraction of features that has same value for all pages

Currently, one feature from the document (equal value for all pages) will be extracted for each page

icaropires

enhancement

Generate docs page

icaropires

documentation

Add code documentation

- [ ] Document routines - [ ] Document module - [ ] Document classes

icaropires

documentation

pdf2dataset
pdf2dataset copied to clipboard

Metadata

Add return_json option

Page counting just once

Create .rpm package

Create .deb package

Start processing before page counting ends

Test CLI

Create docker files

Optimize extraction of features that has same value for all pages

Generate docs page

Add code documentation

← Metadata

Owner

Metadata

pdf2dataset pdf2dataset copied to clipboard

Metadata

← Metadata

Owner

Metadata

pdf2dataset
pdf2dataset copied to clipboard