unipdf icon indicating copy to clipboard operation
unipdf copied to clipboard

[FEATURE] Concurrent page extraction

Open gunnsth opened this issue 4 years ago • 0 comments

Is your feature request related to a problem? Please describe. Currently extraction only supports processing pages one by one. It might be more efficient to use multiple go-routines to handle page-by-page.

Describe the solution you'd like Explore what the easiest way to support concurrency in extractor package is.

Describe alternatives you've considered Alternative and currently the best way for concurrency is on a document basis. I.e. one go-routine handling a single document.

Additional context Client's comment

We often deal with documents that are 900+ pages and serially processing these with Unidoc was. Taking a long time and this a lot of money in AWS expenses.

gunnsth avatar May 23 '20 19:05 gunnsth