papermage icon indicating copy to clipboard operation
papermage copied to clipboard

library supporting NLP and CV research on scientific papers

Results 25 papermage issues
Sort by recently updated
recently updated
newest added

hello, Could you teach me how to use this function? please show me some examples ,thanks

My setting Windows10+WSL can't run: doc = recipe.run("tests/fixtures/papermage.pdf") It seems to be downloading the 13 pages 100% normally, but then either killed off as a python script, or when in...

``` pip install papermage.[dev,predictors,visualizers] zsh: no matches found: papermage.[dev,predictors,visualizers] ``` Ubuntu 22.04, Python 3.12.2

hello, your work is amazing. My question is can papermage match the tables or figures with their captions? I have no idea.

Hi, Congrats for your great work and beautiful API! I'm especially interested in using it to create a hierarchical document based on the original PDF. My issue is that some...

I want to crop all the figures/images/tables in one pdf. Can get the page number of each figure in doc.figures[x]?

After setup, I tried 1. ``doc.figures `` 2. ``json.dump`` but the results showed only figure box's position and its metadata, how can i get figure in the pdf?

I was dealing with a document triggered this error in `papermage/rasterizers/rasterizer.py`: raise ValueError(f"Failed to attach. {len(images)} `images` != {len(pages)} pages in `doc`.") I did a deep debug found that the...

Metadata should also be a list as agreed on Slack

Hi, At the moment it is possible to derive author information via doc.authors. Is it also possible to further finegrain this information and retrieve the authors name, their institution and...