pdfannots
pdfannots copied to clipboard
Extracts and formats text annotations from a PDF file
Hi folks, thanks for your efforts with this tool. I was wondering if there are plans to add rectangular (image) selection to the tool? The workflow that I imagine would...
Thank you for the new printer interface. It enabled me to add three new output formats: 1. **jsonl** Similar as the original json, but with the output of one file...
It would be really great to differentiate multi-color highlights in a document. Is that possible on pdfannots' end, or is it rather something that has to be done on pdfminer.six'...
With [this PDF-file](https://github.com/0xabu/pdfannots/files/4470936/test.pdf) the words are not split. It's an OCR-scan. I tried modifying the word_margin in LAParams to no avail. When exporting the highlights using PDF Expert (my macOS-PDF...
Hey all, Thank you all for this fantastic script! It works very well, although I found a pdf (attached) whose highlights are being severely truncated. I tweaked `boxhit` function to...
Many thanks for developing this wonderful tool! Just an idea for the format of output file like this: https://forum.zettlr.com/discussion/94/zotero-as-zettelkasten I think it would make the citation of the notes and...
It would be nice if the program could extract "Caret" annotations as well, which are the opposite of StrikeOut (suggestion of new text within a context).
Thank you so much for developing this module, it's fantastic. Is it possible to implement the function of simultaneously outputting an annotation and the entire sentence where the annotation is...
Use case 1: Sometimes, we may need annotations from just a specific chapter or a few pages from here and there. This would also speed up the extraction process as...
Would love CSV output like this: ```csv page,type,author,created,text 1,Highlight,John,2023-05-17T11:38:17,Text ``` Sounds like that should be possible but not sure how. Great tool, thanks!