tropy icon indicating copy to clipboard operation
tropy copied to clipboard

Search in PDFs

Open Wdavery opened this issue 3 years ago • 4 comments

I have some PDFs that have been fully OCR'd and are searchable using any PDF reader, but I can't find any way to search them within the Trophy interface. Ideally they would search in a global search as well, but being able to search while in item view would be sufficient.

Wdavery avatar Dec 17 '21 15:12 Wdavery

When Tropy imports PDFs they get rasterized in order to be viewed in the image viewer (similar to opening a PDF in Photoshop). Thus the OCR text is lost currently (extracting text from PDFs is far from trivial).

We consider to implement an additional PDF viewer, but there are no tangible plans yet.

If we are talking about dozens (not hundreds) of PDFs here, you may export the OCR text using a PDF reader and then add these texts to notes in Tropy. This way you would be able to find certain items/photos at least.

flachware avatar Dec 22 '21 18:12 flachware

Makes sense and an understandable design choice.

For my own use now, I will continue to use Tropy to organize the PDFs alongside images, and simply open them in an external viewer when needed.

What about a prominent GUI button for opening the current document in the default PDF viewer?

Wdavery avatar Dec 22 '21 21:12 Wdavery

How do you open the PDFs, by clicking on the file name in the metadata pane (reveals the PDF in the file system)?

flachware avatar Dec 23 '21 09:12 flachware

Yes, exactly. Reveal in filesystem open from there.

Wdavery avatar Dec 23 '21 11:12 Wdavery