OCR/PDF content search

Open JarodSch opened this issue 3 months ago • 4 comments

Is your feature request related to a problem? Please describe.

I store many PDFs in my Cloudreve instance. I would love to search the content of my PDFs via the global search, so I can find them again more easily.

Describe the solution you'd like

I would like to enter text from inside my PDFs in the global search bar to find the PDF itself.

Describe alternatives you've considered

There is no real alternative. The only option is to know where the PDF is.

Additional context

Being able to search for the PDF content here Image

JarodSch avatar Sep 14 '25 01:09 JarodSch

Duplicate of #2820, and I think this will be very hard to achieve, as Cloudreve only stores a few pieces of metadata, such as the path on physical storage, size...

YUDONGLING avatar Sep 14 '25 01:09 YUDONGLING

With PostgreSQL full-text search, this shouldn't be a big problem.
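
To make the idea concrete, here is a minimal sketch of what PostgreSQL full-text search over extracted file text could look like. It is an illustration only: the `file_text` table, its columns, and the `build_search` helper are hypothetical and not part of Cloudreve's schema. It assumes PostgreSQL 12 or newer (for generated columns) and a driver that uses `%s` placeholders, such as psycopg.

```python
# Hypothetical schema: Cloudreve does not ship this table; all names are illustrative.
# The generated column keeps the tsvector in sync with the extracted text,
# and the GIN index makes the @@ match fast.
SCHEMA_SQL = """
CREATE TABLE IF NOT EXISTS file_text (
    file_id  BIGINT PRIMARY KEY,
    body     TEXT NOT NULL,
    body_tsv tsvector GENERATED ALWAYS AS (to_tsvector('simple', body)) STORED
);
CREATE INDEX IF NOT EXISTS file_text_tsv_idx ON file_text USING GIN (body_tsv);
"""

# plainto_tsquery turns free-form user input into an AND query over its words;
# ts_rank orders the matching files by relevance.
SEARCH_SQL = """
SELECT file_id, ts_rank(body_tsv, query) AS rank
FROM file_text, plainto_tsquery('simple', %s) AS query
WHERE body_tsv @@ query
ORDER BY rank DESC
LIMIT %s
"""


def build_search(user_input: str, limit: int = 20):
    """Return (sql, params) ready to pass to cursor.execute()."""
    text = " ".join(user_input.split())  # collapse runs of whitespace
    if not text:
        raise ValueError("search text must not be empty")
    return SEARCH_SQL, (text, limit)
```

With a psycopg cursor this would be used as `cur.execute(*build_search("annual report"))`. The `'simple'` configuration is chosen here because it does no language-specific stemming; a language configuration such as `'english'` is an alternative when the documents share one language. The harder part the sketch leaves out is getting the text out of the PDFs in the first place.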

markusglaetzner avatar Sep 26 '25 19:09 markusglaetzner

With PostgreSQL full-text search, this shouldn't be a big problem.

That's great! Feel free to give it a try. I look forward to hearing from you soon!

YUDONGLING avatar Sep 27 '25 01:09 YUDONGLING

@YUDONGLING I just bought Pro during the Black Friday sale, because I really think this is a great project which should be supported. Maybe you count Pro feature requests differently, so here is my vote :P

JarodSch avatar Nov 19 '25 22:11 JarodSch

You can see how this is implemented in OpenCloud. As far as I understand, they use Apache Tika to extract data from documents. The task doesn't seem too difficult if you don't try to solve it yourself, but use ready-made components.
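
As a rough illustration of the ready-made-components approach, the sketch below sends a PDF to a standalone Apache Tika Server and reads back plain text. It assumes a Tika Server is already running locally on its default port 9998. The helper names `build_tika_request` and `extract_text` are made up for this example, and nothing here reflects how OpenCloud or Cloudreve actually wire this up.

```python
import urllib.request

# Assumption: a Tika Server instance is running locally on its default port.
TIKA_URL = "http://localhost:9998/tika"


def build_tika_request(pdf_bytes: bytes, tika_url: str = TIKA_URL) -> urllib.request.Request:
    """Build a PUT request asking Tika to return the document as plain text."""
    return urllib.request.Request(
        tika_url,
        data=pdf_bytes,
        method="PUT",
        headers={"Content-Type": "application/pdf", "Accept": "text/plain"},
    )


def extract_text(pdf_bytes: bytes, tika_url: str = TIKA_URL, timeout: float = 60.0) -> str:
    """Send the PDF to Tika and return the extracted text (needs a running server)."""
    request = build_tika_request(pdf_bytes, tika_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

The returned text could then be stored in whatever search index is chosen. Scanned PDFs without a text layer would additionally need OCR, which this sketch does not cover.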

krom avatar Dec 17 '25 15:12 krom

At this moment, we’re focusing on the Windows sync client. This full-text search feature will be our next major milestone after the sync client release.

HFO4 avatar Dec 17 '25 16:12 HFO4