files_fulltextsearch icon indicating copy to clipboard operation
files_fulltextsearch copied to clipboard

no image files should be provided to fulltextsearch if files_image is set to 0

Open ferdiga opened this issue 5 months ago • 0 comments

To my understanding "files_image":"0" below should prevent indexing of the image files.

  • Content Providers: Deck 1.13.1 [] Files 29.0.1 { "files_local": "1", "files_external": "2", "files_group_folders": "0", "files_encrypted": "0", "files_federated": "0", "files_size": "1", "files_pdf": "1", "files_office": "1", "files_image": "0", "files_audio": "0", "files_chunk_size": "2", "files_fulltextsearch_tesseract": { "version": "27.0.0", "enabled": "1", "psm": "4", "lang": "eng,deu,fra", "pdf": "1", "pdf_limit": "0" }

nevertheless occ -vvv fulltextsearch:index spends a considerable time (screenshot ~20 seconds) working on image files like this one

┌─ Indexing ──── │ Action: fillDocument │ Provider: Files Account: christoph │ Document: 45296825 │ Info: image/png │ Title: *****/Bildschirmfoto 2018-12-17 um 20.13.49.png │

BTW indexing of pdf and office documents is very fast

ferdiga avatar Sep 11 '24 10:09 ferdiga