fscrawler
Elasticsearch File System Crawler (FS Crawler)
Sometimes it takes a long while for a document to OCR and index. It would be nice to upload a document and immediately receive back a unique identifier (perhaps something...
Is there any interest in a contribution converting some of the codebase from Java to [Kotlin](https://kotlinlang.org/)?
When running tests, I can see that with workplace search 7.16.1: ``` 1> 20:47:25,031 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Sources found for name [fscrawler_c5dijowmh3oghpd]: [61b7870a6e6bdc77cae0c068] 1> 20:47:25,369 WARN [f.p.e.c.f.t.w.WPSearchClient] Error while running DELETE...
Problem description: Even in simulate mode, FSCrawler communicates with Elasticsearch, especially on launch, and it will not launch if Elasticsearch is not available. This should not be mandatory....
We indexed 2 million documents into Elasticsearch using FSCrawler, but the file count in Elasticsearch doesn't match the number of files in the share path. Is there a way to identify...
**Describe the bug** When I upload an image via REST, OCR always uses the "eng" language; changing the fs.ocr.language setting has no effect on the result. Solved this problem by replacing rus.traineddata to eng.traineddata,...
Hello, my fscrawler job doesn't index new files, and if a file is moved to another directory, it is deleted in the origin directory but it isn't added from the...
We can try to use https://github.com/elastic/thumbnails4j to generate thumbnails from documents. IMO this could be performed as an async task. As an output we should be able to propose multiple...
For this we most likely need to wait for https://github.com/randomizedtesting/randomizedtesting/issues/256 to be implemented.
Read https://github.com/lightbend/config