wikipedia-to-elastic icon indicating copy to clipboard operation
wikipedia-to-elastic copied to clipboard

Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual support)

Results 3 wikipedia-to-elastic issues
Sort by recently updated
recently updated
newest added

It seems that part of the reason that this takes 5 days to run is that the code is written with one ingestion node in mind. If I have more...

help wanted

Create a process that will add only new pages to existing elastic index from newer wikipedia dumps release.

enhancement