elasticsearch-river-mongodb
How to resume indexing after restarting elasticsearch service?
I have a collection of over 50 million documents that I want to index in Elasticsearch. However, I sometimes need to restart the Elasticsearch service, which stops the river's indexing process. What is the workaround for this scenario? I am using ES 1.3.4 and MongoDB river 2.0.2.
The initial import reads the collection data directly; once it has completed, the river switches to oplog.rs. So you should wait until the initial import is completed before restarting ES.
Thanks. I am encountering a very strange issue now. The docs count in Elasticsearch is increasing and decreasing continuously: it goes up by 1, then down by 1 or 2. The overall count has also grown well beyond the actual collection size. What could be the issue?
When the initial import is completed, the river reads oplog.rs and applies the operations recorded for the configured collection, so it is possible for the count to increase and decrease until that replay is completed.
Do you still see document count changing once this phase is completed?
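To illustrate why the count can move up and down during this phase, here is a minimal sketch that replays a few oplog-style entries against an in-memory dictionary and records the document count after each one. It is not the river's actual code. The `op`, `o` and `o2` fields follow MongoDB's oplog entry layout, but the function name `replay_oplog`, the sample entries, and the simplified update handling are assumptions made for illustration only.

```python
from typing import Any, Dict, Iterable, List, Optional


def replay_oplog(entries: Iterable[Dict[str, Any]],
                 index: Optional[Dict[Any, Dict[str, Any]]] = None) -> List[int]:
    """Apply oplog-style entries to a dict and return the count after each one.

    Hypothetical helper: a stand-in for an index, not the river's implementation.
    """
    index = {} if index is None else index
    counts = []
    for entry in entries:
        op = entry["op"]
        if op == "i":
            # Insert: 'o' holds the full document.
            doc = entry["o"]
            index[doc["_id"]] = dict(doc)
        elif op == "u":
            # Update: 'o2' identifies the document, 'o' describes the change.
            # Simplification: only '$set' is merged; anything else replaces the body.
            doc_id = entry["o2"]["_id"]
            if doc_id in index:
                change = entry["o"]
                if "$set" in change:
                    index[doc_id].update(change["$set"])
                else:
                    index[doc_id] = dict(change, _id=doc_id)
        elif op == "d":
            # Delete: 'o' holds the _id of the removed document.
            index.pop(entry["o"]["_id"], None)
        counts.append(len(index))
    return counts


# Made-up sample entries: two inserts, a delete, a re-insert, an update, a delete.
sample = [
    {"op": "i", "o": {"_id": 1, "name": "a"}},
    {"op": "i", "o": {"_id": 2, "name": "b"}},
    {"op": "d", "o": {"_id": 1}},
    {"op": "i", "o": {"_id": 1, "name": "a2"}},
    {"op": "u", "o2": {"_id": 2}, "o": {"$set": {"name": "b2"}}},
    {"op": "d", "o": {"_id": 2}},
]
counts = replay_oplog(sample)
print(counts)
```

The count rises on inserts, stays flat on updates, and drops on deletes, so a fluctuating count is expected while old operations are still being replayed.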
Yes. The count has increased to about 65 million documents, whereas my collection contains about 54 million. The docs count keeps increasing and decreasing by 1 or 2; it seems like it's stuck.
These are the counts ES is showing. Note that the docs count reads 54,574,622, then 54,574,621, then 54,574,620, and so on, and it keeps fluctuating like that while the count in the index keeps increasing.
I am also using the Marvel plugin.
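One way to see which number is which is to compare the search-level count (`GET /<index>/_count`) with the index-level statistics (`GET /<index>/_stats`), which also report deleted documents. The sketch below does that comparison in Python. The URL `http://localhost:9200`, the index name `mydb`, and the helper names are assumptions, not values from this thread. A possible explanation for a gap, not confirmed here, is that index-level stats count Lucene documents (which can include nested documents), while deleted documents stay in the segments until they are merged away.

```python
import json
from typing import Any, Dict
from urllib.request import urlopen


def fetch_json(url: str) -> Dict[str, Any]:
    """GET a URL and decode the JSON body."""
    with urlopen(url) as resp:
        return json.loads(resp.read().decode("utf-8"))


def summarize_counts(count_resp: Dict[str, Any],
                     stats_resp: Dict[str, Any],
                     collection_size: int) -> Dict[str, int]:
    """Put the _count and _stats numbers side by side with the collection size."""
    docs = stats_resp["_all"]["primaries"]["docs"]
    search_count = count_resp["count"]
    return {
        "search_count": search_count,
        "stats_count": docs["count"],
        "stats_deleted": docs["deleted"],
        "search_minus_collection": search_count - collection_size,
        "stats_minus_search": docs["count"] - search_count,
    }


def report(es_url: str, index: str, collection_size: int) -> Dict[str, int]:
    """Query a live cluster; es_url and index are placeholders chosen by the caller."""
    count_resp = fetch_json("%s/%s/_count" % (es_url, index))
    stats_resp = fetch_json("%s/%s/_stats" % (es_url, index))
    return summarize_counts(count_resp, stats_resp, collection_size)


# Offline example with made-up responses shaped like the real ones.
example = summarize_counts(
    {"count": 54574620},
    {"_all": {"primaries": {"docs": {"count": 65000000, "deleted": 1200}}}},
    collection_size=54000000,
)
print(example)
```

Against a running cluster this could be called as `report("http://localhost:9200", "mydb", 54000000)`, passing the collection size obtained from MongoDB separately.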