Petko Minkov

Results 3 comments of Petko Minkov

Thanks for taking a look @mocobeta - your comment makes sense. I have a dataset and have noticed the problem on it. I'll create a short analysis with examples.

I created a branch with some analysis of what happens, it's [here](https://github.com/pminkov/lucene/commit/25c5ea4c12d92b8f534d40e449509a327ab6eea9). The code is a bit hacky, sorry. **Dataset** I used one of the MongoDB Atlas datasets - [mflix](https://www.mongodb.com/docs/atlas/sample-data/sample-mflix/)....

I'm uploading the selected words file: [mlt-selected-words.txt](https://github.com/apache/lucene/files/8863975/mlt-selected-words.txt) And the input file: [plots.txt](https://github.com/apache/lucene/files/8863978/plots.txt)