ShingleFilter produces invalid queries

Open tohidemyname opened this issue 1 year ago • 2 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues

Describe the bug

ShingleFilter uses the position length attribute of the token stream to encode the number of terms it puts in a single token. This breaks query parsing because it creates disconnected token graphs.
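To make "disconnected graph" concrete, here is a toy model in plain Python. It is not the Lucene.NET API. It treats each token as an edge from its start position to its start position plus its position length, then lists the tokens whose start position cannot be reached from the first one. The bigram-only stream (no unigrams emitted) and the specific increments and lengths are assumptions chosen for illustration; the issue itself does not state the configuration.

```python
from typing import List, Tuple

# Hypothetical token model: (term, position_increment, position_length).
Token = Tuple[str, int, int]


def to_edges(tokens: List[Token]) -> List[Tuple[int, int, str]]:
    """Turn a token stream into graph edges (start_node, end_node, term)."""
    edges = []
    pos = -1  # positions start at 0 after the first increment of 1
    for term, inc, length in tokens:
        pos += inc
        edges.append((pos, pos + length, term))
    return edges


def unreachable_tokens(tokens: List[Token]) -> List[str]:
    """Return the terms whose start node is not reachable from the first node."""
    edges = to_edges(tokens)
    if not edges:
        return []
    reachable = {min(src for src, _, _ in edges)}
    changed = True
    while changed:  # simple fixed-point reachability over the edge list
        changed = False
        for src, dst, _ in edges:
            if src in reachable and dst not in reachable:
                reachable.add(dst)
                changed = True
    return [term for src, _, term in edges if src not in reachable]


# Assumed stream for "a b c d": bigram shingles only, each advancing one position.
# Position length 2 encodes "this token holds two terms".
with_term_count = [("a b", 1, 2), ("b c", 1, 2), ("c d", 1, 2)]
# Same stream with position length left at 1.
with_unit_length = [("a b", 1, 1), ("b c", 1, 1), ("c d", 1, 1)]

print(unreachable_tokens(with_term_count))
print(unreachable_tokens(with_unit_length))
```

In this model, a position length of 2 makes "a b" jump from node 0 straight to node 2, so nothing leads into node 1 and "b c" is stranded. With a position length of 1 the tokens form a single chain.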

Expected Behavior

No response

Steps To Reproduce

I will implement a test case to show the bug.

Exceptions (if any)

No response

Lucene.NET Version

No response

.NET Version

8.0.300

Operating System

win 10

Anything else?

No response

tohidemyname, Jun 04 '24 08:06

My patch is submitted: https://github.com/apache/lucenenet/pull/944

tohidemyname, Jun 04 '24 08:06

Sorry, I am unfamiliar with submitting pull requests and may have closed my pull request by mistake. I have submitted a new one: https://github.com/apache/lucenenet/pull/946

tohidemyname, Jun 05 '24 00:06

Fixed in #946 (did not auto-close due to formatting)

paulirwin, Nov 20 '24 17:11