Michael McCandless
> @mikemccand -- did you mean to close this PR?

Ugh, no I did not! Sorry, I'll reopen!! I somehow fumble-finger'd something, adding a comment before I was done editing,...
`KnnGraphTester.java` already reports some interesting stats about the HNSW graph ... maybe it could also measure/aggregate/report the quantization error?
The tool could also report some aggregate stats, such as per-dimension variance, or whether all/some dimensions have negative values.
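As a rough illustration of the kind of stats meant here, a minimal standalone sketch follows. The class `VectorStats` and its methods are hypothetical names, not part of `KnnGraphTester.java` or Lucene, and the quantization step assumes a plain min/max uniform scalar quantizer, which is simpler than whatever quantizer Lucene actually applies.

```java
public final class VectorStats {
  private VectorStats() {}

  /** Population variance of each dimension across all vectors. */
  public static double[] perDimensionVariance(float[][] vectors) {
    int dim = vectors[0].length;
    double[] mean = new double[dim];
    for (float[] v : vectors) {
      for (int d = 0; d < dim; d++) {
        mean[d] += v[d];
      }
    }
    for (int d = 0; d < dim; d++) {
      mean[d] /= vectors.length;
    }
    double[] variance = new double[dim];
    for (float[] v : vectors) {
      for (int d = 0; d < dim; d++) {
        double diff = v[d] - mean[d];
        variance[d] += diff * diff;
      }
    }
    for (int d = 0; d < dim; d++) {
      variance[d] /= vectors.length;
    }
    return variance;
  }

  /** For each dimension, true if any vector has a negative value there. */
  public static boolean[] hasNegative(float[][] vectors) {
    boolean[] negative = new boolean[vectors[0].length];
    for (float[] v : vectors) {
      for (int d = 0; d < v.length; d++) {
        if (v[d] < 0) {
          negative[d] = true;
        }
      }
    }
    return negative;
  }

  /**
   * Mean squared reconstruction error after quantizing every value to
   * {@code bits} bits over the global [min, max] range and mapping it back.
   * Assumption: simple min/max uniform quantization, for illustration only.
   */
  public static double meanSquaredQuantizationError(float[][] vectors, int bits) {
    float min = Float.POSITIVE_INFINITY;
    float max = Float.NEGATIVE_INFINITY;
    for (float[] v : vectors) {
      for (float x : v) {
        min = Math.min(min, x);
        max = Math.max(max, x);
      }
    }
    int levels = (1 << bits) - 1;
    double scale = ((double) max - min) / levels;
    if (scale == 0) {
      return 0; // all values identical: nothing is lost
    }
    double sum = 0;
    long count = 0;
    for (float[] v : vectors) {
      for (float x : v) {
        long q = Math.round((x - min) / scale); // quantized bucket index
        double reconstructed = min + q * scale; // dequantized value
        double err = x - reconstructed;
        sum += err * err;
        count++;
      }
    }
    return sum / count;
  }
}
```

A tool could call these once over the indexed vectors and print the results next to the existing graph stats.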
Awesome! Let's start with that! I'll go merge it :) Thanks @msokolov
To address your 2nd idea (increment the position for each sub-word in the compound word), I think we'd need to create a graph-aware `CompoundWordTokenFilter`. It would also emit `PositionLengthAttribute`, and...
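To make the graph idea a bit more concrete, here is a small sketch of the position values such a filter might emit. It does not use any Lucene API: `CompoundGraphSketch` and `Token` are hypothetical stand-ins, and the `posInc`/`posLen` fields only mimic what `PositionIncrementAttribute` and `PositionLengthAttribute` would carry. It assumes the original compound token is kept and spans all of its sub-words.

```java
import java.util.ArrayList;
import java.util.List;

public final class CompoundGraphSketch {
  private CompoundGraphSketch() {}

  /** Hypothetical token: term text plus position increment and position length. */
  public static final class Token {
    public final String term;
    public final int posInc;
    public final int posLen;

    Token(String term, int posInc, int posLen) {
      this.term = term;
      this.posInc = posInc;
      this.posLen = posLen;
    }

    @Override
    public String toString() {
      return term + "(posInc=" + posInc + ",posLen=" + posLen + ")";
    }
  }

  /**
   * Emits the compound word first, spanning all sub-words, then each sub-word.
   * The first sub-word starts at the same position as the compound (posInc=0);
   * every later sub-word advances the position by one.
   */
  public static List<Token> decompose(String compound, List<String> subWords) {
    List<Token> out = new ArrayList<>();
    out.add(new Token(compound, 1, Math.max(1, subWords.size())));
    for (int i = 0; i < subWords.size(); i++) {
      out.add(new Token(subWords.get(i), i == 0 ? 0 : 1, 1));
    }
    return out;
  }
}
```

With this layout the compound token and the chain of sub-words form two parallel paths that start and end at the same positions, which is what lets a graph-aware consumer treat them as alternatives.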