fxt
fxt copied to clipboard
Detect stemming used when building index
It may be useful in future to detect the stemmer used (if any) by Indri when building the index.
This could then facilitate automatic stemming of queries when using extract_features (or when using the Python API).
The likely place to store this information is within some metadata structure within the index itself.