fxt
fxt copied to clipboard
Initial document-independent feature extraction support
For tracking a number of issues related to pre-retrieval document independent features.
- [ ]
preret_csvassumes unigram and bigram files are text - [ ]
preret_csvassumes user must use both unigram and bigram files - [ ]
generate_term_featuresoutput is text - [ ]
create_bigram_inverted_indexoutput is binary file - [ ]
create_bigram_inverted_indexis missing ordered and unordered options - [ ] add documentation for bigram features