cogcomp-nlp
Retraining models required for fixing #593
TL;DR: We will need to retrain PrepSRL, Quantifier, and (maybe) Question Typer.
The fix applied to #593 (currently in branch string_join_require_retrain) will affect the behavior of FeatureNGramUtility in edison. All 4 feature-generating functions in FeatureNGramUtility call StringUtils.join() to produce feature names, which currently end with a trailing dash. The fix removes the trailing dash, which in turn changes the feature names, so any model trained on the old names has to be retrained.
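To illustrate why a changed feature name forces retraining, here is a minimal sketch. It does not use the actual edison code: `joinWithTrailingSeparator` and `joinWithoutTrailingSeparator` are hypothetical stand-ins for the pre-fix and post-fix behavior of StringUtils.join(), and the toy `lexicon` map is an assumed stand-in for whatever feature-name-to-index mapping a trained model stores.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class FeatureNameDemo {
    // Hypothetical stand-in for the pre-fix behavior:
    // the separator is appended after every token, including the last one.
    public static String joinWithTrailingSeparator(String sep, List<String> tokens) {
        StringBuilder sb = new StringBuilder();
        for (String token : tokens) {
            sb.append(token).append(sep);
        }
        return sb.toString();
    }

    // Hypothetical stand-in for the post-fix behavior:
    // the separator appears only between tokens.
    public static String joinWithoutTrailingSeparator(String sep, List<String> tokens) {
        return String.join(sep, tokens);
    }

    public static void main(String[] args) {
        List<String> bigram = Arrays.asList("the", "cat");
        String oldName = joinWithTrailingSeparator("-", bigram);    // "the-cat-"
        String newName = joinWithoutTrailingSeparator("-", bigram); // "the-cat"

        // Toy lexicon, assumed to be built at training time from the old names.
        Map<String, Integer> lexicon = new HashMap<>();
        lexicon.put(oldName, 0);

        // The old name resolves to a feature index; the new name is unknown.
        System.out.println(oldName + " -> " + lexicon.get(oldName)); // the-cat- -> 0
        System.out.println(newName + " -> " + lexicon.get(newName)); // the-cat -> null
    }
}
```

Under these assumptions, every n-gram feature produced after the fix would miss the old lexicon, which is why the affected models would need to be retrained rather than reused.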
Here are all the packages that will (likely) be affected:
- PrepSRL: https://github.com/CogComp/cogcomp-nlp/blob/7d9dad3fedc16ac59feb278815e27dc195d1367e/prepsrl/src/main/java/edu/illinois/cs/cogcomp/prepsrl/features/WordBigrams.java#L38-L39
- Quantifier: https://github.com/CogComp/cogcomp-nlp/blob/7d9dad3fedc16ac59feb278815e27dc195d1367e/quantifier/src/main/java/edu/illinois/cs/cogcomp/quant/features/WordBigrams.java#L44-L45
- Question Typer (this looks like just a debug/helper function that is not actually used in the model, but I am not sure): https://github.com/CogComp/cogcomp-nlp/blob/7d9dad3fedc16ac59feb278815e27dc195d1367e/question-type/src/main/java/edu/illinois/cs/cogcomp/question_typer/QuestionTyperFeatureExtractorsUtils.java#L191-L200