gateplugin-LearningFramework
A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, NaiveBayes, CRF and others), LibSVM, Scikit-Learn, Weka, and DNN...
This is interesting for huge datasets and stream learning. The LF would essentially just get document by document and pass extracted features on to the online learning algorithm (e.g. Vowpal...
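The document-by-document flow described here can be sketched roughly as follows. This is only an illustration: `OnlinePerceptron`, `train_stream`, and the `extract_features` callback are hypothetical names, and the tiny perceptron stands in for a real online learner; it is not how Vowpal Wabbit or the LearningFramework actually work.

```python
class OnlinePerceptron:
    """Hypothetical stand-in for an online learner; labels are +1 or -1."""

    def __init__(self):
        self.weights = {}  # sparse weight vector: feature name -> weight

    def score(self, features):
        return sum(self.weights.get(name, 0.0) * value
                   for name, value in features.items())

    def predict(self, features):
        return 1 if self.score(features) > 0 else -1

    def update(self, features, label):
        # Classic perceptron rule: change weights only on a mistake.
        if self.predict(features) != label:
            for name, value in features.items():
                self.weights[name] = self.weights.get(name, 0.0) + label * value


def train_stream(documents, extract_features, learner):
    """Consume documents one at a time; nothing is kept in memory.

    `documents` may be any iterable (e.g. a generator over a huge corpus);
    `extract_features` (an assumption) yields (features, label) pairs per document.
    Returns the number of instances passed to the learner.
    """
    seen = 0
    for doc in documents:
        for features, label in extract_features(doc):
            learner.update(features, label)
            seen += 1
    return seen
```

Because each instance is handed to the learner as soon as it is extracted, the feature space grows as new features appear, which fits sparse representations more naturally than dense ones.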
Turns out this was a rubbish idea and it causes problems when trying to make it work with dense representations where we need to know and control the number of...
Not sure about this; it needs more thought. This is about predicting per-instance ordinal values where we only know the partial ordering of targets but where the target value is otherwise...
Refactor the CorpusExporterJsonSeq methods so that instead of directly exporting the whole list of sequences in the export method, we split the logic into: * create the string for one...
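The general shape of such a split might look like the sketch below, written in Python for brevity rather than in the plugin's own language. The function names and the JSON layout (one line per sequence, holding a list of feature dicts and a list of targets) are assumptions for illustration only; they are not taken from CorpusExporterJsonSeq, and only the first step named above is covered.

```python
import io
import json


def sequence_to_json(sequence):
    """Create the string for one sequence.

    `sequence` is assumed to be a list of (features_dict, target) pairs.
    """
    features = [f for f, _ in sequence]
    targets = [t for _, t in sequence]
    return json.dumps([features, targets], sort_keys=True)


def export_sequences(sequences, out):
    """Write one JSON line per sequence to the file-like object `out`.

    Returns the number of sequences written.
    """
    count = 0
    for sequence in sequences:
        out.write(sequence_to_json(sequence))
        out.write("\n")
        count += 1
    return count
```

With the per-sequence step isolated, the same function could be reused to stream sequences to another consumer instead of writing them all to a file.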
Ideally by allowing a list of class annotation types, and/or by allowing the targetFeature to be used to distinguish between several classes.
This should be doable independently of the algorithm used, i.e. for both classifiers and sequence learners.
Allow features to depend on (previous) targets. This means we should be able to add something like `-2` to an ATTRIBUTE or ATTRIBUTELIST declaration and then use the value that...
Implement some standard feature functions: wordshape, character ngrams with maximum n or range of ns, prefixes or suffixes of length
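As a rough illustration of what these feature functions compute, here is a minimal Python sketch. The function names, signatures, and the particular word-shape mapping (uppercase to `A`, lowercase to `a`, digit to `9`) are assumptions chosen for the example, not the plugin's API.

```python
def word_shape(token):
    """Replace each character by its class; other characters are kept as-is."""
    shape = []
    for ch in token:
        if ch.isupper():
            shape.append("A")
        elif ch.islower():
            shape.append("a")
        elif ch.isdigit():
            shape.append("9")
        else:
            shape.append(ch)
    return "".join(shape)


def char_ngrams(token, min_n, max_n):
    """All character n-grams for every n in the range min_n..max_n (inclusive)."""
    grams = []
    for n in range(min_n, max_n + 1):
        for i in range(len(token) - n + 1):
            grams.append(token[i:i + n])
    return grams


def prefixes(token, max_len):
    """Prefixes of length 1..max_len, capped at the token length."""
    return [token[:k] for k in range(1, min(max_len, len(token)) + 1)]


def suffixes(token, max_len):
    """Suffixes of length 1..max_len, capped at the token length."""
    return [token[-k:] for k in range(1, min(max_len, len(token)) + 1)]
```

For example, `word_shape("GATE-8")` gives `"AAAA-9"`, and `char_ngrams("gate", 2, 3)` gives `["ga", "at", "te", "gat", "ate"]`.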
Add a parameter "NUMBER" or "N" to ATTRIBUTELIST which is 1 by default. If this is larger than 1 then instead of getting the features for each of the elements,...
Re-check what the intended behaviour is when combinations of the optional parameters are missing. There seems to be a bug in the EvaluateClassification PR when both classAnnotationType and targetFeature are...