gateplugin-LearningFramework
gateplugin-LearningFramework copied to clipboard
A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, NaiveBayes, CRF and others), LibSVM, Scikit-Learn, Weka, and DNN...
Bumps [snakeyaml](https://bitbucket.org/snakeyaml/snakeyaml) from 1.32 to 2.0. Commits c98ffba issue 561: add negative test case e2ca740 Use Maven wrapper on github 49d91a1 Fix target for github 19e331d Disable toolchain for github...
This happens when Python is installed in a non-standard way under Windows, is not on the path or the wrong environment is active when running GATE. With LearningFramework versions 4.2...
Mallet library emits: ``` Couldn't open cc.mallet.util.MalletLogger resources/logging.properties file. Perhaps the 'resources' directories weren't copied into the 'class' directory. Continuing. ``` we should make this go away (maybe we can...
Some parts of this may have bit-rotted by now, make sure the stuff is there is current.
Instead of the current approach that uses CustomDuplication and relies on a specific order in which the instances get initialized and the controllerStarted callback invoked, do something as suggested by...
Currently we use the CorpusStats plugin to create tfidf scores per token which we can then use together with the featureName4Value option of an attribute to inject into the sparse...
There is a java implementation of liblinear there: https://github.com/bwaldvogel/liblinear-java May be interesting to include for classification.
Currently we use a modified version of apache commons-cli: originally it looked like the best match, then we had to hack it and now it turns out this was a...
* see also https://github.com/GateNLP/gate-lf-python-data/issues/15 * Keep the option to have many features but make it easy to have just the simple one-feature approach. * Store dense corpus instances as maps...
Currently the sequence encoding is really done by the feature extractor and the sequence encoder jointly, and the sequence encoder only sees the class annotations for each instance separately. We...