generalized-language-modeling-toolkit
generalized-language-modeling-toolkit copied to clipboard
be able to run the toolkit out of the box
a sample data set and install script (fitting to the data set) should be provided
if the software does not have the correct data sets set up more meaningfull and helpfull error messages should be provided than just a single stacktrace.
maybe rename mvn.sh or create several startscripts for the various standard tasks which would be applied with the software.
visualize processing pipline.
explain input and output format (could happen in the visualization of the processing pipeline)
refactor some variable names and api endpoints.