Elias Ponvert issues

Results 32 issues of


                                            Elias Ponvert

switch to MIT (or BSD) license

Switch the code from Apache 2.0 to MIT (or Simplified BSD) to make this work GPL compatible

enhancement

Publish to Maven Central (or somewhere)

Requires #6, removing chunk.py and ensuring this project is organized correctly as a library

enhancement

Update README

Convert README to Markdown, update links to Github repo

enhancement

Run on unseen material without annotations

When passing a test file to chunk.py, if it does not have annotations for evaluation, we currently have this error: ``` Exception in thread "main" java.lang.AssertionError at upparse.corpus.UnlabeledBracketSet.fromTokens(UnlabeledBracketSet.java:113) at upparse.corpus.UnlabeledBracketSet.fromString(UnlabeledBracketSet.java:97)...

bug

Run cascaded parsing within Java

And remove chunk.py This will facilitate operating upparse as a library

enhancement

tokenized word-per-line input type

enhancement

Facilitate running chunking on a single dataset in chunk.py

Currently chunk.py expects a train / test setup. Allow user to only supply a single file, to chunk, output results and optionally evaluated output.

bug

Separate out train and test in cascade parsing levels

For a slightly more realistic train / evaluation setup, the cascade parser currently chunks the evaluation dataset at each level right after it trains on that level in the training...

enhancement

Model serialization

This affects cascade parsing. Currently, the chunk.py script must run (with full training) chunking twice -- once to re-chunk the training dataset to train the next level in the cascade,...

enhancement