upparse
upparse copied to clipboard
Model serialization
This affects cascade parsing. Currently, the chunk.py script must run (with full training) chunking twice -- once to re-chunk the training dataset to train the next level in the cascade, and once for the evaluation dataset. This would be much more manageable if a model could be serialized and written to disk.