rust-punkt
Training data should come as separate libraries
Right now, the training data all comes as a single package. It might be better to include it as compiled code that is generated from a JSON document.
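One possible way to approximate the split described above, short of publishing separate crates, would be to put each language's data behind its own Cargo feature so that unused data is never compiled in. The sketch below is only an illustration under that assumption: the feature names (`english`, `german`), the `Language` enum, and `training_json` are hypothetical and are not part of the punkt crate's API, and the inline JSON strings stand in for what would be generated or embedded from the real JSON documents.

```rust
/// Hypothetical language selector; not the punkt crate's actual type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Language {
    English,
    German,
}

// Each accessor exists in two variants. With the (hypothetical) feature enabled,
// the data is compiled in; without it, the accessor returns None and the data
// never reaches the binary.
#[cfg(feature = "english")]
fn english_json() -> Option<&'static str> {
    // Placeholder: a real crate might use include_str! on the JSON document here.
    Some(r#"{"language": "english"}"#)
}

#[cfg(not(feature = "english"))]
fn english_json() -> Option<&'static str> {
    None
}

#[cfg(feature = "german")]
fn german_json() -> Option<&'static str> {
    // Placeholder, as above.
    Some(r#"{"language": "german"}"#)
}

#[cfg(not(feature = "german"))]
fn german_json() -> Option<&'static str> {
    None
}

/// Returns the embedded training data for `lang`, or None if the
/// corresponding feature was not enabled at compile time.
pub fn training_json(lang: Language) -> Option<&'static str> {
    match lang {
        Language::English => english_json(),
        Language::German => german_json(),
    }
}
```

A caller would enable only the features it needs in its own `Cargo.toml` and handle the `None` case for languages that were left out. Separate crates per language would give the same effect with independent versioning, at the cost of more packages to maintain.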
Or perhaps embed the JSON strings in zipped form, because the binary gets huge when using the punkt crate. A quick check with the command-line zip tool suggests a reduction to about 30% of the original size.