cistern
cistern copied to clipboard
chipmunk segmenter dies when line is empty
Chipmunk currently dies when reading empty lines in input. This is of course caused by problems in other parts of pipeline (I am currently experimenting with it as part of SMT pipeline), but it might be a nice thing to have chipmunk just print a warning and output the empty token.
The offending code is in my version at:
chipmunk.segmenter.SegmentationDataReader$1.next(SegmentationDataReader.java:68) chipmunk.segmenter.SegmentationDataReader$1.next(SegmentationDataReader.java:55)
sorry I cannot copy/paste full backtrace, it's on my shell session on another computer at the moment, but I can get it later if you need one.