Zac

Results: 2 comments by Zac

This is not reproducible as described. The training algorithm will actually catch abbreviations such as "e.g." when they are followed by a comma. However, the buggy behavior does occur without the...

The README included with the tokenizer says English.pickle was trained on PTB, and the 5% sample corpus contains zero instances of "e.g." and "i.e.", so that makes sense. I was...