Zac
Results
2
comments of
Zac
This is not reproducible as described. The training algorithm will actually catch cases like e.g. if they are followed by a comma. However, the bug behavior does occur without the...
The README included with the tokenizer says English.pickle was trained on PTB, and the sample 5% corpus contains zero instances of "e.g." and "i.e." so that makes sense. I was...