LBeaudoux

Results 2 issues of LBeaudoux

Unlike the 'no-break space' ("\u00A0"), the 'narrow no-break space' ("\u202f") is not recognized as a word boundary. `tokenize("La vois-tu souvent\u202f?", "fr")` returns `['la', 'vois', 'tu', 'souvent\u202f']` instead of...
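One possible workaround, sketched here under assumptions, is to replace the narrow no-break space with an ordinary space before tokenizing. The helpers `normalize_spaces` and `simple_tokenize` below are hypothetical stand-ins written for illustration; they are not the `tokenize` function from the report and do not reproduce its behavior.

```python
import re

# U+202F NARROW NO-BREAK SPACE, often placed before ? ! ; : in French text.
NNBSP = "\u202f"


def normalize_spaces(text):
    """Replace narrow no-break spaces with ordinary spaces (hypothetical helper)."""
    return text.replace(NNBSP, " ")


def simple_tokenize(text):
    """Illustrative stand-in tokenizer: lowercase, then keep runs of word characters."""
    return re.findall(r"\w+", normalize_spaces(text).lower())


print(simple_tokenize("La vois-tu souvent\u202f?"))
# prints ['la', 'vois', 'tu', 'souvent']
```

Normalizing the input first keeps the fix independent of the tokenizer's internals, at the cost of one extra pass over the text.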

**Story** When I search for an English word, I often get a long list of very similar sentences that look like they were generated by a robot. I usually scroll...

enhancement