gilko1981
Results
1
comments of
gilko1981
a better workaround is `text = ''.join([l for l in text if unicodedata.category(unicode(l))[0] not in ('S', 'M', 'C')])` omitting only undesired utf8 chars see http://www.fileformat.info/info/unicode/category/index.htm