gilko1981

Results 1 comments of gilko1981

a better workaround is `text = ''.join([l for l in text if unicodedata.category(unicode(l))[0] not in ('S', 'M', 'C')])` omitting only undesired utf8 chars see http://www.fileformat.info/info/unicode/category/index.htm