gramophone icon indicating copy to clipboard operation
gramophone copied to clipboard

words with german umlauts

Open tiefenb opened this issue 10 years ago • 0 comments

gramophone has a problems with german umlauts ( Ä, Ö, Ü, ä, ö, ü, ß)

var gramophone = require('gramophone'); var fs = require('fs');

var result = gramophone.extract('Hallo Welt! Das ist ein Text über ganz viele Umlaute wie äöüÄÖÜß. Lörem Ipsüm Lörem Ipsüm.', {stopWords: JSON.parse(fs.readFileSync(__dirname + '/stopwords.json')) }); console.log(result);

result: [ 'rem ips m', 'l rem ips' ]

can you help me? maybe their is a problems with the utf8 encoding?

tiefenb avatar Feb 18 '15 12:02 tiefenb