char-rnn-tensorflow
char-rnn-tensorflow copied to clipboard
Does not word with other language
Where I have to change to support UTF-8. so that I can train it on other languages
It actually should work with utf-8 if you're using the latest version.
What are your versions:
- char-rnn-tensorflow
- tensorflow
- python
Thanks.
Actually the sample outputs my Greek text as raw utf-8 , " \xcf\xce\x83, \xb1\xb9\ .........."
@lowtronik that hex format. just decode it result.decode("utf-8", "replace")
@ShuvenduBikash I just deleted .encode('utf-8') and it works
I have the same problem, it generates raw text like this
\xc3\xa8p
However if I follow your suggestion and delete .encode('utf-8') it fails with this error:
UnicodeEncodeError: 'ascii' codec can't encode character '\u201c' in position 444: ordinal not in range(128)