Joshy Cyriac
Joshy Cyriac
Oh, then sorry for the report. I am using OS X El Capitan 10.11.2, Chrome 48.0.2564.116 (64-bit) and my timezone is GMT+1/Central European Standard Time. Maybe the tests could be...
Hi, there is now an option to ignore errors. Please try it with: `rtf_to_text(your_string, errors="ignore")` It was working for me, but I don't know if relevant text is stripped away.
Hi Prabakaran, could you post the full rtf? Otherwise it will be difficult to debug. Best regard, Joshy
Hi Prabakaran, could you post the full rtf? Otherwise it will be difficult to debug. Thanks, Joshy
It works wit the latest version, therefore closing this issue.
@druck13 Thanks a lot for the fake_blink! It helped me a lot.
Hi, according to wikipedia cyrilic rtf should be encoded in cp1251 and not in cp1252. If I change the rtf content to cp1251 it works fine. cp1252 is the western...
I have created a small test case myself with word 365 and indeed it saves it with encoding 1252. I have no idea how in this case word finds out...
@svladimirs: Glad you got a workaround. If I am running your code I get: LookupError: unknown encoding: ansi. How can this run?
> > The question is how do they figure out the right encoding? > > Maybe they do [charset detection](https://en.wikipedia.org/wiki/Charset_detection)? I tried the [chardet](https://chardet.readthedocs.io/) library and it told me with...