
UnicodeEncodeError: 'gbk' codec can't encode character '\u2022' in position 1195: illegal multibyte sequence

Open ArchieZhao opened this issue 1 year ago • 5 comments

Traceback (most recent call last):
  File "D:\Program\AnacondaEnv\markitdown_env\lib\runpy.py", line 196, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "D:\Program\AnacondaEnv\markitdown_env\lib\runpy.py", line 86, in _run_code
    exec(code, run_globals)
  File "D:\Program\AnacondaEnv\markitdown_env\Scripts\markitdown.exe\__main__.py", line 7, in <module>
  File "D:\Program\AnacondaEnv\markitdown_env\lib\site-packages\markitdown\__main__.py", line 43, in main
    print(result.text_content)
UnicodeEncodeError: 'gbk' codec can't encode character '\u2022' in position 1195: illegal multibyte sequence

ArchieZhao avatar Dec 22 '24 04:12 ArchieZhao

I'm seeing similar issues: UnicodeEncodeError: 'charmap' codec can't encode character '\u2003' in position 3849: character maps to <undefined>

On another document: UnicodeEncodeError: 'charmap' codec can't encode character '\uf416' in position 6988: character maps to <undefined>

hanchan78 avatar Dec 23 '24 17:12 hanchan78

I encountered the same problem too. UnicodeEncodeError: 'gbk' codec can't encode character '\u2217' in position 37: illegal multibyte sequence

fishfen avatar Dec 24 '24 02:12 fishfen

I encountered the same problem too. Attached file: meidi2023.pdf

tonygeneral avatar Dec 24 '24 12:12 tonygeneral

I think it was fixed in PR #116

l-lumin avatar Dec 26 '24 03:12 l-lumin

Still not working. Same issue.

UnicodeEncodeError: 'gbk' codec can't encode character '\xa0' in position 921: illegal multibyte sequence

skybambler avatar Jan 02 '25 11:01 skybambler
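
A possible workaround, sketched under the assumption that the failure comes from `print(result.text_content)` writing to a Windows console whose encoding (`gbk`, or `charmap` on other locales) cannot represent characters such as `'\u2022'` or `'\xa0'`. The helper name `safe_print` is hypothetical and is not part of markitdown; it only illustrates replacing unencodable characters instead of raising.

```python
import sys


def safe_print(text, stream=None):
    """Print text, substituting '?' for characters the stream cannot encode.

    Hypothetical helper for illustration; not part of markitdown.
    """
    stream = stream or sys.stdout
    # Fall back to UTF-8 if the stream does not report an encoding.
    encoding = getattr(stream, "encoding", None) or "utf-8"
    # Round-trip through the target encoding so that unencodable
    # characters are replaced rather than raising UnicodeEncodeError.
    safe = text.encode(encoding, errors="replace").decode(encoding)
    print(safe, file=stream)
```

Calling `safe_print(result.text_content)` in place of `print(result.text_content)` would lose the unsupported characters but avoid the crash. If lossless output matters, two alternatives may be worth trying: call `sys.stdout.reconfigure(encoding="utf-8")` (Python 3.7+) before printing, or set the environment variable `PYTHONIOENCODING=utf-8` before running the command and redirect the output to a file.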