markitdown icon indicating copy to clipboard operation
markitdown copied to clipboard

【bug】 UnicodeEncodeError error

Open skybambler opened this issue 1 year ago • 2 comments

UnicodeEncodeError: 'gbk' codec can't encode character '\U0001f4a1'

UnicodeEncodeError: 'gbk' codec can't encode character '\u2022'

skybambler avatar Jan 01 '25 09:01 skybambler

UnicodeEncodeError: 'charmap' codec can't encode character '\u202f' in position 44138: character maps to

narrow non-breaking space (\u202F)

UnicodeEncodeError: 'charmap' codec can't encode character '\u200b' in position 121523: character maps to

zero-width space (ZWSP)

weyCC81 avatar Jan 04 '25 23:01 weyCC81

I have a similar issue.

UnicodeEncodeError: 'charmap' codec can't encode character '\ufb01' in position 4488: character maps to <undefined>

Attempted to convert pdf to markdown. English language.

drolander avatar Jan 07 '25 19:01 drolander