workbench icon indicating copy to clipboard operation
workbench copied to clipboard

Cannot import Chinese articles

Open slipbox opened this issue 2 years ago • 1 comments

Cannot import Chinese articles, it displays as garbled characters. For example: http://www.nopss.gov.cn/n1/2023/0802/c431029-40049194.html

image

lable:bug

slipbox avatar Nov 07 '23 15:11 slipbox

Initial exploration:

https://github.com/RoamJS/workbench/blob/63edf81c3bd16ebe6acf773e4be1e7f0149d0b3b/src/features/article.tsx#L107-L118

  • not frontend / roam issue: copy/paste works and html (ln 109) shows untranslatable characters
  • charset(r.headers) is returning utf8
  • iconv-lite says "Untranslatable characters are set to � or ?. No transliteration is currently supported."

Options for next steps

  • check what is happening in apiPost, r.encoded
  • find different character encodings provider

mdroidian avatar Nov 09 '23 20:11 mdroidian