Marek Gagolewski

Results 118 comments of Marek Gagolewski

I cannot reproduce the above; I get: ```r > library("stringi") > stri_detect_regex("昌平区", "县") [1] FALSE > stri_detect_fixed("昌平区", "县") [1] FALSE > grepl("县", "昌平区") [1] FALSE > sessionInfo() R version 4.1.0...

Also, could you please show me the result of a call to `stri_info(FALSE)`?

With the latter, I get: ``` stri_detect_regex(tx_xi, "同志们") [1] FALSE FALSE FALSE FALSE FALSE TRUE FALSE FALSE FALSE TRUE FALSE FALSE FALSE FALSE FALSE FALSE FALSE [18] FALSE FALSE FALSE...

I think the problem is due to: ``` [2] LC_CTYPE=Chinese (Simplified)_China.936 ... system code page: 65001 ``` ICU thinks your native encoding is UTF-8, whereas it's probably GBK. Could you...

Great, I changed the title of the issue so that it's more searchable. To sum up, the solution was: ``` stri_enc_set("Windows-936") ```

I get `FALSE`. I think the problem might as well be on your system side, not just stringi, but it's worth digging into it. Can you call: * `charToRaw(stri_conv("昌平区", to...

Also, maybe the most recent R - UCRT is worth giving a try? https://github.com/r-windows/docs/blob/master/ucrt.md

Hmmm... are these really generated with `stri_enc_set("Windows-936")` in place? This needs to be called each time the package is loaded. The byte sequence `ef bf bd` denotes the replacement character...

:) Dear all, has anyone working in this locale experienced similar issues?

How about: ``` http_proxy="http://www-proxy3.XXXX.XXX.de:8080/" /usr/bin/Rscript --slave --no-save --no-restore-history -e 'install.packages("...")' ``` ?