ICU-20688 SHIFT-JIS Conversion Issues - Patch

Open DGN001 opened this issue 4 years ago • 4 comments

Checklist
  • [x] Issue filed: https://unicode-org.atlassian.net/browse/ICU-20688
  • [x] Updated PR title and link in previous line to include Issue number
  • [x] Issue accepted
  • [ ] Tests included
  • [ ] Documentation is changed or added

DGN001 avatar Jul 12 '19 10:07 DGN001

I was able to reproduce it with the source code downloaded from the following link: https://github.com/unicode-org/icu/releases/tag/release-64-2

DGN001 avatar Jul 12 '19 10:07 DGN001

CLA assistant check
All committers have signed the CLA.

CLAassistant avatar Jul 12 '19 10:07 CLAassistant

What's the status of this PR, awaiting author or reviewer?

sffc avatar Aug 13 '19 04:08 sffc

@sffc Neither. It's for a bug analysis. I'm back and will continue this. Please leave it open.

srl295 avatar Aug 13 '19 05:08 srl295

I left a more detailed reply in the associated Jira issue. These authoritative IBM tables should not be modified. IBM did this on purpose. One of the other flavors of Shift-JIS can be used instead, like https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/windows-932-2000.ucm.

grhoten avatar Oct 31 '22 17:10 grhoten

Right. The ibm-*.ucm files were generated from IBM conversion tables and reflect actual IBM conversion behavior. We provide other mapping tables that can be used. We might consider moving to non-IBM conversion tables in our default data, but not as a patch of the IBM data files.

markusicu avatar Oct 31 '22 18:10 markusicu
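
The point made in the last two comments, that different Shift-JIS flavors deliberately map the same bytes to different code points, can be illustrated with a minimal sketch. It uses Python's built-in `shift_jis` and `cp932` codecs as stand-ins; these are not ICU's `.ucm` tables, and the sample byte sequence and helper names (`decode_with_flavors`, `describe`) are assumptions chosen for illustration, not taken from the issue.

```python
# Illustration only: Python's built-in codecs, NOT ICU's ibm-*.ucm or
# windows-932-2000.ucm tables. The sample bytes are an assumed example.
SAMPLE = b"\x81\x60"  # one double-byte sequence that Shift-JIS flavors map differently


def decode_with_flavors(data, flavors=("shift_jis", "cp932")):
    """Decode the same bytes under each named Shift-JIS flavor."""
    return {name: data.decode(name) for name in flavors}


def describe(text):
    """Render a string as a space-separated list of U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


if __name__ == "__main__":
    # Print the code point each flavor produces for the same input bytes.
    for name, text in decode_with_flavors(SAMPLE).items():
        print(f"{name}: {describe(text)}")
```

Neither result is a bug in its own table; each reflects the behavior of the platform the table was derived from. Picking the flavor that matches the data's origin, rather than patching one table to behave like another, is the approach the reviewers recommend above.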