rdrview
Handle (remove) zero-width-space
We currently end up converting the zero-width space (zws), U+200B / \xe2\x80\x8b, to an invalid sequence (FD BF BF BD A3 AC). We (ChatGPT and I) added code to simply remove these characters, in the same place where we swap the non-breaking space for a regular space.
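For illustration only, here is a minimal C sketch of the kind of removal described above. It assumes the text is a NUL-terminated UTF-8 buffer that can be modified in place. `strip_zws` is a hypothetical name, not the function from the actual patch or from rdrview itself.

```c
#include <string.h>

/*
 * Hypothetical helper (not taken from rdrview): remove every UTF-8
 * encoded U+200B (bytes E2 80 8B) from a NUL-terminated string,
 * compacting the string in place.
 */
static void strip_zws(char *s)
{
    static const char zws[] = "\xe2\x80\x8b";
    const size_t zws_len = sizeof(zws) - 1; /* 3 bytes, without the NUL */
    char *src = s;
    char *dst = s;

    while (*src) {
        /* strncmp stops at the terminating NUL, so this never reads
         * past the end of a string that ends in a partial sequence. */
        if (strncmp(src, zws, zws_len) == 0)
            src += zws_len;     /* skip the three zws bytes */
        else
            *dst++ = *src++;    /* keep any other byte unchanged */
    }
    *dst = '\0';
}
```

Calling `strip_zws(buf)` on a buffer holding the bytes `61 E2 80 8B 62` should leave the two-byte string "ab". This only removes the character; it does not explain where the invalid sequence is produced.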
Hi, sorry for the delay. I'm not sure I understand the patch. Are you saying there is a bug in the handling of zws somewhere else, and that you work around it by getting rid of all zws early on? Do you have a link or some other reproducer for the issue?