Completing Unicode (using \k regex) is broken because common prefix uses bytes instead of characters

Open powerman opened this issue 4 months ago • 1 comments

Here is an example:

# Add some text to provide source words:
получаем битый символ
получив битый символ
# Then type "пол" and try to complete it; this results in adding broken Unicode "учÐ":
получÐ

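This happens because Cyrillic "а" in UTF-8 is 0xD0 0xB0 and Cyrillic "и" in UTF-8 is 0xD0 0xB8. The common prefix is computed byte by byte, so it ends with the shared lead byte 0xD0 without its continuation byte, and that stray byte shows up as "Ð" (Latin Capital Letter Eth, U+00D0).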
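
The byte arithmetic can be checked outside the editor. Below is a minimal Python sketch, not the plugin's actual code (which is Lua); the helper names `common_prefix_bytes` and `common_prefix_chars` are made up for this illustration. It compares a byte-wise common prefix with a character-wise one for the two words from the example.

```python
def common_prefix_bytes(a: bytes, b: bytes) -> bytes:
    """Longest common prefix compared byte by byte (hypothetical helper)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return a[:n]


def common_prefix_chars(a: str, b: str) -> str:
    """Longest common prefix compared code point by code point (hypothetical helper)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return a[:n]


w1, w2 = "получаем", "получив"

# Byte-wise: the prefix stops in the middle of a two-byte character,
# leaving the lead byte 0xD0 dangling at the end.
raw = common_prefix_bytes(w1.encode("utf-8"), w2.encode("utf-8"))
ends_with_stray_lead_byte = raw.endswith(b"\xd0")

# The byte-wise result is not valid UTF-8 on its own.
try:
    raw.decode("utf-8")
    is_valid_utf8 = True
except UnicodeDecodeError:
    is_valid_utf8 = False

# Character-wise: the prefix ends on a character boundary.
good = common_prefix_chars(w1, w2)
```

Here `raw` is the UTF-8 encoding of "получ" followed by a single 0xD0 byte, while `good` is just "получ", which is what the completion would be expected to insert.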

powerman • Jul 30 '25 17:07