cmp-buffer
cmp-buffer copied to clipboard
Completing Unicode (using \k regex) is broken because common prefix uses bytes instead of characters
Here is an example:
# Add some text to provide source words:
получаем битый символ
получив битый символ
# Then type "пол" and attempt to complete it result in adding broken unicode "учÐ":
получÐ
This happens because Cyrillic "а" in UTF8 is 0xD0 0xB0 and Cyrillic "и" in UTF8 is 0xD0 0xB8, so completion returns common part Latin Capital Letter Eth 0xD0.