detox
detox copied to clipboard
max_length filter chops UTF-8 chars
The max_length filter does not respect multibyte characters.
Reproduce with:
# ---------------------------------------------------------------------------
INPUT=$(printf "\u0201\u0202\u0203\u0204\u0205\u0206\u0207\u0208")
OUPTUT=$(printf "\u0201\u0202\u0203\u0204")
METHOD1=max-length-9
test_sequence "$DETOX" "$INPUT" "$OUTPUT" "$TABLEPATH" "$METHOD1"