taskwarrior icon indicating copy to clipboard operation
taskwarrior copied to clipboard

Unicode tags escaped when modifying a task

Open sclee15 opened this issue 3 years ago • 4 comments

Hello. I am using Taskwarrior and I think this is really good. However, I faced one small bug when I am using tags with Korean characters. The symtom is that adding a tag is okay but while modifying tags the tag turns into an escaped string.

Step to reproduce task add +구매 양말 task +LATEST modify due:eod Then you will see

Tag '구매' deleted.
Due set to '2022-04-12 23:59:59'.
Tag '\uad6c\ub9e4' added.

This is problem because I cannot use the tag for a filter anymore. I think this is a kind of unicode problem. but I cannot spot the cause of the error.

Thank you again.

sclee15 avatar Apr 12 '22 07:04 sclee15

I can confirm this with danish characters æ,ø and å and Æ, Ø and Å on my mac, but not on my linux box.

kiil avatar Jul 17 '22 12:07 kiil

Working properly on my linux with this locale config:

LANG=en_DK.UTF-8
LC_CTYPE="en_DK.UTF-8"
LC_NUMERIC=en_DK.UTF-8
LC_TIME=en_DK.UTF-8
LC_COLLATE="en_DK.UTF-8"
LC_MONETARY=en_DK.UTF-8
LC_MESSAGES="en_DK.UTF-8"
LC_PAPER=en_DK.UTF-8
LC_NAME=en_DK.UTF-8
LC_ADDRESS=en_DK.UTF-8
LC_TELEPHONE=en_DK.UTF-8
LC_MEASUREMENT=en_DK.UTF-8
LC_IDENTIFICATION=en_DK.UTF-8
LC_ALL=

Not working properly on my mac with this locale config:

LANG="da_DK.UTF-8"
LC_COLLATE="da_DK.UTF-8"
LC_CTYPE="da_DK.UTF-8"
LC_MESSAGES="da_DK.UTF-8"
LC_MONETARY="da_DK.UTF-8"
LC_NUMERIC="da_DK.UTF-8"
LC_TIME="da_DK.UTF-8"
LC_ALL=

kiil avatar Jul 18 '22 11:07 kiil

I've similar problem with Croatian characters in a tag, iow. when I add the following tag računi it is rendered as ra\u010duni and edit a tag and manually try to fix it does not help. My locale config is:

LANG=hr_HR.UTF-8
LANGUAGE=en_GB:en
LC_CTYPE="hr_HR.UTF-8"
LC_NUMERIC="hr_HR.UTF-8"
LC_TIME="hr_HR.UTF-8"
LC_COLLATE="hr_HR.UTF-8"
LC_MONETARY="hr_HR.UTF-8"
LC_MESSAGES=en_GB.UTF-8
LC_PAPER="hr_HR.UTF-8"
LC_NAME="hr_HR.UTF-8"
LC_ADDRESS="hr_HR.UTF-8"
LC_TELEPHONE="hr_HR.UTF-8"
LC_MEASUREMENT="hr_HR.UTF-8"
LC_IDENTIFICATION="hr_HR.UTF-8"
LC_ALL=

However, strange thing is that some tasks correctly render the same tag?

gour avatar Sep 21 '22 05:09 gour

Present on Linux with

LANG=pl_PL.UTF-8
LANGUAGE=

DDzwiedziu avatar Dec 22 '23 18:12 DDzwiedziu