symphytum icon indicating copy to clipboard operation
symphytum copied to clipboard

issues with search for non-latin alphabets (case-sensitivity and accents)

Open FotisK opened this issue 4 years ago • 2 comments

Hi, I've noticed a couple of issues when searching in non-latin (specifically greek) text: I have a field with values such as "Βορράς", "βορράς", "βορρας", "Βορρας", "ΒΟΡΡΑΣ" and "ΒΌΡΡΑΣ" All these values ideally should be considered identical (case insensitive and accent stripping) but currently they are treated as different strings.

FotisK avatar Mar 03 '20 20:03 FotisK

This is a limitation of the SQLite backend which handles unicode characters that are outside the ASCII range as case sensitive:

from https://www.sqlitetutorial.net/sqlite-like/

However, for Unicode characters that are not in the ASCII ranges, the LIKE operator is case sensitive e.g., "Ä" LIKE "ä" is false.

I'll think about a workaround.

joshirio avatar Mar 06 '20 15:03 joshirio

ah ic, that makes sense @joshirio ! Thank you for all the time you've put into my issue and the whole project.

FotisK avatar Mar 06 '20 20:03 FotisK