recogito-js icon indicating copy to clipboard operation
recogito-js copied to clipboard

Not recognizing specific characters such as §

Open Thutmose3 opened this issue 3 years ago • 1 comments

If the character § is in a text, when i try to get the start_index and end_index the indexes are wrong by 1.

It is as if the character § is not there. But in reality it is there. Which is problematic when needed the specific position of a word.

When testing, revocito got the position of the words correctly up to this character §. After they position of each subsequent word is wrong by 1.

Thutmose3 avatar Apr 07 '22 11:04 Thutmose3

Tried to reproduce this, but RecogitoJS did count the '§' character correctly in my tests. (Tested in Chrome + FF.) Which browser and OS are you seeing this on? Are you sure you set the correct encoding in your HTML page?

<meta charset="utf-8" />

rsimon avatar Jul 24 '22 12:07 rsimon

Closing due to inactivity/no response

rsimon avatar Sep 03 '22 17:09 rsimon