udger-java

WordDetector cannot handle words starting with 2 underscores

Open svendiedrichsen opened this issue 1 year ago • 0 comments

We regularly find warning messages such as "Index out of hashmap58 : __weibo__" in our logs.

This seems to be caused by the way the WordDetector class computes the index into its array. Shouldn't all non-alphabetic characters be stripped from the word before the index is determined?
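To illustrate the suspicion, here is a minimal sketch rather than the library's actual code. It assumes the index is derived from the first two lowercase characters of the word relative to 'a'; the class name WordIndexSketch, both method names, and the 26 x 26 table size are hypothetical. Under that assumption, a leading underscore (which sorts before 'a') yields a negative index, while stripping non-letters first keeps the index in range.

```java
import java.util.Locale;

// Hypothetical sketch, not the udger-java implementation.
class WordIndexSketch {
    static final int ALPHABET = 26;                  // assumed: letters a-z only
    static final int TABLE_SIZE = ALPHABET * ALPHABET;

    // Assumed current behaviour: index from the first two raw characters.
    // Requires a word of at least two characters.
    static int naiveIndex(String word) {
        String s = word.toLowerCase(Locale.ROOT);
        return (s.charAt(0) - 'a') * ALPHABET + (s.charAt(1) - 'a');
    }

    // Proposed behaviour: drop everything except a-z before indexing.
    // Returns -1 when fewer than two letters remain.
    static int strippedIndex(String word) {
        String s = word.toLowerCase(Locale.ROOT).replaceAll("[^a-z]", "");
        if (s.length() < 2) {
            return -1;
        }
        return (s.charAt(0) - 'a') * ALPHABET + (s.charAt(1) - 'a');
    }

    public static void main(String[] args) {
        String word = "__weibo__";
        int naive = naiveIndex(word);
        int stripped = strippedIndex(word);
        System.out.println("naive index in range:    " + (naive >= 0 && naive < TABLE_SIZE));
        System.out.println("stripped index in range: " + (stripped >= 0 && stripped < TABLE_SIZE));
    }
}
```

If the index were computed from stripped letters, the lookup side would presumably have to normalize its input the same way so that the word is still found in the matching bucket.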

svendiedrichsen, Jan 23 '23 11:01