datawave icon indicating copy to clipboard operation
datawave copied to clipboard

Numeric regex encoder edge case (trailing zeros before decimal)

Open ivakegg opened this issue 4 months ago • 2 comments

Currently the numeric regex encode will normalize 123400.* to +[f-z]E1.234(0{2})?.* which is not quite correct. The ? following the (0{2}) in this case should not be added. If the (0{2}) were after a decimal point, then it would be ok, but not before the decimal.

ivakegg avatar Aug 28 '25 19:08 ivakegg

Correction, this is actually correct because to batch 123400 exactly would have to match +fE1.234 in which case there are no zero's in the normalized number.

ivakegg avatar Sep 03 '25 16:09 ivakegg

This is a case that the lossy numeric regex should be handling... I will verify.

ivakegg avatar Sep 03 '25 16:09 ivakegg