datawave
datawave copied to clipboard
Numeric regex encoder edge case (trailing zeros before decimal)
Currently the numeric regex encode will normalize 123400.* to +[f-z]E1.234(0{2})?.* which is not quite correct. The ? following the (0{2}) in this case should not be added. If the (0{2}) were after a decimal point, then it would be ok, but not before the decimal.
Correction, this is actually correct because to batch 123400 exactly would have to match +fE1.234 in which case there are no zero's in the normalized number.
This is a case that the lossy numeric regex should be handling... I will verify.