re2j
re2j copied to clipboard
linear time regular expression matching in Java
dc7d6e5d41225dc0825ea6fe4c6055ff854abe13 unfortunately increases the incidence of infinite loops during case folding if re2j is running on a JVM newer than the version used to generate the bundled UnicodeTables.java and the...
When I compile the pattern: `"..|.#|.."`: `Pattern.compile("..|.#|..")` I get `java.lang.NullPointerException` exception. ``` == Java Exception: java.lang.NullPointerException: Cannot read the array length because "subMax.runes" is null at com.google.re2j.Parser.factor(Parser.java:552) at com.google.re2j.Parser.collapse(Parser.java:344) at...
OSS-Fuzz now offers support for fuzzing Java projects with [Jazzer](https://github.com/CodeIntelligenceTesting/jazzer). If you are interested, I could set up re2j in OSS-Fuzz. By default, Jazzer would detect undeclared exceptions (i.e. those...
... they were ascii-only while java.util.regexp is unicode compliant
Word boundaries should use \p{L} not just A-Za-z to behave like the default regex in java. Added some tests showing the issue and fixed it in this PR : https://github.com/google/re2j/pull/100...
Hello awesome people! I got roped in to looking at this library due to it being a transitive dependency of another dependency I'm using, and I wanted to describe a...
This change takes a brute-force approach and generates the ranges accepted by these methods by testing all the code points. MakeJavaCategories.java generates the source file containing the category ranges. I've...
After #130, binary mode matching is now supported. However, `\C` is not a supported metacharacter, which limits its usefulness.
Hey, I was combining many (>10.000) similar file names to a unified regexp (all escaped and `|`-ed). In Java it compiles (though is extremely slow), but in Re2/j it fails...
https://github.com/google/re2j/blob/66840ce27d4834780f3ddec4a322a3df93b002d0/java/com/google/re2j/Parser.java#L31 This field is never used.