needle icon indicating copy to clipboard operation
needle copied to clipboard

Large regexes are poorly handled

Open hyperpape opened this issue 2 years ago • 2 comments

Figure out how to handle very large regexes--the strategy we're using generates very large class files.

a04107b097ff559542c4e9c2bcd937eed93145dd substantially increased the size of generated regexes, making the problem worse, and required commenting out a previously used test case.

hyperpape avatar May 20 '23 17:05 hyperpape

The problem isn't gone, but some recent work to enable use of byteclasses is probably helpful. The regex Holmes.{0,25}Watson|Watson.{0,25}Holmes now generates a 340 KB class, instead of 2MB.

hyperpape avatar Jan 13 '24 23:01 hyperpape

Subsequent updates to encoding make Holmes.{0,25}Watson|Watson.{0,25}Holmes only 245181 bytes.

hyperpape avatar May 06 '24 07:05 hyperpape