aho-corasick
aho-corasick copied to clipboard
Aho-Corasick algorithm as implemented in Java by Danny Yoo, with little improvements
This is the source code distribution for an implementation of the Aho-Corasick automaton in Java. For license information, please see LICENSE.
This implemenation changes the build system from Ant to Maven
The starting version is 1.2
This implementations improve some aspects of a software written by Danny Yoo, available at https://hkn.eecs.berkeley.edu/~dyoo/java/index.html
The starting version is 1.1
The main changes are:
- the use of Java Generics
- the use of char[] data structure to represent text instead of byte[]; this makes the software compatible with UTF-8 strings
- a Benchmark class, to test the improvements of using the Aho-Corasick algorithm instead of regular expressions
Here follows the original README:
The following commands require Apache Maven, which can be found here:
http://maven.apache.org/
Building the jar
To compile the jar, run `maven install'. The resulting jar should be created in:
target/ahocorasick-
Building the documentation
To build the javadocs, run `maven install'. The javadocs should be created in:
target/ahocorasick-