
Use warc-hadoop library

Open • jnioche opened this issue 9 years ago • 0 comments

The warc-hadoop library (https://github.com/ept/warc-hadoop) could be used as a dependency for handling the WARC format in Hadoop. This would be cleaner than maintaining our own copy of the lemurproject classes, as we currently do.

jnioche • Apr 20 '15 14:04