Mark H. Butler

5 comments by Mark H. Butler

This commit adds this functionality: https://github.com/butlermh/behemoth/commit/87ce262e9f41a07d1025c98790dcb3f9870591b2

Hi Grant, I have made these changes in my branch. Input and output arguments are now standardized, and the code uses Apache Commons CLI as you requested. See https://github.com/butlermh/behemoth

See https://github.com/butlermh/behemoth/commit/7411aa9cbd0fd1bddd61545a9a503daff5d8dcf8. It turns out that updating to the new Hadoop MapReduce API is a bad idea: DistributedCache does not work with the new API. See https://issues.apache.org/jira/browse/MAPREDUCE-898 and http://lucene.472066.n3.nabble.com/Distributed-Cache-with-New-API-td722187.html. This breaks the SOLR,...

See also http://autofei.wordpress.com/2011/04/07/distributedcache-incompleted-guide/

In the end, I did manage to find a way of doing this, except for WARC - see https://github.com/butlermh/behemoth/commit/97150bd579ae74eefacae85422937698f2c72445