Lewis John McGibbney

Results 287 comments of Lewis John McGibbney
trafficstars

hi @chrismattmann the regex-urlfilter.txt can be found here https://www.dropbox.com/s/hl6wlvwbr4xrv81/regex-urlfilter.txt?dl=0

@snowangelwmy if you look at the URL @chrismattmann defined, you will see that he's referenced a SNAPSHOT. This is so we can use some of the newer features of Tika....

Hi Mohamed, This is fantastic thank you for the response. I created an issue in our Jira issue tracker for this https://issues.apache.org/jira/browse/NUTCH-1923 We recently added an HBase Docker container, this...

Hi Mohamed, On Fri, Feb 13, 2015 at 9:40 AM, Mohamed Meabed [email protected] wrote: > Thanks @chrismattmann https://github.com/chrismattmann :), i would love > to contribute back to the community. >...

Hi @Meabed can you possibly base your patch off the 2.X branch and then submit a patch to the issue on the Nutch Jira tracker? https://issues.apache.org/jira/browse/NUTCH-1923 Thanks if you can...

Hi Mohamed, Can you please submit a pull request against te Nutch 2.x branch ? Thanks very much Lewis On Sunday, March 15, 2015, Mohamed Meabed [email protected] wrote: > Hi...

Hi @Meabed please see https://issues.apache.org/jira/browse/NUTCH-1923 and comment :) I am really looking forward to this being part of the Nutch releases.

The logging now looks as follows ```INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor #0] Found 1 URLExemptionFilter implementations: '[org.apache.nutch.urlfilter.ignoreexempt.ExemptionUrlFilter@3090c372]’```. If no URLExemptionFilter implementations are found then no log statement is produced.

Excellent @sebastian-nagel 👍 I agree