linkchecker icon indicating copy to clipboard operation
linkchecker copied to clipboard

Duplicate checks?

Open jimpriest opened this issue 10 years ago • 4 comments

Same link, parent and timestamp? Why is this recorded more than once? In this instance it was listed 12 times?

Looking in my csv file I see:

7/21/2015  //www.sameurl.net http://www.parenturl.com/samepage 403 Forbidden FALSE -1  0.9819278717
7/21/2015  //www.sameurl.net http://www.parenturl.com/samepage 403 Forbidden FALSE -1    0.9819278717
7/21/2015  //www.sameurl.net http://www.parenturl.com/samepage 403 Forbidden FALSE -1  0.9819278717

jimpriest avatar Jul 22 '15 20:07 jimpriest

+1 ?

AlexAndrascu avatar Oct 10 '16 17:10 AlexAndrascu

This is very annoying, since it makes outputs quite chaotic. Also it raises suspicion, that Linkchecker does check every link more than once, which could slow it down and generate unnecessary load.

PetrDlouhy avatar Nov 09 '16 02:11 PetrDlouhy

There are two reasons for this - one is described under PR #687, other is that cache is restricted to 100 000 items for memory usage reasons and this can't be changed from command line.

PetrDlouhy avatar Nov 09 '16 15:11 PetrDlouhy

Thank you for the issue report. Sadly this project is dead, and a new team is around with https://github.com/linkcheck/linkchecker for more details please see: #708 Also please close this issue and report it freshly on the new repo https://github.com/linkcheck/linkchecker/issues if your issue still persists

dpalic avatar Oct 29 '17 09:10 dpalic