Andy Jackson
And @ruebot makes this suggestion, which LGTM, I think: https://gist.github.com/ruebot/f2c0493bb205ba2db9ef0363ff8d7e50
I think we should go ahead with your changelog proposal. Did you notice any major problems with the README or wiki?
N.B. I added changelog generation to https://github.com/internetarchive/heritrix3/wiki/HOWTO-Ship-a-Heritrix-Release
However, looking at the code in question, it appears that the `ExtractorHTML` extracts links that might be URLs from any `
Apparently this happens a lot with `og:facebook-tags` attributes. Perhaps given the change in usage of these fields in recent years, it's time to change the default behaviour to avoid this...
I've spent some time looking at generic JavaScript annotation frameworks. They seem to offer a nice separation of concerns, allowing the annotation backend to be kept apart from Wayback/whatever. Unfortunately,...
Sorry for missing this. I can't remember if I did this or if Carl Wilson set it up.
This is another one of those times I'd prefer an `iframe` approach, as the fact that this is known to be damaged from capture could be relayed around the edge....
@ldko yes, that is possible - if the ClassLoader has to open a new file to find the class in question, it'll fail if there are too many file handles...
A good list of HTML diffing tools here: http://www.w3.org/wiki/HtmlDiff