Andy Jackson

Results 167 comments of Andy Jackson

Given most of us generate CDX files from the CLI for small collections, maybe the out-of-the-box collection should just automate the process of generating and updating CDX files and use...

This sounds like exactly the kind of thing we should defer to the container for. Indeed, [Tomcat already supports persistent sessions](http://tomcat.apache.org/tomcat-5.5-doc/config/manager.html#Persistence_Across_Restarts).

Okay, I think I understand that, but I don't understand whether that means we really need our own session persistence mechanism. Am I right in thinking that this complexity just...

In case anyone is able to help test this, I've made a distribution of this version available here: http://webarchive.org.uk/downloads/openwayback.tar.gz

See http://blog.archive.org/2013/10/24/web-archive-404-handler-for-webmasters/ for context. I guess the question is whether OpenWayback default setup supports this, i.e. exposing the API and including the Javascript etc.

The CDX server already provides one API, although given that IA implement this feature using a separate API hook, maybe there are issues with directly exposing the CDX server to...

Maybe this will help: https://stackoverflow.com/questions/2687220/maven-check-for-updated-dependencies-in-repository/21073104

BTW, here's what I posted on the mailing list: --- It looks like Ilya changed the Indexer at this point in time: https://github.com/iipc/openwayback/commit/b8315edb700e5d320ee053848d49993ff235c609 However, as far as I can tell,...

I've put some rough notes on all this here: http://iipc.github.io/warc-specifications/specifications/cdx-format/cdx-2015/ (generated from https://github.com/iipc/warc-specifications/blob/gh-pages/specifications/cdx-format/cdx-2015/index.md).

I created a test case (not committed as I'm not sure of licensing of problem.warc and maybe it's a bit on the big side for a test file): ``` java...