Andy Jackson

Results 167 comments of Andy Jackson

Via @ikreymer I found out that JWAT has WarcReaderFactory.isWarcRecord(), ArcReaderFactory.isArcRecord() and GzipReader.isGzipped(), so it might be possible to clean up the current duplicated sniffing code and use them instead.

How are you indexing the WARCs?

For the original use case, it might be sufficient to just omit the status code unless we're sure of it.

Perhaps best explored via an overlay-style project?

Similar to #29 I think.

I believe this concerned ensuring that the performance and user experience were acceptable when a particular page or host had a very large number of instances.

Should we bake CDX-Server in as a default, and deprecate XML Query?

I'm very confused that this kind of basic issue snuck into the release. Can you describe how you fired up OpenWayback?

Please note that these classes have been in differently named packages recently, i.e. org.netpreserve.commons:webarchive-commons used to be org.netpreserve.commons:commons-web. Similarly, if you also have Heritrix on your classpath, you may bring...