John Erik Halse
The current paging API is described here: https://github.com/iipc/openwayback/tree/master/wayback-cdx-server-webapp#pagination-api. The paging API has some shortcomings: - Only available with ZipNum indexes - Can only use one index at a time - Some...
Because Spring is used for configuration, almost all non-private methods in all classes must be considered part of the API. By replacing config with some other, more restricted mechanism,...
There should be a guide for coding style on the wiki. This style should preferably be enforced in the build, but since that requires a lot of changes in the...
Some information can be found in the old documentation: http://archive-access.sourceforge.net/projects/wayback/administrator_manual.html#Replay_Modes. But this document is incomplete, and bugs in the current code make it difficult to find out exactly what is allowed...
The configuration files contain commented-out code as examples of alternative configurations. This code suffers from not being updated when the code changes. In addition, it makes the important parts of the...
Webarchive-commons is quite a big library with many dependencies on third-party libraries. It should be split into smaller parts to enable projects to include only what they need....
As of April 2016, Java 7 has reached its end of public updates. We should move to Java 8 both for this reason and to be able to utilize enhancements...
Replaces code for looking up public suffixes with similar code from Google Guava. This change breaks one class in Heritrix: org.archive.crawler.processor.HashCrawlMapper. It should be easy to fix, though.
Java 6 has reached its end of public updates, so we should move to Java 7. See the discussion on #3 for the implications of this move.
It is possible to use the library as a command line tool. Several classes implement `public static void main`. See #3 for the list of classes. There is no documentation...
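For readers unfamiliar with the pattern, here is a minimal sketch of a class that works both as a library and as a command line tool through `public static void main`. The class name `UpperCaseTool` and its `convert` method are hypothetical and are not taken from webarchive-commons; the actual classes are the ones listed in #3.

```java
import java.util.Locale;

/** Hypothetical illustration only; not a class from webarchive-commons. */
class UpperCaseTool {

    /** Library entry point: callable from other Java code. */
    public static String convert(String input) {
        return input.toUpperCase(Locale.ROOT);
    }

    /** Command line entry point: validates arguments, then delegates to the library method. */
    public static void main(String[] args) {
        if (args.length != 1) {
            System.err.println("Usage: java UpperCaseTool <text>");
            System.exit(1);
        }
        System.out.println(convert(args[0]));
    }
}
```

Keeping `main` as a thin wrapper around an ordinary method is one way to make such tools easier to document and test, since the command line behaviour and the library behaviour stay in one place.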