bagweb
CDX and format identification
wget writes a CDX index alongside the WARC file. If siegfried is installed, bagweb also generates a CSV file with format identification for the WARC content.
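The two steps described above could be sketched roughly as follows. This is an illustration only, not bagweb's actual code: the function names (`wget_command`, `siegfried_command`, `identify_formats`) are hypothetical, and the exact wget options bagweb passes are an assumption. The sketch relies on wget's `--warc-file` and `--warc-cdx` options and on siegfried's `sf -z -csv` invocation, which scans inside archive formats and prints CSV.

```python
import shutil
import subprocess


def wget_command(url, warc_name):
    """Build a wget call that writes <warc_name>.warc.gz and a CDX index.

    The option set is an assumption; bagweb may use different flags.
    """
    return ["wget", "--page-requisites",
            f"--warc-file={warc_name}", "--warc-cdx", url]


def siegfried_command(warc_path, which=shutil.which):
    """Return an sf command list, or None when siegfried is not on PATH."""
    sf = which("sf")
    if sf is None:
        return None
    # -z scans inside archive formats (including WARC), -csv selects CSV output
    return [sf, "-z", "-csv", str(warc_path)]


def identify_formats(warc_path, csv_path, which=shutil.which):
    """Write the format identification CSV; return False if sf is missing."""
    cmd = siegfried_command(warc_path, which=which)
    if cmd is None:
        return False  # optional step: skip quietly when siegfried is absent
    with open(csv_path, "w", encoding="utf-8") as out:
        subprocess.run(cmd, stdout=out, check=True)
    return True
```

A crawl would then run `subprocess.run(wget_command(url, "site"))` followed by `identify_formats("site.warc.gz", "site.csv")`. Passing `which` as a parameter keeps the "is siegfried installed" check easy to exercise without the tool present.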