bagweb
bagweb copied to clipboard
mirror a website, put it in a bag
bagweb doesn’t first check if both `wget` and `bagit.py` are installed before trying to run them: https://github.com/edsu/bagweb/blob/62e4d0625d634bc0ff289f417d3169b1a66f9c7f/bagweb#L3 This pull request to add those verifications – including checks for `bagit` via...
bagweb doesn’t first check if both `wget` and `bagit.py` are installed before trying to run them: https://github.com/edsu/bagweb/blob/62e4d0625d634bc0ff289f417d3169b1a66f9c7f/bagweb#L3
wget writes cdx index; if siegfried is installed generate a csv with file format identification of warc content.