browsertrix-crawler
browsertrix-crawler copied to clipboard
Support Screenshot Creation, resolves #11
- add command-line option
--screenshot
to enable screenshot creation - save screenshots as resource records in WARC files
- per page two screenshots are taken:
- one showing the visible screen area
- full page screenshot (not saved in WARC if identical to first screenshot)
Open questions / to be discussed:
- put screenshots into the combined WARCs?
- screenshot location? First version: in a collection subdirectory
screenshots/
. By placing them inarchive/
, they'd be automatically wrapped into the combined WARCs. - screen resolution (
viewport
) if not set implicitly by--mobileDevice
- screenshot file names: So far, one WARC file per page (WARC file name includes the page UUID)
@sebastian-nagel would you be open to adding some simple tests for this? You probably also want to add the flag to our readme where we explain the functionality.
Sure. Will add a test and update the README.
Superseded by https://github.com/webrecorder/browsertrix-crawler/pull/188