browsertrix-crawler icon indicating copy to clipboard operation
browsertrix-crawler copied to clipboard

Support Screenshot Creation, resolves #11

Open sebastian-nagel opened this issue 3 years ago • 2 comments

  • add command-line option --screenshot to enable screenshot creation
  • save screenshots as resource records in WARC files
  • per page two screenshots are taken:
    1. one showing the visible screen area
    2. full page screenshot (not saved in WARC if identical to first screenshot)

Open questions / to be discussed:

  • put screenshots into the combined WARCs?
  • screenshot location? First version: in a collection subdirectory screenshots/. By placing them in archive/, they'd be automatically wrapped into the combined WARCs.
  • screen resolution (viewport) if not set implicitly by --mobileDevice
  • screenshot file names: So far, one WARC file per page (WARC file name includes the page UUID)

sebastian-nagel avatar Apr 14 '21 14:04 sebastian-nagel

@sebastian-nagel would you be open to adding some simple tests for this? You probably also want to add the flag to our readme where we explain the functionality.

emmadickson avatar Apr 23 '21 15:04 emmadickson

Sure. Will add a test and update the README.

sebastian-nagel avatar Apr 27 '21 14:04 sebastian-nagel

Superseded by https://github.com/webrecorder/browsertrix-crawler/pull/188

tw4l avatar Nov 17 '22 17:11 tw4l