JustAnotherArchivist
Cf. #490 and #491. Environment variables in the preflight test are not modified, but inside the pipeline, only the ones specified in `wpull_env` in `pipeline.py` are passed to wpull....
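The allowlist behaviour described in this issue can be illustrated with a minimal sketch. It is not the code from `pipeline.py`: the helper `build_child_env` and the variable names in the example are hypothetical, chosen only to show how a child process ends up seeing a subset of the parent environment.

```python
import os


def build_child_env(allowed_names, source_env=None):
    """Return a new dict holding only the allowed variables that are set.

    Hypothetical helper: it mimics an allowlist-style environment filter,
    it is not ArchiveBot's actual implementation.
    """
    if source_env is None:
        source_env = os.environ
    return {name: source_env[name] for name in allowed_names if name in source_env}


# Hypothetical parent environment and allowlist, for illustration only.
parent_env = {
    "ITEM_IDENT": "abc",
    "HTTP_PROXY": "http://proxy.invalid:8080",
    "HOME": "/home/ab",
}
child_env = build_child_env(["ITEM_IDENT", "HOME", "NOT_SET"], parent_env)
```

Under these assumptions, any variable missing from the allowlist (here `HTTP_PROXY`) never reaches the child process, which is the kind of discrepancy between preflight and pipeline that the issue describes.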
ArchiveBot does not retry `[Errno 111] Connection refused` errors. I think it should and consider this a bug. This simply requires adding the wpull option `--retry-connrefused`.
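A minimal sketch of the proposed change, assuming the wpull arguments are assembled as a Python list. The helper `with_connrefused_retry` and the contents of `base_args` are hypothetical placeholders; only the `--retry-connrefused` option itself comes from the issue text.

```python
def with_connrefused_retry(wpull_args):
    """Return a copy of the argument list that contains --retry-connrefused once."""
    args = list(wpull_args)  # copy so the caller's list is not mutated
    if "--retry-connrefused" not in args:
        args.append("--retry-connrefused")
    return args


# Hypothetical existing argument list, for illustration only.
base_args = ["--tries", "3"]
new_args = with_connrefused_retry(base_args)
```

The membership check keeps the helper idempotent, so applying it twice does not duplicate the option.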
* Facebook doesn't like our UA.
* Twitter requires the AB UA for useful archival.
* Tumblr requires a browser UA.
* Flickr blocks AB UA requests with a 503....
Some cookies or cookie values adversely affect archival. For example, many classic forum software packages let the user choose between different view modes (linear, threaded, hybrid), styles, or...
I just realised that a feature we've been talking about for years in `#archivebot` still isn't filed here: bulk ignore handling. The issue at hand is that wpull is fairly...
On job 6g7jcc64ct3ad8izr4dz82xdl, some URLs contained braces (`{` and `}`). These were displayed correctly in the log window, but when copying ignore patterns from the context menu, `%7B`...
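For reference, the relationship between the raw braces and the `%7B` form can be reproduced with Python's standard `urllib.parse`. This is only a sketch of percent-encoding in general, not of the dashboard's copy logic; the example path is hypothetical.

```python
from urllib.parse import quote, unquote

raw_path = "/items/{id}/view"         # hypothetical URL path containing braces
encoded_path = quote(raw_path)        # braces are percent-encoded
decoded_path = unquote(encoded_path)  # decoding restores the raw braces
```

A pattern written against one form will not match a URL recorded in the other, which is presumably why the difference matters for ignore patterns.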
Recently, on some pipelines, aborting a job has started to throw logging exceptions at the end, namely `NameError: name 'open' is not defined`. This exception is raised in the `logging` module's `__init__.py` inside...
A basic ignore for [ikiwiki](https://ikiwiki.info/): `/ikiwiki\.cgi\?(.*&)?do=(create|edit|revert)(&|$)` Thanks, @anarcat!
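The ikiwiki ignore pattern above can be sanity-checked with a short script. It assumes the pattern is applied as a Python regular expression searched anywhere in the URL; the `example.org` URLs and the helper `is_ignored` are made up for illustration.

```python
import re

# Pattern copied from the issue text.
IKIWIKI_IGNORE = re.compile(r"/ikiwiki\.cgi\?(.*&)?do=(create|edit|revert)(&|$)")


def is_ignored(url):
    """Return True if the URL would be matched by the ignore pattern."""
    return IKIWIKI_IGNORE.search(url) is not None


# Hypothetical URLs: the first two are edit/create actions, the rest are not.
samples = {
    "https://example.org/ikiwiki.cgi?do=edit&page=foo": is_ignored(
        "https://example.org/ikiwiki.cgi?do=edit&page=foo"),
    "https://example.org/ikiwiki.cgi?page=foo&do=create": is_ignored(
        "https://example.org/ikiwiki.cgi?page=foo&do=create"),
    "https://example.org/ikiwiki.cgi?do=search&P=x": is_ignored(
        "https://example.org/ikiwiki.cgi?do=search&P=x"),
    "https://example.org/ikiwiki.cgi?undo=edit": is_ignored(
        "https://example.org/ikiwiki.cgi?undo=edit"),
}
```

The `(.*&)?` group lets `do=` appear after other query parameters, while the trailing `(&|$)` prevents values such as `do=editor` from matching.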
Job c6gd8eb8rwk5su6ijj5g52sv5 crashed with the following traceback: Starting StartHeartbeat for Item Finished StartHeartbeat for Item Starting SetFetchDepth for Item Finished SetFetchDepth for Item Starting PreparePaths for Item Finished PreparePaths for...
Job 2dpu0yxg4tnyzemryrs5iyvxs just crashed while trying to write the WARC `wpullinc` file: ``` Starting WgetDownload for Item Manhole[3714:1569868995.0911]: Patched and . Manhole[3714:1569868995.0947]: Manhole UDS path: /tmp/manhole-3714 Manhole[3714:1569868995.0949]: Waiting for new...