Tessa Walsh

Results 68 issues of Tessa Walsh

Creating DFXML with walk_to_dfxml.py (called with Python 3.5.2 in Ubuntu 16.04) on two mounted disk images resulted in the following errors: > Traceback (most recent call last): > File "/usr/share/dfxml/python/walk_to_dfxml.py",...

## Description Werkzeug deprecated the `useragents` module in 2.0.x and removed it entirely in 2.1.x in favor of `werkzeug.user_agent`. This commit modifies the `rewriterapp` to import `UserAgent` from `user_agent` and...

## Describe the bug The 2.1.0 release of werkzeug [removed the `werkzeug.useragents` module](https://werkzeug.palletsprojects.com/en/2.1.x/changes/#version-2-1-0), which pywb uses in the `rewriterapp`. This causes the `wayback` command to throw a 500 Internal Server...

Solution for CCA Disk Image Processor mounts disk image, copies files using shutil.copytree, and then unmounts image in Bitcurator. Difficulty is in making such an approach work across different OSes.

Connected to https://github.com/tw4l/brunnhilde/issues/53 Brunnhilde/Siegfried report on the file created and modified dates as they are in the file system where files are being scanned. Sometimes files contain more accurate timestamps...

Extract metadata from files, using [Apache Tika](https://tika.apache.org), [exiftool](https://exiftool.org/) or native Python libraries. Questions: * Tool(s) used to extract metadata: Apache Tika](https://tika.apache.org), [exiftool](https://exiftool.org/), [Hachoir](https://hachoir.readthedocs.io/en/latest/), ... * Which schemas/formats: EXIF, XMP... *...

## What's happening If I upload a WACZ or WARC file to replayweb.page that has the identical name of an existing uploaded file (e.g. `screenshots.warc.gz`), I am taken to the...

Fixes #881 ## Description This PR adds the option to use `*,` as a SURT in ACLJ files to match any URL. This builds off of the existing ability to...

## Is your feature request related to a problem? Please describe. Currently, if a user wants to set `default_access: block` but allow particular users to have access to any URL,...