Tessa Walsh comments

Results 216 comments of


                                            Tessa Walsh

WACZ-files dowloaded from Browsertrix and then uploaded to Browsertrix using "Upload WACZ" contains 0 pages

Hi Anders! Thanks for reporting this, definitely seems like a bug we'd want to address. By any chance, are the WACZ files you're uploading from Browsertrix multi-WACZs? Just a hunch...

WACZ-files dowloaded from Browsertrix and then uploaded to Browsertrix using "Upload WACZ" contains 0 pages

I have verified that this is an issue with multi-waczs, where our routine to read the pagelist on WACZ upload doesn't account for multi-WACZ. Unfortunately the remotezip library we're using...

[Feature]: How to have >100 e.g. 10K or more seeds in a "list of pages".

@tw4l To create feature document as first step, likely implementation involves uploading list as a file that crawler can download

[Feature]: How to have >100 e.g. 10K or more seeds in a "list of pages".

Supported in 1.18, which will be released shortly!

[Feature]: Add the "Supplement" feature after the archive is complete

Hi @xiaozhile, thanks for the report! If I'm understanding your use case correctly, this is something you should already be able to do in ArchiveWeb.page. If you open the browser...

Check page URLs for extension before direct fetch attempt

I wonder if it might be better to direct fetch any URL that ends in a file extension (and that's not `.html` or `.htm`, since some older sites followed that...

Check page URLs for extension before direct fetch attempt

> Yeah, maybe that's a smaller list to maintain, would also include .asp, .php, etc.. Another option is to always try browser load, and then if non-HTML, add extension to...

Out of disk space despite having enough disk space

Hi @pato-pan , on the latest Browsertrix Crawler releases (since 1.6.3), the disk utilization check should be disabled by default. It looks like you're hitting a related but different check...

[Bug]: web games not saving progress. resets after doing F5... 2nd bug is: in Fandom wiki, images not saving

Hi @furllmm, thanks for the issue. Have you tried enabling "Archive local storage" in the extension settings? First, open Settings by clicking on the cog icon in the extension homepage,...

[Bug]: web.archive.org not archiving or playing properly

Hi, this is a known issue - our tools tend not to do capture/replay existing web archives well. The issue stems from the fact that the Internet Archive's Wayback Machine...