Tessa Walsh

219 comments by Tessa Walsh

So for your use case I think `--failOnFailedSeed` and `--failOnInvalidStatus` together should work well, or `--failOnInvalidStatus` and `--failOnFailedLimit 1` if you want the crawler to mark the crawl as a failure...

> Hm, it does look like there might be an issue with the latter (`--failOnInvalidStatus` + `--failOnFailedLimit 1`) without `--failOnFailedSeed`. Looks like we weren't awaiting the result of `crawlState.numFailed()`! Fix...
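To illustrate the kind of bug described here, a minimal sketch follows. It is not the actual browsertrix-crawler code: `FakeCrawlState`, `overLimitBuggy`, and `overLimitFixed` are hypothetical names, and the only assumption carried over from the comment is that `numFailed()` is asynchronous and returns a Promise.

```typescript
// Hypothetical stand-in for the crawl state; only the async shape matters here.
class FakeCrawlState {
  failed: number;

  constructor(failed: number) {
    this.failed = failed;
  }

  // Returns a Promise, as an async counter backed by a remote store would.
  async numFailed(): Promise<number> {
    return this.failed;
  }
}

// Buggy variant: the Promise itself is compared with the limit.
// The cast mimics loosely typed code; strict TypeScript would reject the comparison.
function overLimitBuggy(state: FakeCrawlState, limit: number): boolean {
  const failed = state.numFailed() as unknown as number; // missing await
  // A Promise coerces to NaN in a numeric comparison, so this is always false.
  return failed >= limit;
}

// Fixed variant: await the count before comparing it with the limit.
async function overLimitFixed(
  state: FakeCrawlState,
  limit: number,
): Promise<boolean> {
  const failed = await state.numFailed();
  return failed >= limit;
}
```

With three failed pages and a limit of 1, the buggy variant never reports the limit as reached, while the fixed variant does. That would be consistent with a failure limit appearing to have no effect.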

Hi @steph-nb, the block rules target requests from specific URLs, so if you have a page at example.com with an iframe loading content from othersite.com and add a block rule...

There seems to be an issue with Elasticsearch in the Docker development environment: ``` scope on  master [!] is 📦 v1.0.0-beta.1 on 🐳 v20.10.7 via 🐍 3.9.4 took 5s...

The `elasticsearch` client library needs to be pinned to a version below `7.14` - see: https://github.com/elastic/elasticsearch-py/issues/1639#issuecomment-883319286

Related js-wacz PR: https://github.com/harvard-lil/js-wacz/pull/89

Closed in favor of https://github.com/webrecorder/browsertrix-crawler/issues/674

Thank you for doing this investigation! I agree that it'd be wise to tweak the default profile to fit user expectations a bit better. We could also potentially have a...

I have a vague memory now of this being commented out in relation to the development of the Vue banner UI, but for the life of me can't remember why....

Ah, I dug back through our internal Discord conversations and remember now! This was our hacky temporary solution at the time to get pywb to populate the calendar from a...