Nick Sweeting

Results 1234 comments of Nick Sweeting

Would go well with: https://github.com/checkly/puppeteer-recorder

The beginnings of this will start to be implemented with our move from `chromium-browser` to ~~`pyppeteer`~~ playwright: #177, then these will be possible: - support for scripted user flows (this...

Sweet, the super-rough planned design is for ArchiveBox to run user-provided scripts like this: ```python archive_scripts = { 'dismiss_modals: '() => {document.querySelectorAll(".modal").delete()}', ... } browser = await launch() page =...

If possible, work on the Phase III scripts first. Those would be most helpful to me, as I've already started work on the phase I and II steps you outlined...

I found some huge repositories of Seleneium/Puppeteer scripts for dismissing modals and logging in to lots of sites. These are going to be super useful: - https://github.com/CriseLYJ/awesome-python-login-model - https://github.com/facert/awesome-spider -...

Whoops closed/reopened by accident. A quick update for those following this issue, we have a number of blocking tasks before we're going to get around to this: 1. Finish refactoring...

I've started mocking up what a playwright-based pluginized refactor would look like for ArchiveBox, and I think it's pretty elegant so far! This is still a ways away, but I'm...

Not yet but soon! It's just in a gist right now. Will publish it once I've moved >50% of the old codebase into the new structure. I'm traveling in Mexico...

The changes are unfortunately complicated because I use some internal Django template components to build the UI (specifically the table-view action buttons), and those components were completely rewritten after 3.1.

I've been investigating using `pywb`'s `wayback --proxy-record --proxy archivebox` and `google-chrome --proxy-server=http://localhost:8080 --ignore-certificate-errors --disable-web-security` to pipe all chrome and wget requests into a warc file. So far it looks promising,...