scrapy-playwright icon indicating copy to clipboard operation
scrapy-playwright copied to clipboard

With playwright 1.55.0 downloads don't go in scrapy body instead get exception

Open bluthen opened this issue 4 months ago • 2 comments

I was chromium browser type using setting

            "playwright_page_goto_kwargs": {"wait_until": "networkidle"},

In playwright 1.55.0 I kept getting error about file page() and file downloading, like:

playwright._impl._errors.Error: Page.goto: Download is starting
Call log:

navigating to "http://localhost:8081/something.docx", waiting until "networkidle"

Whereas playwright 1.54.0 was working fine with scrapy-playwright. Even without accept-downloads: True.

Downgrading to playwright 1.54.0 allowed me to access these files like regular scrapy response and use response.body to get the content instead of getting the exception.

Something must of changed in playwright 1.55.0 that doesn't let playwright-scrapy handle files that want to be downloadable like it did before.

bluthen avatar Aug 30 '25 18:08 bluthen

This is probably the cause: https://github.com/microsoft/playwright/pull/37082

Molkree avatar Sep 01 '25 14:09 Molkree

Thanks both for the report and the upstream reference.

elacuesta avatar Sep 10 '25 13:09 elacuesta