Data-Engineering-Projects
Data-Engineering-Projects copied to clipboard
Bump scrapy from 2.6.1 to 2.6.2 in /3. Web Scraping using Scrapy, Mongo ETL
Bumps scrapy from 2.6.1 to 2.6.2.
Release notes
Sourced from scrapy's releases.
2.6.2
Fixes a security issue around HTTP proxy usage, and addresses a few regressions introduced in Scrapy 2.6.0.
See the changelog.
Changelog
Sourced from scrapy's changelog.
Scrapy 2.6.2 (2022-07-25)
Security bug fix:
When :class:
~scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddlewareprocesses a request with :reqmeta:proxymetadata, and that :reqmeta:proxymetadata includes proxy credentials, :class:~scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddlewaresets theProxy-Authenticationheader, but only if that header is not already set.There are third-party proxy-rotation downloader middlewares that set different :reqmeta:
proxymetadata every time they process a request.Because of request retries and redirects, the same request can be processed by downloader middlewares more than once, including both :class:
~scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddlewareand any third-party proxy-rotation downloader middleware.These third-party proxy-rotation downloader middlewares could change the :reqmeta:
proxymetadata of a request to a new value, but fail to remove theProxy-Authenticationheader from the previous value of the :reqmeta:proxymetadata, causing the credentials of one proxy to be sent to a different proxy.To prevent the unintended leaking of proxy credentials, the behavior of :class:
~scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddlewareis now as follows when processing a request:
If the request being processed defines :reqmeta:
proxymetadata that includes credentials, theProxy-Authorizationheader is always updated to feature those credentials.If the request being processed defines :reqmeta:
proxymetadata without credentials, theProxy-Authorizationheader is removed unless it was originally defined for the same proxy URL.To remove proxy credentials while keeping the same proxy URL, remove the
Proxy-Authorizationheader.If the request has no :reqmeta:
proxymetadata, or that metadata is a falsy value (e.g.None), theProxy-Authorizationheader is removed.It is no longer possible to set a proxy URL through the :reqmeta:
proxymetadata but set the credentials through theProxy-Authorizationheader. Set proxy credentials through the :reqmeta:proxymetadata instead.
... (truncated)
Commits
aecbccbBump version: 2.6.1 → 2.6.2af7dd16Merge pull request from GHSA-9x8m-2xpf-crp34205609Fixed intersphinx referencese3e69d1Pin documentation requirements (#5536)54bfb96Cover #5525 in the 2.6.2 release notes (#5535)4ef7182If TWISTED_REACTOR is None, reuse any pre-installed reactor (#5528)1c1cd5dUpdate the 2.6.2 release notes84c29a2Unset the release date of still-unreleased 2.6.2 (#5503)b9b9422Merge pull request #5482 from alexpdev/parse_help_msg915c288edit- Additional commits viewable in compare view
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
@dependabot rebasewill rebase this PR@dependabot recreatewill recreate this PR, overwriting any edits that have been made to it@dependabot mergewill merge this PR after your CI passes on it@dependabot squash and mergewill squash and merge this PR after your CI passes on it@dependabot cancel mergewill cancel a previously requested merge and block automerging@dependabot reopenwill reopen this PR if it is closed@dependabot closewill close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually@dependabot ignore this major versionwill close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this minor versionwill close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this dependencywill close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)@dependabot use these labelswill set the current labels as the default for future PRs for this repo and language@dependabot use these reviewerswill set the current reviewers as the default for future PRs for this repo and language@dependabot use these assigneeswill set the current assignees as the default for future PRs for this repo and language@dependabot use this milestonewill set the current milestone as the default for future PRs for this repo and language
You can disable automated security fix PRs for this repo from the Security Alerts page.