rotating-free-proxies icon indicating copy to clipboard operation
rotating-free-proxies copied to clipboard

Use freely available proxies automatically for scrapy

rotating-free-proxies

Avoid being banned by webpages when you crawl them. This is an extension to the amazing scrapy-rotating-proxies_ library. The main target of this library is to get proxies dynamically when the spider is running. This library automatically fetches freely available lists of proxies from free-proxy-list.net_.

.. _scrapy-rotating-proxies: https://pypi.python.org/pypi/rotating-free-proxies .. _free-proxy-list.net: https://free-proxy-list.net/

Installation

::

pip install rotating-free-proxies

Usage

After installing you need to do just add following two variables in settings.py of your Scrapy project ::

ROTATING_PROXY_LIST_PATH = '/my/path/proxies.txt' # Path that this library uses to store list of proxies
NUMBER_OF_PROXIES_TO_FETCH = 5 # Controls how many proxies to use


DOWNLOADER_MIDDLEWARES = {
    'rotating_free_proxies.middlewares.RotatingProxyMiddleware': 610,
    'rotating_free_proxies.middlewares.BanDetectionMiddleware': 620,
}

For further details on using this library, refer to the original readme_.

.. _readme: https://github.com/TeamHG-Memex/scrapy-rotating-proxies/blob/master/README.rst

Thank you!