proxy-scraper-checker
HTTP, SOCKS4, SOCKS5 proxies scraper and checker with rich functionality.

HTTP, SOCKS4, SOCKS5 proxies scraper and checker.
- Asynchronous.
- Uses regex to find proxies in ip:port format on a web page, so proxies can be extracted even from JSON without any changes to the code.
- Supports determining the geolocation of the proxy exit node.
- Can determine if a proxy is anonymous.
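As a rough illustration of the regex-based search mentioned in the list above, here is a minimal sketch. The pattern `PROXY_RE` and the helper `extract_proxies` are hypothetical names written for this example; the project's actual regex may differ. The sketch assumes IPv4 addresses followed by a colon and a port, and it treats the page as plain text, which is why JSON needs no special handling.

```python
import re
from typing import Set

# Hypothetical pattern for illustration; not taken from the project's source.
# Four octets in the range 0-255, a colon, then a port of 1 to 5 digits.
_OCTET = r"(?:25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)"
PROXY_RE = re.compile(
    r"(?<![\d.])"                       # do not start in the middle of a number
    r"((?:" + _OCTET + r"\.){3}" + _OCTET + r")"
    r":(\d{1,5})(?!\d)"                 # port, not followed by another digit
)


def extract_proxies(text: str) -> Set[str]:
    """Return every ip:port pair found in arbitrary text (HTML, JSON, plain)."""
    found = set()
    for match in PROXY_RE.finditer(text):
        host, port = match.group(1), int(match.group(2))
        if 1 <= port <= 65535:          # discard out-of-range ports
            found.add(f"{host}:{port}")
    return found
```

Because the function only looks at raw text, the same call works on an HTML page, a JSON response, or a plain list. A JSON document that stores the address and port in separate fields would not match, since the sketch only looks for the ip:port form.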
For a version that uses Python's built-in logging instead of rich, see the simple-output branch.
You can get proxies obtained using this script in monosans/proxy-list.
Usage
- Install Python (Windows 7 requires Python 3.8.X). During installation, be sure to check the "Add Python to PATH" box.
- Download and unpack the archive with the program.
- Install dependencies from requirements.txt (cd into the unpacked folder and run python -m pip install -U -r requirements.txt on the command line).
- Edit config.ini according to your preference.
- Run main.py (python main.py on the command line).
Folders description
When the script finishes running, the following folders will be created (this behavior can be changed in the config):
- proxies - proxies with any anonymity level.
- proxies_anonymous - anonymous proxies.
- proxies_geolocation - same as proxies, but includes the exit node's geolocation.
- proxies_geolocation_anonymous - same as proxies_anonymous, but includes the exit node's geolocation.
Geolocation format is ip:port|Country|Region|City.
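If you want to consume the geolocation output in your own code, a line in this format can be split as in the sketch below. `GeoProxy` and `parse_geo_line` are hypothetical names introduced for this example and are not part of the script. The sketch assumes each line holds exactly four fields separated by `|` and that none of the fields contains a `|` character.

```python
from typing import NamedTuple


class GeoProxy(NamedTuple):
    """One parsed line of the ip:port|Country|Region|City format."""
    host: str
    port: int
    country: str
    region: str
    city: str


def parse_geo_line(line: str) -> GeoProxy:
    # Raises ValueError if the line does not have exactly four "|"-separated fields.
    address, country, region, city = line.strip().split("|")
    # Split on the last colon so the port is always the final component.
    host, _, port = address.rpartition(":")
    return GeoProxy(host, int(port), country, region, city)
```

For example, `parse_geo_line("1.2.3.4:8080|United States|California|Los Angeles")` yields a `GeoProxy` whose `country` field is `"United States"`; the address and place names here are made up for illustration.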
Buy me a coffee
Ask for details in Telegram or VK.
License
MIT