crawler-user-agents
crawler-user-agents copied to clipboard
adding IP ranges along with user-agent
I don't know how feasible is this request, and I don't how easily it can be implemented, but I thought that the user agent header is there to be forged.
For instance, in the case of Google, there is this page: https://developers.google.com/search/docs/crawling-indexing/verifying-googlebot
and this Json contains the IPs: https://developers.google.com/search/apis/ipranges/googlebot.json
it's a good idea. pull-request welcome!
FTR ip range of TikTok spider
From ttspider-feedback [email protected] The IP range is 47.128.0.0/16