tranco-list icon indicating copy to clipboard operation
tranco-list copied to clipboard

List generation fails when enabling Google Safe Browsing

Open Bender250 opened this issue 3 months ago • 1 comments

I have tried generating 2 lists differing only in checking the box "Only include domains not flagged as dangerous by Google Safe Browsing". The one with the Google Safe Browsing option enabled time-outed.

  • Failed list: https://tranco-list.eu/list/NNYWW/300000
  • Successful list: https://tranco-list.eu/list/7N7LX/300000

My experience with Google Safe Browsing is that it allows only 10k queries per day, so unless Google granted Tranco an increased quota, this option should not be configurable for so large lists. Or maybe you got an exception from Google and it expired?

Bender250 avatar Dec 10 '25 14:12 Bender250

Seems like this issue is the same as https://github.com/DistriNet/tranco-list/issues/10, suggesting it is around for 3 years. I just recommend removing "Only include domains not flagged as dangerous by Google Safe Browsing" checkbox. In our crawls, we anyway run the safe browsing check just before the crawl and we typically filter less than 1 website per 10k of list (when using CrUX), it is not so important feature.

Bender250 avatar Dec 10 '25 14:12 Bender250