github-crawler icon indicating copy to clipboard operation
github-crawler copied to clipboard

need throttling options

Open vincent-fuchs opened this issue 6 years ago • 2 comments

Summary

in some cases, rate limitation may be in place and pretty restrictive, even for authenticated API requests.

We would need a throttling config param, under which we can specify :

  • the throttling period in seconds
  • the max number of repositories we can process during the throttling period.

if we cross the limit, we need to wait for the throttling period before we continue.

While we're at it, we can move the existing crawlInParallel property under the throttling property, as it's related.

Type of Issue

It is a :

  • [x] request

Your Environment

  • Version used: 1.0.11
  • OS and version:
  • Version of libs used:

vincent-fuchs avatar Nov 13 '18 12:11 vincent-fuchs

Moreover, the standard rate-limit headers of GitHub & Gitlab could be supported: X-RateLimit-Limit / X-RateLimit-Remaining / X-RateLimit-Reset

  • https://docs.gitlab.com/runner/configuration/rate_limiting.html
  • https://developer.github.com/v3/rate_limit/

Lucas-C avatar Feb 27 '20 13:02 Lucas-C

Moreover, the standard rate-limit headers of GitHub & Gitlab could be supported: X-RateLimit-Limit / X-RateLimit-Remaining / X-RateLimit-Reset

  • https://docs.gitlab.com/runner/configuration/rate_limiting.html
  • https://developer.github.com/v3/rate_limit/

yes, that's the idea - interested in contributing ? ;-)

vincent-fuchs avatar Feb 27 '20 19:02 vincent-fuchs