billow icon indicating copy to clipboard operation
billow copied to clipboard

[WIP] Populate each service's data separately

Open BenMusch opened this issue 3 years ago • 2 comments

Right now, billow updates can take hours (or days) because of the size and rate limiting on our account. If a single stage in a job fails, the entire update fails and prevents any data from syncing.

This PR refactors the system so that each AWS service's data updates in its own job, so EC2 queries failing won't impact RDS data collection. A full initial build is still required on startup so we don't end up with an incomplete view of the data, but afterwards the services will update incrementally

Reviewers

@jtai @erluoli

BenMusch avatar Mar 23 '21 18:03 BenMusch

Sadly this does not seem to actually improve things in testing 😞 Will leave it running for a couple days to see

BenMusch avatar Mar 26 '21 03:03 BenMusch

Update: Does seem to be helping after some time. I also tweaked the refresh rate to be lower to avoid hammering the API

BenMusch avatar Mar 26 '21 17:03 BenMusch