Utilizing the wealth of Deps.dev information
Deps.dev has a lot of existing data crawled from open source packages. Ideally, we'd be able to use the information available there as a source for analysis and aggregation.
There are two modes in which a large data set like Deps.dev can be used.
- One is as a standard collector: it emits documents about what it knows, taken from the BigQuery table, and those documents get ingested into GUAC.
- Another, which may be useful, is as a data augmenter (similar to certifiers): based on information already in GUAC, it emits documents that augment the corresponding data in GUAC.
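To make the difference between the two modes concrete, here is a minimal Python sketch. It is not GUAC's actual collector or certifier interface: `Document`, `collect`, `augment`, and the sample rows are all hypothetical names and made-up data, and the row shape only stands in for whatever an export from the BigQuery table would look like.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class Document:
    """Hypothetical stand-in for a document handed to ingestion."""
    source: str
    package: str
    payload: dict


def collect(rows: Iterable[dict]) -> Iterator[Document]:
    """Collector mode: emit a document for every row in the data set,
    regardless of what is already known downstream."""
    for row in rows:
        yield Document(source="deps.dev", package=row["package"], payload=row)


def augment(known_packages: Iterable[str], lookup: dict) -> Iterator[Document]:
    """Augmenter mode: emit documents only for packages that are
    already known, skipping packages the data set has nothing on."""
    for package in known_packages:
        row = lookup.get(package)
        if row is not None:
            yield Document(source="deps.dev", package=package, payload=row)


# Made-up rows; the field names are assumptions, not the real schema.
SAMPLE_ROWS = [
    {"package": "pkg:example/a", "licenses": ["MIT"]},
    {"package": "pkg:example/b", "licenses": ["Apache-2.0"]},
]

if __name__ == "__main__":
    # Collector: everything in the data set becomes a document.
    for doc in collect(SAMPLE_ROWS):
        print("collected", doc.package)

    # Augmenter: only packages already known produce documents.
    lookup = {row["package"]: row for row in SAMPLE_ROWS}
    for doc in augment(["pkg:example/b", "pkg:example/missing"], lookup):
        print("augmented", doc.package)
```

In this sketch the collector's output grows with the size of the data set, while the augmenter's output is bounded by what is already known, which is one of the trade-offs the evaluation would need to weigh.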
Additional evaluation needs to be done to determine the best way to go forward with using Deps.dev information.