the-algorithm
Fake data for features testing
I am looking through the features SQL here: https://github.com/twitter/the-algorithm/tree/main/src/scala/com/twitter/simclusters_v2/scio/bq_generation/sql
I am wondering if you guys could release some fake data, or at least the schemas of the tables those queries reference.
It would also be quite helpful if you could give us a sense of how big each of those tables is.
I am working on an open-source distributed SQL query engine (https://github.com/marsupialtail/quokka if interested) and want to test-run some of those queries. I'm also curious in general!