Fake data for features testing

Open • marsupialtail opened this issue 2 years ago • 0 comments

I am looking through the features SQL here: https://github.com/twitter/the-algorithm/tree/main/src/scala/com/twitter/simclusters_v2/scio/bq_generation/sql

I am wondering if you could release some fake data, or at least the schemas of the tables those queries reference.

It would also be quite helpful if you could give us a sense of how big each of those tables is.

I am working on an open source distributed SQL query engine (https://github.com/marsupialtail/quokka if interested) and want to test run some of those queries. Also curious in general!

marsupialtail, Apr 01 '23 04:04