text2sql-data
text2sql-data copied to clipboard
A collection of datasets that pair questions with SQL queries.
Hi @jkkummerfeld, Thanks for open-sourcing the code and datasets. The google drive links for Academic (MAS), IMDB, Yelp, and Scholar databases are not publicly accessible. Could you please share them...
It appears to me that primary key information in [geography-schema.csv](https://github.com/jkkummerfeld/text2sql-data/blob/master/data/geography-schema.csv) is wrong. For example, how can `state_name` be the primary key in the `mountain` table?
It's a little annoying that the anonymised variable names sometimes but not always correspond to the table/column name they come from. E.g in some datasets like academic, the variable name...