data-analysis icon indicating copy to clipboard operation
data-analysis copied to clipboard

Results 9 data-analysis issues
Sort by recently updated
recently updated
newest added

Adds support for passing a file within a local zip file (path to zip_file provided via `zip_file`, zipped filename from default `file`)

https://www.dolthub.com/repositories/dolthub/us-housing-prices-v2 Latest housing price bounty data must contain recent sales. Mortgage interest rates have gone up massively in the past 6 months. Can you see an effect in the recent...

https://www.dolthub.com/repositories/dolthub/hospital-price-transparency-v3 Try to join or group similar codes based on description and price. See if we can identify a set of "super codes" that describe the universe of procedures/prices better.

https://www.dolthub.com/repositories/dolthub/hospital-price-transparency-v3 Find all the prices in this databases that are way outside the distribution on the high side. Trim to common procedures. Create a top X list. Investigate some of...

By this query of all the Van Gogh's in our database, it looks like we have pretty poor coverage. ``` $ dolt sql -q "SELECT institution_name, institution_city, title FROM objects...

It would be fun to write a "weirdness" detection algorithm for the museum collections and then find the weirdest objects and rate the weirdest collection.

There is a bit of a controversy about how many items are in storage at museums. https://podcasts.apple.com/us/podcast/dragon-psychology-101/id1119389968?i=1000478450374 We could use our museum collections database: https://www.dolthub.com/repositories/dolthub/museum-collections to estimate the square footage...

We collected prisons data in this bounty database: https://www.dolthub.com/repositories/dolthub/us-jails In the `jails` table there is a `num_inmates_rated_for` and in the `inmate_population_snapshots` there are population snapshots. This query is interesting: ```...