Josh Rosen
I implemented a confusion matrix in e6eb6e846cc9192ab81e19eee724e7af5d1204bc. I'll upload a snapshot of the database to Dropbox in a little bit so you can test it out. I'd still like to...
f118f4d0692b5919bbe303a243662ce00f9709eb added support for viewing a sample of misclassified images. Here's a link to a recent build of the stats_db: https://www.dropbox.com/s/t5zg9dqb0eoe6lv/stats_db.zip Unzip that file and use that directory as the...
I think there's a bug in the confusion matrix's image display because the images' classes displayed in the histograms don't match up with the classifications reported by the table. I'll...
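To illustrate the kind of consistency check that would catch a mismatch like this, here is a minimal sketch in plain Python. It is not the code from the commits above: the sample tuples, labels, and helper names (`confusion_matrix`, `misclassified`) are hypothetical. The idea is that the confusion table and the list of misclassified images are both derived from the same (true label, predicted label) pairs, so the off-diagonal counts in the table should equal the number of images shown as misclassified.

```python
from collections import Counter


def confusion_matrix(pairs):
    """Count occurrences of each (true_label, predicted_label) pair."""
    return Counter(pairs)


def misclassified(samples):
    """Return the samples whose predicted label differs from the true label."""
    return [s for s in samples if s[1] != s[2]]


# Hypothetical data: (image_id, true_label, predicted_label)
samples = [
    ("img1", "cat", "cat"),
    ("img2", "cat", "dog"),
    ("img3", "dog", "dog"),
    ("img4", "dog", "cat"),
    ("img5", "cat", "dog"),
]

matrix = confusion_matrix((true, pred) for _, true, pred in samples)
errors = misclassified(samples)

# Sum of the off-diagonal cells: every pair where true != predicted.
off_diagonal = sum(n for (true, pred), n in matrix.items() if true != pred)

# If the table and the image display disagree, this check fails.
assert off_diagonal == len(errors)
```

If the displayed images and the table are built from different queries or joins, a check of this shape is one way to narrow down which side is wrong.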
Just curious: what pain points have you encountered that require you to publish different artifacts for different Spark versions? For Databricks' own [spark-redshift](https://github.com/databricks/spark-redshift) library, I've configured our tests to build...
I wouldn't read too much into `spark-packages`' API compatibility indicators; the methodology used to generate them isn't very precise: it would flag your project as incompatible with Spark 1.3.0,...
+1. I'd really appreciate it if you published pyqver on PyPI, or at least added a setup.py so that I could install pyqver in one line using pip (`pip install...
Regarding the proposed removal of `IsolatedClientLoader`, I am concerned that doing so might break some users' ability to interface with very old Hive metastores: Although newer Hive client versions are...
The latest stats_db dump should have the data for this: https://www.dropbox.com/s/t5zg9dqb0eoe6lv/stats_db.zip It should contain everything needed to begin exploring different clustering techniques. Here was my first attempt at using...
Note that I see the above output on both 1.6.0 and 1.6.1.
I managed to work around this using a trick similar to https://github.com/tomerfiliba/plumbum/issues/124#issuecomment-54664378, using `sudo[ls]['arg with spaces']`, but I'm pretty confused by the behavior that I saw above and would appreciate...