Results 53 comments of Kevin Kuo

just hit this too. workaround is to do the ordering in a subquery. note that in my case the ordering column is double, not int, which might be relevant

@hannes reprex below plz lmk if bug or expected behavior ``` create table foo(a real, b real); insert into foo values (1, 69), (1, 420), (2, 69), (2, 420); select...

re: the postgres error, perhaps the correct syntax is
```
select distinct on(a) a, b from foo order by a, b desc;
```
which should yield the same result... but...

from a bit of googling i can't find any other sql impl that has `DISTINCT ON`, so i'm guessing we got the inspiration from postgres and it would make sense...
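
for engines without `DISTINCT ON`, here is a rough sketch of a portable equivalent for the `foo` table from the reprex above. it assumes the intent is "one row per `a`, keeping the largest `b`", and it uses Python's built-in `sqlite3` only as a convenient stand-in engine; the function name `distinct_on_emulated` is made up for illustration and is not from any of the projects discussed here.

```python
import sqlite3


def distinct_on_emulated():
    """Return one row per value of a, keeping the largest b."""
    conn = sqlite3.connect(":memory:")
    # Same table and rows as the reprex in the thread.
    conn.execute("create table foo(a real, b real)")
    conn.execute("insert into foo values (1, 69), (1, 420), (2, 69), (2, 420)")
    # GROUP BY a with MAX(b) picks the top b within each group of a,
    # which is what the DISTINCT ON query is presumably meant to return here.
    rows = conn.execute(
        "select a, max(b) as b from foo group by a order by a"
    ).fetchall()
    conn.close()
    return rows


print(distinct_on_emulated())
```

this only lines up with `DISTINCT ON` because there are just two columns. with more selected columns you'd need something like a `row_number()` window function or a correlated subquery to pick the whole row.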

@hannes checking in to see if any other info would help w/ further discussion here thanks

Yes, when Spark 2.4 is released, we'll have it implemented. https://issues.apache.org/jira/browse/SPARK-21898

https://issues.apache.org/jira/browse/SPARK-21898

@Fooourche it should be possible right now to do "embarrassingly parallel" hyperparameter tuning with `spark_apply()`. We'll try to put some documentation together. Distributed training (training TF models on big data...

@ashishrjha could you try with a smaller grid and data first? E.g. ``` grid