Kevin Kuo
Kevin Kuo
just hit this too, workaround is to do the ordering in a subquery, note in my case the ordering column is double not int which might be relevant
@hannes reprex below plz lmk if bug or expected behavior ``` create table foo(a real, b real); insert into foo values (1, 69), (1, 420), (2, 69), (2, 420); select...
re: the postgres error perhaps the correct syntax is ``` select distinct on(a) a, b from foo order by a, b desc; ``` which should yield the same result... but...
from a bit of googling i can't find any other sql impl that has `DISTINCT ON`, so i'm guessing we got the inspiration from postgres and it would make sense...
@hannes checking in to see if any other info would help w/ further discussion here thanks
Yes, when Spark 2.4 is released, we'll have it implemented. https://issues.apache.org/jira/browse/SPARK-21898
https://issues.apache.org/jira/browse/SPARK-21898
What version are you using?
@Fooourche it should be possible right now to do "embarrassingly parallel" hyperparameter tuning with `spark_apply()`. We'll try to put some documentation together. Distributed training (training TF models on big data...
@ashishrjha could you try with a smaller grid and data first? E.g. ``` grid