Kevin Kuo comments

Results 53 comments of


                                            Kevin Kuo

SELECT DISTINCT ON does not return the right rows

just hit this too, workaround is to do the ordering in a subquery, note in my case the ordering column is double not int which might be relevant

SELECT DISTINCT ON does not return the right rows

@hannes reprex below plz lmk if bug or expected behavior ``` create table foo(a real, b real); insert into foo values (1, 69), (1, 420), (2, 69), (2, 420); select...

SELECT DISTINCT ON does not return the right rows

re: the postgres error perhaps the correct syntax is ``` select distinct on(a) a, b from foo order by a, b desc; ``` which should yield the same result... but...

SELECT DISTINCT ON does not return the right rows

from a bit of googling i can't find any other sql impl that has `DISTINCT ON`, so i'm guessing we got the inspiration from postgres and it would make sense...

SELECT DISTINCT ON does not return the right rows

@hannes checking in to see if any other info would help w/ further discussion here thanks

Kolmogorov-Smirnov test

Yes, when Spark 2.4 is released, we'll have it implemented. https://issues.apache.org/jira/browse/SPARK-21898

Kolmogorov-Smirnov test

https://issues.apache.org/jira/browse/SPARK-21898

Error: `x` and `y` must share the same src, set `copy` = TRUE (may be slow)

What version are you using?

Train Tensor Flow model with sparklyr

@Fooourche it should be possible right now to do "embarrassingly parallel" hyperparameter tuning with `spark_apply()`. We'll try to put some documentation together. Distributed training (training TF models on big data...

ML Pipeline Error: org.apache.spark.SparkException: Exception thrown in awaitResult:

@ashishrjha could you try with a smaller grid and data first? E.g. ``` grid