Debasish Das
    val regression = new OLSMultipleLinearRegression()
    regression.setNoIntercept(true)
    regression.newSampleData(Y, X)
    val params = regression.estimateRegressionParameters()
This solver does not scale beyond roughly 1024 features...Do you use nonlinear CG somewhere? I have a...
Closes #201
I am interested in adding KalmanFilter and more state space algorithms like RNN for time series...Let me know if you are still looking...
We can add a Forecast API...the idea is that we use one-step-ahead prediction and then add the predicted point back to the time series to predict the next step...as the prediction...
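A minimal sketch of the recursive scheme described above, in plain Scala: predict one step ahead, append the predicted point to the series, and repeat. The names `RecursiveForecast`, `forecast`, `OneStep` and `ar1` are hypothetical and chosen only for illustration; they are not an existing API of any library discussed here, and the fixed-coefficient AR(1) predictor is a stand-in for whatever fitted one-step model would actually be used.

```scala
object RecursiveForecast {
  // Hypothetical one-step-ahead model: maps the series seen so far to the next value.
  type OneStep = Vector[Double] => Double

  // Multi-step forecast built from repeated one-step predictions.
  // Each predicted point is appended to the series before the next prediction.
  def forecast(history: Vector[Double], steps: Int, predictNext: OneStep): Vector[Double] = {
    require(history.nonEmpty, "history must contain at least one observation")
    require(steps >= 0, "steps must be non-negative")
    val (_, predictions) =
      (0 until steps).foldLeft((history, Vector.empty[Double])) {
        case ((series, acc), _) =>
          val next = predictNext(series)
          (series :+ next, acc :+ next)
      }
    predictions
  }

  // Toy predictor (an assumption for this sketch): AR(1) with a fixed coefficient phi.
  def ar1(phi: Double): OneStep = series => phi * series.last
}
```

For example, `RecursiveForecast.forecast(Vector(1.0, 2.0), 3, RecursiveForecast.ar1(0.5))` yields three forecast points. Because each step consumes earlier predictions rather than observations, any one-step error is carried forward into later steps.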
I would like to use it for Cassandra access through storehaus...will it be merged to master soon?
I am confused: why is Cascading showing up in Cassandra StoreHaus? Cassandra does not use any Cascading, right? StoreHaus should not be using it either.
I think we should not mix Cascading and storehaus Cassandra...storehaus can be used by compute fabrics like Spark, Flink and Summingbird, including frontend APIs...
Right now in my experiments I am finding it slower than Spark SQL. None of the datasets is cached. Is this expected?
Example query:
Spark SQL
    select * from table where col1="x" and col2="y" limit 1;
IndexedRDD
    val indexedkv = IndexedRDD(df.map{row=>(row.col1+row.col2,value)})
    indexedkv.lookup("x" + "y")
I will be surprised if redis is not...
I will run the same experiments with caching...I am not yet sure if Redis caches all the data in memory or not...Most likely it does, for read optimization.