training
training copied to clipboard
Training materials for Strata, AMP Camp, etc
Hey guys, I m trying to run the command: ./spark-ec2 -i -k --copy launch amplab-training from your Big-Data Mini Course. I have downloaded my US East key pair and also...
I followed the GraphX tutorial at http://ampcamp.berkeley.edu/big-data-mini-course/graph-analytics-with-graphx.html on a local stand-alone cluster (Spark version 0.9.0) with two workers. Somehow, the graph.triplets is not returning what it should. ``` scala> graph.vertices.toArray...
Issue encountered when following the exercises at http://ampcamp.berkeley.edu/4/exercises/data-exploration-using-shark.html. Ran the following query on a single reducer: select page_name, sum(page_views) as views from wikistats_mem group by page_name having views > 200000;...
I think there is a good chance that we might have too many modules for the average user to complete in a few hours. Should we update the instructions in...
When is python going to become available for mlbase? Or R? Would be nice to have exercises/tutorials in python/mlbase.
At AMP Camp 3 some folks "didn't get the email" because they registered for the event using a different email address than the one they were checking at AMP Camp...
In the Java version of example 6 in http://spark-summit.org/2013/exercises/machine-learning-with-spark.html, `tempDist` is used in the conditional of the while loop but declared inside the loop.
The k-means example has a challenge exercise: > Challenge Exercise: The K-Means implementation uses a groupBy and mapValues to compute the new centers. This can be optimized by using a...