spark-ec2 icon indicating copy to clipboard operation
spark-ec2 copied to clipboard

Documentation incorrect regarding missing "ec2" directory

Open matthewadams opened this issue 7 years ago • 3 comments

The documentation appears to be incorrect in at least the branch-1.6 & branch-2.0 branches. At https://github.com/amplab/spark-ec2#launching-a-cluster, the doc says "Go into the ec2 directory in the release of Apache Spark you downloaded." Problem is, there is no ec2 directory in the Spark distribution.

http://stackoverflow.com/a/38882774/969237 says "Download the official ec2 directory as detailed in the Spark 2.0.0 documentation." (in Edit 2). Problem is, the official Spark documentation (now at 2.1), at http://spark.apache.org/docs/latest/, links to https://github.com/amplab/spark-ec2, which takes me right back here. No help.

I'm suspecting that what was formerly the ec2 directory in an Apache Spark distribution is now the root directory of https://github.com/amplab/spark-ec2, but I'm not familiar enough with this stuff to know.

Please update the documentation so that I can follow the installation instructions.

matthewadams avatar Feb 28 '17 02:02 matthewadams

Yes - the contents of EC2 directory in Spark is now in the root of this repository. Would you be interested in opening a PR updating the documentation ?

shivaram avatar Mar 03 '17 04:03 shivaram

@matthewadams You got any resolution for your problem above?

aditya-mittal avatar Jul 13 '17 11:07 aditya-mittal

As correctly pointed out by @matthewadams, the ec2 folder is missing in the spark 2.2.0. Well, the ec2 folder was present in previous versions of spark.

Solution: Create a directory named ec2 inside downloaded spark and then clone this repository (https://github.com/amplab/spark-ec2) inside ec2 directory of downloaded spark.

aditya-mittal avatar Jul 18 '17 08:07 aditya-mittal