spark-ec2 icon indicating copy to clipboard operation
spark-ec2 copied to clipboard

Is there a handy Hadoop cluster running?

Open biolearning opened this issue 9 years ago • 3 comments

With Spark cluster launched and started with ec2 script, is there the Hadoop cluster ready to go, or just an easy command to start it? The reason I am asking for this is - I would like to enable log and save that in hadoop FS which is accessible for Spark workers?

biolearning avatar Nov 07 '16 09:11 biolearning

Yes - there should be a HDFS installation on /root/ephemeral-hdfs

shivaram avatar Nov 07 '16 16:11 shivaram

Thanks for your reply, @shivaram, is it started by default? I guess not, then what is the command I can start the Hadoop cluster?

biolearning avatar Nov 08 '16 01:11 biolearning

It should be started by default. If you want to reconfigure / restart it you can read the Hadoop docs for the appropriate version.

shivaram avatar Nov 08 '16 03:11 shivaram