mrjob
mrjob copied to clipboard
Is there any way to connect to a remote Hadoop cluster?
Can't find any trace of this in the docs.
Can I run a Python script with mrjob on my laptop, and have it connect to a remote Hadoop cluster over VPN, run the mapreduce job there, get the results back on my local system? I can connect to any Hadoop TCP port just fine from my laptop.
Is there a config or code example for this?
Thanks!
Facing the same issue.