ecosystem icon indicating copy to clipboard operation
ecosystem copied to clipboard

TensorFlow on Hadoop YARN

Open tslam75 opened this issue 7 years ago • 11 comments

Hadoop YARN is a commonly deployed cluster manager. Having the ability to run TensorFlow on YARN would be very useful in such environment.

Our team is currently working on a YARN application for this purpose, and would like to contribute our work here. We will provide more details of our contribution soon.

-Jason

tslam75 avatar Feb 16 '17 05:02 tslam75

Thanks! That'd be really useful.

jhseu avatar Feb 16 '17 18:02 jhseu

@tslam75 Have you looked at https://issues.apache.org/jira/browse/YARN-6043?

zhe-thoughts avatar Feb 17 '17 22:02 zhe-thoughts

@zhe-thoughts Thanks for the reference! Looked over YARN-6043, and both uses a native application master for TensorFlow.

Attaching a design document here now. We also have an implementation based on this design, and will publish the code soon.

TensorFlow_on_YARN.pdf

tslam75 avatar Feb 22 '17 22:02 tslam75

Sorry for the delay.

Created pull request #39 while waiting for the CLA to be signed.

tslam75 avatar Mar 21 '17 21:03 tslam75

Awesome job :O

IDerr avatar Apr 11 '17 14:04 IDerr

In Hadoop 3.0, YARN native services can support running Tensorflow services on YARN without adding any dependencies or implement a new YARN application master.

Please see our blogpost: https://hortonworks.com/blog/distributed-tensorflow-assembly-hadoop-yarn/ and let me know if you have any questions. Thanks!

leftnoteasy avatar Apr 17 '17 20:04 leftnoteasy

focus ...

tbchj avatar Apr 21 '17 09:04 tbchj

@tslam75 Does your 'TensorFlow on Yarn' support fault tolerance ? If yes, how?

butterluo avatar May 14 '17 13:05 butterluo

mark

zhanglistar avatar Dec 25 '17 07:12 zhanglistar

+1

oliverhu avatar Feb 01 '18 19:02 oliverhu

We (LinkedIn Hadoop team) just open sourced TonY: Repo: https://github.com/linkedin/TonY Blog post: https://engineering.linkedin.com/blog/2018/09/open-sourcing-tony--native-support-of-tensorflow-on-hadoop

Comments / discussions very welcome!

zhe-thoughts avatar Sep 27 '18 05:09 zhe-thoughts