Zhe Zhang
@tslam75 Have you looked at https://issues.apache.org/jira/browse/YARN-6043?
We (the LinkedIn Hadoop team) just open-sourced TonY. Repo: https://github.com/linkedin/TonY Blog post: https://engineering.linkedin.com/blog/2018/09/open-sourcing-tony--native-support-of-tensorflow-on-hadoop Comments and discussion are very welcome!
Thanks @davzaman for the contribution and @matthewdeng for the initial review! Ping @richardliaw for the question "should we ensure backwards compatible support for PTL < 1.6?"
@sven1977 @simonsays1980 please follow up. Thanks
@rickyyx @rkooo567 Please get this merged into master first (master is unfrozen for now). Thanks
This is a great conclusion. Thanks @raulchen. Overall, I think we should be using prototypes more for REPs. cc @ericl and @scv119 for prototyping future REPs from Anyscale Ray...
> Support TorchTitan for TP + PP parallelism

Is there a timeline for this? Thanks
@jjyao Minor: you meant for the title to be "Priority based" right? (Or, "Prioritizer based"?)