Lee Yang
@tgaddair Here is the NVTabular PR with a higher-level API to abstract away the Readers and a reduced image size (18.1GB vs. 37GB) for `Dockerfile.test.gpu`.
@EnricoMi Thanks for the review!

> What do you mean with `reduced image size (18.1GB vs. 37GB) for Dockerfile.test.gpu.`?

This was just a reference to an earlier version of the...
Yes, it should run. Note: there were some minor bugs per #585.
1. When using the "built-in" TensorBoard server in TFoS (triggered by supplying `tensorboard=True`), the TB server is hosted in the "chief" worker, so it has the same lifecycle as the...
@orwa-te I'm not aware of anything within TensorFlow that monitors cross-node traffic.
This was mostly to account for different APIs and behaviors between different versions of TensorFlow, since different users of TFoS were on different versions of TensorFlow. Anyhow, if you set...
Yes, it is the complete model, and unfortunately, this is just how TF works at the moment.
Have you seen the [wiki docs](https://github.com/yahoo/TensorFlowOnSpark/wiki) which explain the architecture details of TensorFlowOnSpark along with the [FAQ](https://github.com/yahoo/TensorFlowOnSpark/wiki/Frequently-Asked-Questions)? Anyhow, at the most general level, TFoS basically just helps to stand up...
Yes, in theory, [it should be supported](https://github.com/yahoo/TensorFlowOnSpark/blob/master/tensorflowonspark/TFCluster.py#L225), especially when using InputMode.TENSORFLOW (where Spark is really just being used as a cluster/resource manager to start up the nodes of your TF...
Pretty sure [this](https://www.tensorflow.org/tutorials/distribute/parameter_server_training#clusters_in_the_real_world) should work, since [we use TF_CONFIG to set up the cluster](https://github.com/yahoo/TensorFlowOnSpark/blob/master/tensorflowonspark/TFSparkNode.py#L377-L384).
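For reference, here is a rough sketch of what a `TF_CONFIG` value looks like for one node of a cluster. This is not the TFoS code itself: the `build_tf_config` helper and the `host0:2222`-style addresses are hypothetical and only illustrate the JSON layout (a `cluster` map of job names to addresses, plus a `task` entry identifying the current node) that TensorFlow reads from the environment.

```python
import json
import os


def build_tf_config(cluster, job_name, task_index):
    """Return a TF_CONFIG JSON string for one node.

    Hypothetical helper for illustration only; it is not part of TFoS.
    """
    if job_name not in cluster:
        raise ValueError(f"unknown job name: {job_name}")
    if not 0 <= task_index < len(cluster[job_name]):
        raise ValueError(f"task index {task_index} out of range for {job_name}")
    return json.dumps({
        "cluster": cluster,
        "task": {"type": job_name, "index": task_index},
    })


# Hypothetical addresses; in practice each node gets its own host:port.
cluster = {
    "chief": ["host0:2222"],
    "worker": ["host1:2222", "host2:2222"],
    "ps": ["host3:2222"],
}

# Every node shares the same "cluster" map but sets its own "task" entry.
os.environ["TF_CONFIG"] = build_tf_config(cluster, "worker", 1)
```

Each node in the cluster would export its own variant of this value before any TensorFlow distribution strategy or cluster resolver is created.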