ecosystem
ecosystem copied to clipboard
Integration of TensorFlow with other open-source frameworks
I run distributed mnist on k8s. 1 ps and 3 works. After a hour, the status of pods are: NAME READY STATUS RESTARTS AGE distributed-mnist-ps-0-fz4gw 1/1 Running 0 1h distributed-mnist-worker-0-l4nv5...
I followed the instructions from _**https://github.com/tensorflow/ecosystem/tree/master/kubernetes**_ to run mnist sample on kubernetes with 2 worker and 2 ps. Then I got below errors. Do you have any idea what's the...
18/04/20 11:34:53 ERROR ApplicationMaster: User class threw exception: java.io.IOException: Cannot run program "/usr/local/python3.6/bin/python3": error=2, No such file or directory java.io.IOException: Cannot run program "/usr/local/python3.6/bin/python3": error=2, No such file or directory
Now we try to build the docker image for hdfs supported and it fails all the time. It is easy to re-produce by following these commands. ``` git clone https://github.com/tensorflow/ecosystem...
As I have seen that here we are deploying Tensorflow on Kubernetes under Google Cloud Platform and it would be great if we could add Kubernetes deployment steps under AWS...
Since we wrote below code in the parameter server part: `server.join()` the parameter server could not stop itself when the training finishes unless we kill the process. do you have...
Bumps [protobuf-java](https://github.com/protocolbuffers/protobuf) from 3.16.1 to 3.16.3. Release notes Sourced from protobuf-java's releases. Protobuf Release v3.16.3 Java Refactoring java full runtime to reuse sub-message builders and prepare to migrate parsing logic...
Bumps org.apache.spark:spark-core_2.12 from 3.0.0 to 3.3.3. [](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a...
why spark-tensorflow-connector can't support double and integer data type.