mesh
mesh copied to clipboard
Can you go across multiple nodes?
Is it possible to use devices that are on different machines? For example, in Horovod I can specify the IP addresses of multiple machines and do data parallelism across them. However, this requires me to specifically have MPI setup on each machine. It's unclear to me if this can be done with TF Mesh. Maybe with a tf.train.clusterspec and the parameter server model??
Thanks. -Tony
Did you found a solution for this? @toponado
Is there a response to this question?