Mesh TensorFlow: Model Parallelism Made Easier
@nikip: this is resolved, right?
mtf.expand_dims can be implemented in terms of stack; mtf.squeeze can be implemented in terms of reduce_sum plus some sanity checks.
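As a rough illustration of the idea in this issue, here is a NumPy sketch (an analogy only, not the actual Mesh TensorFlow implementation): stacking a one-element list inserts a new size-1 dimension, and summing over a size-1 dimension removes it without changing any values.

```python
import numpy as np

def expand_dims_via_stack(x, axis=0):
    # stack of a single-element list inserts a new size-1 axis at `axis`
    return np.stack([x], axis=axis)

def squeeze_via_reduce_sum(x, axis=0):
    # sanity check: only a size-1 axis can be squeezed this way
    assert x.shape[axis] == 1, "can only squeeze a size-1 dimension"
    # summing over a length-1 axis drops it and leaves values unchanged
    return np.sum(x, axis=axis)

x = np.arange(6).reshape(2, 3)
y = expand_dims_via_stack(x, axis=1)   # shape (2, 1, 3)
z = squeeze_via_reduce_sum(y, axis=1)  # shape (2, 3), equal to x
```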
Dear authors, I have read the code of auto-mesh. I found that when calculating the memory consumption for a given schedule, it only includes the memory consumed by the forward pass, but...
Is it possible to incorporate [MultiworkerMirroredStrategy](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/distribute/experimental/MultiWorkerMirroredStrategy) into Mesh TF? I would like to run model + data parallelism on a supercomputer that has multiple GPUs on multiple nodes. It seems...
Hi there, Thanks for creating this framework. I was trying to run the transformer example provided in the README.md and I realized some files are missing in the repository. Could...
[Travis is now recommending removing the __sudo__ tag](https://blog.travis-ci.com/2018-11-19-required-linux-infrastructure-migration): "_If you currently specify __sudo: false__ in your __.travis.yml__, we recommend removing that configuration_".
Hi, does MTF support overlapping meshes? For example, for an NN model with 6 layers, I want to parallelize the first three layers with a 1D mesh and the remaining three with a 2D...
Currently, broadcasting semantics aren't the same as in regular TensorFlow.
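For context on the difference: regular TensorFlow (like NumPy) broadcasts by aligning trailing axes positionally, whereas Mesh TensorFlow dimensions are named and binary ops match dimensions by name rather than position. A small NumPy sketch of the positional rule that Mesh TF does not follow:

```python
import numpy as np

# NumPy / regular TensorFlow align shapes from the trailing axis:
# (4, 1) and (3,) -> (4, 1) vs (1, 3) -> broadcast result (4, 3).
a = np.ones((4, 1))
b = np.ones((3,))
c = a + b
print(c.shape)  # (4, 3)
```

In Mesh TensorFlow, by contrast, a tensor's shape is a list of named Dimensions, so whether two tensors combine depends on their dimension names, not their axis positions.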
The [mtf_transformer](https://github.com/tensorflow/tensor2tensor/blob/v1.9.0/tensor2tensor/mesh_tensorflow/mtf_transformer.py#L749) in Tensor2Tensor defaults to a mesh configuration for TPUs that uses 32 cores (4 Cloud TPUs). I wasn't able to find documentation on utilizing more than a...
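For readers unfamiliar with the mesh configuration mentioned above: Mesh TensorFlow describes a mesh as a ";"-separated string of name:size pairs (e.g. "model:4;batch:8"), and the product of the sizes must equal the number of cores. A minimal parsing sketch (the helper below is hypothetical, but the string format follows mtf conventions):

```python
def mesh_size(mesh_shape):
    """Return the total core count implied by a mesh-shape string.

    e.g. "model:4;batch:8" -> 4 * 8 = 32 cores.
    """
    total = 1
    for pair in mesh_shape.split(";"):
        if pair:
            _, size = pair.split(":")
            total *= int(size)
    return total

print(mesh_size("model:4;batch:8"))  # 32
```

Scaling beyond 32 cores would then mean choosing a mesh-shape string whose sizes multiply to the larger core count, together with a layout mapping tensor dimensions onto those mesh dimensions.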