Fei Wang
Makes sense. I have added broadcasting to Lantern. Maybe I will look into rank-0 tensors soon as well. :)
Thanks for clearing that up! If you don't mind, could you address a few more questions related to the implementation of the GPipe arXiv paper? From the StackedRecurrent here: https://github.com/tensorflow/lingvo/blob/master/lingvo/core/gpipe.py#L443...
@bignamehyp Thanks for your reply! I have been digging into the codebase (especially lingvo/core/recurrent.py and lingvo/core/gpipe.py) and have learned a lot about GPipe. I am also trying to re-create...
It would be great if more guidance (tutorials) could be offered for running GPipe on image classification models, such as the AmoebaNet models evaluated in the GPipe arXiv paper...
I am curious about this too. In fact, I am trying to re-do the benchmarks mentioned in the GPipe arXiv paper and GPipe blog (https://ai.googleblog.com/2019/03/introducing-gpipe-open-source-library.html) but on GPUs. However, I...
> > I am curious about this too. In fact, I am trying to re-do the benchmarks mentioned in the GPipe arXiv paper and GPipe blog (https://ai.googleblog.com/2019/03/introducing-gpipe-open-source-library.html) but on GPUs....
> > > > I am curious about this too. In fact, I am trying to re-do the benchmarks mentioned in the GPipe arXiv paper and GPipe blog (https://ai.googleblog.com/2019/03/introducing-gpipe-open-source-library.html) but...
Flare needs updates to work with this PR
-- Parse the format string (list of characters) into an AST.
-- Example: "%d,%s" → (FInt (FOther ',' (FString FEnd)))
format : List Char -> Format
format ('%' :: 'd'...