Vadim Dabravolski

Results 7 comments of Vadim Dabravolski

Sharing an implementation of working TF Config for MultiNodeMirroredStrategy below. This has been tested on SageMaker Deep Learning container with TensorFlow v2.8 (link to [dockerfile](https://github.com/aws/deep-learning-containers/blob/master/tensorflow/training/docker/2.8/py3/cu112/Dockerfile.gpu)). ```python def _build_tf_config(): hosts =...

See reported D2 issue: https://github.com/facebookresearch/detectron2/issues/1566

Was interested in the same questions, @usuyama. See excerpt from Megatron paper. Does look like MegatronHF will require some updates on HF side. ![image](https://user-images.githubusercontent.com/505262/99604907-93898180-29d4-11eb-829e-a06870b841e0.png)

The following worked for me (adding "super" call in spider init): ``` def __init__(self, *a, **kw): super(NewsSpider, self).__init__(*a, **kw) ```

Hi, adding +1 to @AlpinDale. We are interested to experiment with MLP speculator, specifically, on latest Llama3.1 models. Excellent work overall @JRosenkranz !

@pietro-fragnito hi, was this issue resolved? Appreciate if you could share the solution.