DeepLearningExamples issues

How to convert Tacotron2 to Libtorch

I'm not good at Python so I want to inference Tacotron by libtorch C++ How to convert Tacotron2 to Libtorch ?

Updates README.md to include --save_ckpt

Adds `--save_ckpt` in the above documentation. I spent time training assuming that checkpoints would be saved by default in the pipeline, so I think adding this to the documentation would...

aksg87

[BART] Documentation references run_pretraining.py that doesn't exist

2

Related to **Model/Framework(s)** *BART*'s documentation **Describe the bug** The documentation to your [BART example](https://github.com/NVIDIA/DeepLearningExamples/blob/master/PyTorch/LanguageModeling/BART/README.md) is unclear and ambiguous about whether it supports pre-training or not. In my opinion you should...

Lauler

bug

[EfficientNetV2/Tensorflow2] oom during training

2

I'm using the training script from https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow2/Classification/ConvNets/efficientnet_v2/S/training/AMP/convergence_8xA100.sh on my A100-80G node, no changes of parameters I am getting lot of errors about ```yml 7: [1,5]: File "/usr/local/lib/python3.8/dist-packages/tensorflow/python/eager/execute.py", line 59, in...

ZJLi2013

bug

[BERT/PyTorch] How to get accuracy in prediciton mode?

2

Related to **BERT/PyTorch** **Describe the bug** A clear and concise description of what the bug is. I want to get exact_match and f1 score when doing prediction. I changed some...

gieflij

bug

Update README.md

1

Kxvish

[ConvNet/PyTorch] SyncBatchNorm not Used in ConvNet ImageNet Classification

Related to **ConvNet/PyTorch** *(e.g. GNMT/PyTorch or FasterTransformer/All)* **Describe the bug** SyncBatchNorm is not used in the script https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Classification/ConvNets. Is this a potential bug when DDP is activated? I make this...

YuchuanTian

bug

[Fastpitch] Multi-speaker model changes output speaker identity for different texts

28

Hi, We are trying to train a multi-speaker model starting from the LibriTTS data and using the latest FastPitch commit. We selected the 50 speakers which have the most utterances...

adrianastan

bug

[Bert/Pytorch] During pretraining, checkpoints won't be saved automatically.

3

Related to **Bert/Pytorch** **Describe the bug** After running a long period, for example, after 200,000 iterations, there will be some skipped steps. Such skipped steps are counted into the total...

Itok2000u

bug

[BERT/PyTorch] Unable to reproduce bert benchmark under A100

3

Hi, I have notice that on A100 80G, bert Phase1 and Phase 2 can have a throughput of 853 and 289 sequences/sec respectively. ![image](https://user-images.githubusercontent.com/24752948/175768130-19f74769-30e6-441c-a87b-b4a01dddcd9d.png) I want to reproduce this result...

wchen61

bug

DeepLearningExamples
DeepLearningExamples copied to clipboard

Metadata

How to convert Tacotron2 to Libtorch

Updates README.md to include --save_ckpt

[BART] Documentation references run_pretraining.py that doesn't exist

[EfficientNetV2/Tensorflow2] oom during training

[BERT/PyTorch] How to get accuracy in prediciton mode?

Update README.md

[ConvNet/PyTorch] SyncBatchNorm not Used in ConvNet ImageNet Classification

[Fastpitch] Multi-speaker model changes output speaker identity for different texts

[Bert/Pytorch] During pretraining, checkpoints won't be saved automatically.

[BERT/PyTorch] Unable to reproduce bert benchmark under A100

← Metadata

Owner

Metadata

DeepLearningExamples DeepLearningExamples copied to clipboard

Metadata

← Metadata

Owner

Metadata

DeepLearningExamples
DeepLearningExamples copied to clipboard