dingevin
This is the utilization rate of the GPU; sometimes it was zero. But I have timed the data pipeline and found it to be very fast, about 0.2 s per step when...
That's the TensorFlow timeline profiling: [timeline_3.json](https://share.weiyun.com/5jGchyp)
I once suspected the TensorFlow version and tried many versions of tf-nightly-gpu, but it had no effect. My current env is: **tf_nightly_gpu-1.13.0.dev20181210**, **CUDA-9.0**, **CUDNN: 7.3.0**. is...
It looks like much time is spent before the forward and backward passes, e.g. in the op 'RandomStandardNormal', but I don't know how to fix it.
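The timeline file above is in Chrome trace-event format, so it can be inspected offline to rank ops by total wall time and confirm whether ops like `RandomStandardNormal` really dominate. A minimal sketch (the file path is a placeholder for the downloaded timeline):

```python
import json
from collections import defaultdict

def slowest_ops(trace_path, top_n=5):
    """Sum wall time per op name from a Chrome-trace timeline JSON
    (the format TensorFlow's timeline.Timeline emits) and return the
    top_n ops by total duration in microseconds."""
    with open(trace_path) as f:
        events = json.load(f)["traceEvents"]
    totals = defaultdict(int)
    for ev in events:
        # Complete-duration events have ph == "X" and carry a "dur" field.
        if ev.get("ph") == "X" and "dur" in ev:
            totals[ev.get("name", "?")] += ev["dur"]
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:top_n]

# Usage (path is hypothetical):
# for name, us in slowest_ops("timeline_3.json"):
#     print(name, us)
```

If one op accounts for most of the step time, that narrows the search considerably before touching the model code.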
Both sync mode and async mode have been tried:

```
bazel-bin/lingvo/trainer --run_locally=gpu --mode=sync --model=asr.librispeech.Librispeech960Grapheme --logdir=/data/dingzhenyou/speech_data/librispeech/log/ --logtostderr --enable_asserts=false
```

```
bazel-bin/lingvo/trainer --run_locally=gpu --mode=async --model=asr.librispeech.Librispeech960Grapheme --logdir=/data/dingzhenyou/speech_data/librispeech/log/ --logtostderr --enable_asserts=false --job=controller,trainer
```

Am I...
I have updated tf-nightly, and the training speed is still very slow. Here is my env: [tf_env.txt](https://github.com/tensorflow/lingvo/files/3099748/tf_env.txt) and here is the training log; each step still takes 7~8 s: [nohup.txt](https://github.com/tensorflow/lingvo/files/3100102/nohup.txt) that training...
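The "7~8 s per step" figure can be extracted from the log rather than eyeballed. A rough sketch, assuming log lines of a hypothetical form like `step 100 took 7.5s` (the regex would need adapting to the actual nohup.txt format):

```python
import re

def mean_step_seconds(log_text):
    """Average seconds-per-step from log lines of the (assumed) form
    '... step N took 7.8s ...'. Returns None if no matches are found."""
    times = [float(m) for m in re.findall(r"took ([0-9.]+)s", log_text)]
    return sum(times) / len(times) if times else None
```

Averaging over many steps also makes it easy to see whether the slowness is uniform or dominated by occasional stalls (which would point at input-pipeline hiccups rather than compute).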
Thanks, I will try more advanced machines.
@datavizweb I have been training lingvo on a Tesla V100 lately, but the training speed is no faster. @jonathanasdf From my experiments, GPU utilization was very high, but each step still...
I have tested the disk I/O but didn't see any bottleneck; here is the test info: [librispeech.log](https://github.com/tensorflow/lingvo/files/3186944/librispeech.log). I have also placed the training data on a shared server to test network...
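For reference, a quick sanity check of sequential read throughput can be done without external tools. A minimal sketch (note the OS page cache makes this an optimistic upper bound, not a true disk spec; sizes here are illustrative):

```python
import os
import tempfile
import time

def sequential_read_mb_per_s(size_mb=64, chunk_mb=4):
    """Write a size_mb temp file, then time reading it back in
    chunk_mb chunks and return the read throughput in MB/s."""
    chunk = os.urandom(chunk_mb * 1024 * 1024)
    with tempfile.NamedTemporaryFile(delete=False) as f:
        for _ in range(size_mb // chunk_mb):
            f.write(chunk)
        path = f.name
    start = time.perf_counter()
    with open(path, "rb") as f:
        # Read until EOF; an empty bytes object ends the loop.
        while f.read(chunk_mb * 1024 * 1024):
            pass
    elapsed = time.perf_counter() - start
    os.remove(path)
    return size_mb / elapsed
```

If local reads are fast but training from the shared server is slow, that would implicate the network path rather than the disk.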