Anton Bakhtin comments

Results: 21 comments of Anton Bakhtin

Benchmark on 1 Billion

I've finally added information about different models. Check out https://github.com/yandex/faster-rnnlm#experiments

Benchmark on 1 Billion

The model is downloaded from the benchmark's repo. I used a quite straightforward interpolation, i.e. mean per-word probability, as described in the benchmark's paper.
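As a rough illustration of the interpolation mentioned above, here is a minimal sketch that averages the per-word probabilities of two models and computes the perplexity of the mixture. The function name, the equal 0.5 weight, and the toy probabilities are assumptions made for illustration; none of them come from the benchmark's code.

```python
import math

def interpolated_perplexity(probs_a, probs_b, weight=0.5):
    """Perplexity of a linear mixture of two models' per-word probabilities.

    probs_a and probs_b hold the probability each (hypothetical) model
    assigned to the same sequence of test words.
    """
    if not probs_a or len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same non-empty word list")
    log_sum = 0.0
    for p_a, p_b in zip(probs_a, probs_b):
        # Mean per-word probability when weight == 0.5.
        mixed = weight * p_a + (1.0 - weight) * p_b
        log_sum += math.log(mixed)
    return math.exp(-log_sum / len(probs_a))

# Toy per-word probabilities from two hypothetical models.
rnn_probs = [0.10, 0.02, 0.30]
ngram_probs = [0.05, 0.04, 0.10]
print(interpolated_perplexity(rnn_probs, ngram_probs))
```

With weight=0.5 this is the plain mean of the two probabilities for each word; other weights would have to be tuned on held-out data.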

NCE training (with help GPU)

Hi! 1. -direct 1000 requires 1000 * 1000000 * 4 (float) ~= 4GB of memory on your GPU. As a result, the maxent weights fail to be copied to your GPU. A...
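The memory estimate can be checked with a few lines of arithmetic. This sketch assumes that the -direct value counts maxent weights in millions and that each weight is a 4-byte float; the helper name is hypothetical.

```python
BYTES_PER_FLOAT = 4  # single-precision weights

def maxent_bytes(direct_millions):
    # Assumption: the -direct value is the maxent layer size in millions of weights.
    return direct_millions * 1_000_000 * BYTES_PER_FLOAT

size = maxent_bytes(1000)
print(f"{size / 1e9:.1f} GB")  # prints "4.0 GB"
```

Any GPU with less free memory than this cannot hold the maxent table, regardless of how small the rest of the network is.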

NCE training (with help GPU)

That's weird. Does the rnnlm actually learn anything in less than a second? Does the validation entropy decrease?

NCE training (with help GPU)

Yeap, that's normal. Validation for Hierarchical Softmax (HS) is a few orders of magnitude faster than for NCE. The problem with NCE is that nobody guarantees that the predicted probabilities would be stochastic, i.e....

NCE validation uses only simple operations (like matrix multiplication) and could be efficiently implemented on GPU. As for training, some operations work faster on GPU (matrix multiplication) and some operations...
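To illustrate why unnormalized NCE outputs make validation more expensive, the sketch below contrasts raw exponentiated scores, which need not sum to one, with explicitly normalized ones, which need a pass over the whole vocabulary for every predicted word. The scores are made up and the function names are hypothetical; this is not faster-rnnlm's implementation.

```python
import math

def unnormalized_probs(scores):
    # What an NCE-trained output layer yields directly: exp(score) per word.
    return [math.exp(s) for s in scores]

def normalized_probs(scores):
    # Explicit softmax over the whole vocabulary: O(V) work per position.
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Made-up output scores for a four-word vocabulary.
scores = [-1.2, -0.3, -2.5, -0.9]
raw = unnormalized_probs(scores)
print(sum(raw))                       # generally not equal to 1
print(sum(normalized_probs(scores)))  # 1 up to rounding error
```

With a real vocabulary the normalizing sum runs over every word, which is the kind of dense computation that maps well onto matrix multiplication.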

the binary format of output

Hi! There is no flag to use a text format. All the information about the network is stored in the NNet class. Once it is constructed, you can dump it in a...

the binary format of output

In Kaldi, faster-rnnlm is used for rescoring only. For instance, check out the librispeech recipe: https://github.com/kaldi-asr/kaldi/blob/master/egs/librispeech/s5/run.sh#L333

Build in ubuntu 16.04

Which compiler version do you use? Can you try g++-4.8?

Undefined Reference Errors when running make

Is this fixed?