Rohan Badlani
A lower learning rate or a lower CTC loss weight should work better. Since it is warmstarting from a pretrained checkpoint, a lower lr like 1e-4 works fine (with both ctc loss...
If you warmstarted with a very high lr, like the lr=10e-3 you mentioned, then the attention module can break, and even removing the prior will not help....
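A minimal sketch of what that warmstart setup might look like, assuming a standard PyTorch training loop; the checkpoint path, loss names, and the 0.1 weight are illustrative placeholders, not the repo's actual config:

```python
import torch
from torch import nn

# Stand-in module; the real TTS model from the repo goes here.
model = nn.Linear(80, 80)

# Warmstart from the pretrained checkpoint (path is illustrative,
# and assumes the checkpoint file exists).
checkpoint = torch.load("pretrained.pt", map_location="cpu")
model.load_state_dict(checkpoint["state_dict"], strict=False)

# Lower lr for warmstarting (1e-4 rather than 10e-3), plus a
# down-weighted CTC alignment loss so the pretrained attention
# module is not disrupted early in fine-tuning.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
ctc_loss_weight = 0.1  # illustrative value; tune for your data

def combined_loss(mel_loss, ctc_loss, gate_loss):
    # Hypothetical loss terms named after those discussed above.
    return mel_loss + ctc_loss_weight * ctc_loss + gate_loss
```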
Hmmm... I think I see the problem -- if you look at the gate loss, there is a spike around 575k (which I'm guessing is the point when you removed the prior?...
Yes... BUMP on this -- there is no generator.py in models/
Adding the implementation for directed graphs as well.
Added tests in tests/tests-TUNGraph.cpp and tests/tests-TNGraph.cpp that follow the same convention as the other tests. They cover basic functionality using a small test graph. SUNet Id: **rbadlani**
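A minimal sketch of what one such directed-graph test might look like, following the gtest style of SNAP's existing test files; the exact graph and assertions in the PR may differ:

```cpp
#include <gtest/gtest.h>
#include "Snap.h"

// Basic-functionality test for the directed graph class TNGraph,
// in the same style as the repo's other tests.
TEST(TNGraph, BasicFunctionality) {
  PNGraph Graph = TNGraph::New();
  // Small test graph: 3 nodes, 2 directed edges (1->2, 2->3).
  Graph->AddNode(1);
  Graph->AddNode(2);
  Graph->AddNode(3);
  Graph->AddEdge(1, 2);
  Graph->AddEdge(2, 3);
  EXPECT_EQ(3, Graph->GetNodes());
  EXPECT_EQ(2, Graph->GetEdges());
  EXPECT_TRUE(Graph->IsEdge(1, 2));
  // Directed graph: the reverse edge was never added.
  EXPECT_FALSE(Graph->IsEdge(2, 1));
}
```

The reverse-edge assertion is what distinguishes this from the TUNGraph (undirected) case, where IsEdge(2, 1) would hold as well.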