Julia A
Results
3
comments of
Julia A
I believe that this is just a reporting difference. The internal (hidden) states should be the same. You can confirm this by looking at when subsequent spikes occur.
Oh nice! Great news! Is this part of main?
Wow, that is a lot faster! For LearningDense is there or will there be a similar way to update dw? As a future "nice to have" suggestion, it would be...