Varuna Jayasiri
Looks like it's not compatible with a newer version of OpenCV.
The ordering is different. So it won't affect training from scratch, but you can't load a model trained with a different ordering.
It's easier to code
Thanks. It's a bug
Fixed it here https://github.com/labmlai/annotated_deep_learning_paper_implementations/commit/09d09379c2169eac06662e17cb9969dc6e48e36a
All the code is in this repo.
I think, without it, the current approach can lead to overflow in the accumulator and logsum when a block's max is significantly lower than the previous global max, as it...
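To make the overflow concern concrete, here is a minimal sketch of a block-wise log-sum-exp that rescales against a running global max. It is not the repository's code: the function name `blockwise_logsumexp`, its parameters, and the guess that the comment is about taking the max of the previous global max and the block max are all assumptions for illustration.

```python
import math


def blockwise_logsumexp(scores, block_size):
    """Streaming log-sum-exp over a non-empty list, one block at a time.

    Hypothetical helper for illustration only.
    """
    running_max = -math.inf  # largest score seen so far
    running_sum = 0.0        # sum of exp(score - running_max) seen so far
    for start in range(0, len(scores), block_size):
        block = scores[start:start + block_size]
        block_max = max(block)
        # Take the max with the previous global max. The new max is never
        # lower than the old one, so every exponent below is <= 0 and
        # exp() cannot overflow.
        new_max = max(running_max, block_max)
        # Rescale the old accumulator to the new reference point.
        running_sum *= math.exp(running_max - new_max)
        # Add this block's contribution relative to the same reference.
        running_sum += sum(math.exp(s - new_max) for s in block)
        running_max = new_max
    return running_max + math.log(running_sum)
```

If the accumulator were instead rescaled to the block's own max, a block whose max is far below the previous global max would give a large positive exponent such as `exp(1000 - (-1000))`, which overflows.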
Our implementation has sequence first. PyTorch LSTM used that and in our initial implementations we used C B H and just continued with it. B C D is more commonly...
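For readers moving data between the two layouts, the sketch below shows the axis swap on plain nested lists. The helper name `seq_first_to_batch_first` and the axis names are assumptions for illustration. With PyTorch tensors, the same swap would typically be a `permute(1, 0, 2)` call, and `nn.LSTM` also accepts a `batch_first` argument.

```python
def seq_first_to_batch_first(x):
    """Convert nested lists shaped [seq_len, batch, d] to [batch, seq_len, d].

    Hypothetical helper for illustration only.
    """
    seq_len = len(x)
    batch = len(x[0])
    # For each batch element, collect its feature vector at every time step.
    return [[x[t][b] for t in range(seq_len)] for b in range(batch)]


# Sequence-first input: 2 time steps, 3 batch elements, 2 features each.
seq_first = [
    [[1, 2], [3, 4], [5, 6]],
    [[7, 8], [9, 10], [11, 12]],
]
batch_first = seq_first_to_batch_first(seq_first)
```

The values are unchanged; only the order of the first two axes differs, so `batch_first[b][t]` equals `seq_first[t][b]`.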
You can create pull requests with new contributions. We've had a few people contributing new paper implementations and improving/fixing existing implementations.
Sorry for the really late reply. Been really busy. Hoping to spend a little time on this project in the next few weeks (mostly cleaning up code and fixing)