keras-attention-augmented-convs Does not work when training

Does not work when training

Open rezazad68 opened this issue 6 years ago • 2 comments

trafficstars

Hi,

When I build a model and use the attntion-augmented conv as a first layer and then several convolutions and max polling layer it is fine when compiling the model. However, in training, it has an error for both Adam and SGD optimizers. it looks like the code has issues. not working in any setup for training . The main problem is for an input with size (64, 128, 1) inside the attention-augmented code it makes a 6-dimensional tensor which is more than 1B parameters!!!! I believe the code needs a small change

Jun 10 '19 07:06 rezazad68

If you could post the stack trace that would be more helpful.

Jun 10 '19 12:06 titu1994

Could you post a code snippet where this issue occurs?

Jun 20 '19 14:06 titu1994

keras-attention-augmented-convs keras-attention-augmented-convs copied to clipboard

Does not work when training

keras-attention-augmented-convs
keras-attention-augmented-convs copied to clipboard