seq2seq icon indicating copy to clipboard operation
seq2seq copied to clipboard

Bucketed model with shared weights

Open ishalyminov opened this issue 7 years ago • 1 comments

Is there a way to share internal layers' weights of a Seq2Seq (preferably AttentionSeq2Seq) model in order to make a bucketed training of those, or it's supposed to be implemented by the user?:)

ishalyminov avatar Nov 14 '16 10:11 ishalyminov

@farizrahman4u I've just managed to:

  1. create a model with explicitly specified input_length, output_length for one bucket
  2. save its weights
  3. create a model for another bucket (again, setting its input and output lengths explicitly)
  4. loaded the 1st model's weights into it
  5. fitted some data -- and nothing crashed:)

What is the meaning of input_length, output_length if they don't affect the topology? And is the way described above OK for training bucketed models?

ishalyminov avatar Nov 14 '16 11:11 ishalyminov