Why use the output of the first decoder layer in the ACT model?

Open junhui1997 opened this issue 2 years ago • 0 comments

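In the ACT model, the decoder output is taken with index 0: hs = self.transformer(src, None, self.query_embed.weight, pos, latent_input, proprio_input, self.additional_pos_embed.weight)[0]. Should this index be -1, so that the output of the last decoder layer is used instead?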
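To make the question concrete, here is a minimal sketch of what the two indices would select, assuming the transformer call returns one output per decoder layer, stacked in layer order. That return layout is an assumption for illustration, not something confirmed here. `run_decoder_stack` is a hypothetical toy stand-in, not the ACT code, and each "layer" simply adds 1 so the outputs are easy to tell apart.

```python
def run_decoder_stack(x, num_layers):
    """Hypothetical toy decoder stack: returns one output per layer, in order."""
    intermediate = []
    out = list(x)
    for _ in range(num_layers):
        # Stand-in for a real decoder layer: add 1 to every element.
        out = [v + 1 for v in out]
        intermediate.append(list(out))
    return intermediate


layer_outputs = run_decoder_stack([0.0, 0.0], num_layers=4)

first_layer = layer_outputs[0]   # what index 0 selects under this assumption
last_layer = layer_outputs[-1]   # what index -1 would select

print(first_layer)  # [1.0, 1.0]
print(last_layer)   # [4.0, 4.0]
```

If the call instead returned a tuple such as (decoder_output, memory), index 0 would pick the whole decoder output rather than a single layer, so the answer depends on what the transformer actually returns.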

Jan 17 '24 13:01 junhui1997