earth-forecasting-transformer icon indicating copy to clipboard operation
earth-forecasting-transformer copied to clipboard

Question about hierarchical Structure

Open Titolasanta opened this issue 1 year ago • 1 comments

Hi, researching your paper I read that the used architecture utilizes a hierarchical Structure in a similar way as in the Hourglass transformer ( https://arxiv.org/abs/2110.13711 ). But for your case you are utilizing a encoder-decoder transformer, so how do you reshape what would usually be the input for your decoders which consists of both the output of the encoder which I would consider to be downsampled and the original input, which would be in the original input shape?

Thank you

Titolasanta avatar Jan 11 '24 01:01 Titolasanta

Thank you for your interest and your question. It may help to refer to the following piece of our implementation: https://github.com/amazon-science/earth-forecasting-transformer/blob/7732b03bdb366110563516c3502315deab4c2026/src/earthformer/cuboid_transformer/cuboid_transformer.py#L3192-L3195 The decoder typically takes as input the outputs from each layer of the encoder, as well as a dummy input named initial_z that matches the shape of the encoder's final layer output.

gaozhihan avatar Jan 11 '24 03:01 gaozhihan