TFC-pretraining
Where is the 3-layer 1-D ResNet?
![image](https://user-images.githubusercontent.com/59809602/230899112-ae270b5c-619d-4f5f-a2e7-ce02f2b5662e.png)
![image](https://user-images.githubusercontent.com/59809602/230899158-ddf2c1ed-6f11-4759-9eeb-2a16e4dee72e.png)
In the paper, you mentioned using ResNet, but there is a Transformer in the code.
The built-in PyTorch implementation is used, whose input is expected to be shaped (seq_len, N, D), but you pass it an (N, 1, seq_len) tensor.
Is this the reason why the Transformer is not as good as the ResNet?
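To make the concern concrete, here is a minimal sketch of the shape mismatch (the sizes below are hypothetical, not the repo's actual ones). With `batch_first=False` (the default), `nn.TransformerEncoder` interprets dim 0 as sequence positions, so feeding it an (N, 1, seq_len) tensor makes self-attention mix across the N samples in the batch rather than across time steps:

```python
import torch
import torch.nn as nn

# Hypothetical sizes for illustration only
batch, channels, seq_len = 4, 1, 178

# PyTorch's built-in encoder (batch_first=False by default) expects
# input shaped (seq_len, batch, d_model)
layer = nn.TransformerEncoderLayer(d_model=seq_len, nhead=2)
encoder = nn.TransformerEncoder(layer, num_layers=2)

x = torch.randn(batch, channels, seq_len)  # the (N, 1, seq_len) tensor
out = encoder(x)

# The shape is preserved, so no error is raised, but the encoder has
# treated the batch dim (4) as the sequence and seq_len (178) as d_model
print(out.shape)  # torch.Size([4, 1, 178])
```

If this reading is right, each whole time series acts as a single "token" embedding, and attention operates across samples instead of within each series.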