Haus
Hi, thank you for the innovative work. Would you mind explaining why the fused bias (b in Eq. 6) is zero? Why don't we let the bias become a...
For question 2, I think it is used to reduce the weighted channels in each group to obtain the attention map $a$.
Hi, I just found that the paper mentions this part. However, for depthwise convolution, shouldn't we check for groups == in_channels instead of groups == out_channels in the code...
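To illustrate why the two checks can disagree, here is a minimal sketch in plain Python. It assumes the usual definition of depthwise convolution (one group per input channel, with out_channels a multiple of in_channels). The function names are hypothetical and are not taken from the repository's code.

```python
def is_depthwise(in_channels: int, out_channels: int, groups: int) -> bool:
    """Assumed definition: one group per input channel, and
    out_channels is an integer multiple of in_channels."""
    return groups == in_channels and out_channels % in_channels == 0


def check_by_out_channels(out_channels: int, groups: int) -> bool:
    """Hypothetical stand-in for a check that compares groups to out_channels."""
    return groups == out_channels


# Equal channel counts: both checks agree.
same = (is_depthwise(32, 32, 32), check_by_out_channels(32, 32))

# Channel multiplier of 2 (32 -> 64 channels, groups=32): still depthwise
# under the assumed definition, but the out_channels check rejects it.
multiplied = (is_depthwise(32, 64, 32), check_by_out_channels(64, 32))

print(same, multiplied)
```

If the layers in question always have in_channels == out_channels, the two checks are equivalent, so the difference only matters when a channel multiplier is used.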
parser.add_argument('--data', type=str, default='ETTh1', help='data')
parser.add_argument('--root_path', type=str, default='./data', help='root path of the data file')
parser.add_argument('--data_path', type=str, default='ETTh1.csv', help='data file')
Please confirm that these parameters are correct.
The parameters are wrong: --root_path and --data_path should be the paths to your own data.
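One quick way to verify these arguments is to resolve the file path and check that it exists before training. The sketch below reuses the three arguments quoted above. It assumes the script joins --root_path and --data_path to locate the file, which is not confirmed by the thread, and the helper names build_parser and resolve_data_file are hypothetical.

```python
import argparse
import os


def build_parser() -> argparse.ArgumentParser:
    # The three arguments quoted in the question, with the same defaults.
    parser = argparse.ArgumentParser()
    parser.add_argument('--data', type=str, default='ETTh1', help='data')
    parser.add_argument('--root_path', type=str, default='./data',
                        help='root path of the data file')
    parser.add_argument('--data_path', type=str, default='ETTh1.csv',
                        help='data file')
    return parser


def resolve_data_file(args: argparse.Namespace) -> str:
    # Assumption: the data file lives at <root_path>/<data_path>.
    return os.path.join(args.root_path, args.data_path)


# An empty list means "use the defaults" instead of reading sys.argv.
args = build_parser().parse_args([])
data_file = resolve_data_file(args)

if not os.path.isfile(data_file):
    print(f"Data file not found: {data_file}. "
          "Check --root_path and --data_path.")
```

If the message is printed, pass the directory and file name of your own dataset, for example `--root_path /path/to/dir --data_path my_data.csv` (placeholder values).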