beauty-snowman

Results: 1 comment by beauty-snowman

I'd like to know: if self-attention is built from convolutions and trained on a small dataset, will it degrade performance the way a transformer does?