PETR
PETR copied to clipboard
ablation study about numbers of decoder layer
Hi,I‘m curious about transformer decoder work mechanism here, have you make some ablation study about numbers of decoder layer yet?