DINO
DINO copied to clipboard
Visualize encoder-decoder multi-head attention weights
Are you able to visualize encoder-decoder multi-head attention weights with the deformable attention similar to what was done in the original DETR paper? I know the Deformable DETR paper was able to do this but they did not post their code or respond to issues on their repo.