
Visualizations of attention maps in depth cross-attention

Open yangfan293 opened this issue 1 year ago • 0 comments

Hello, may I ask whether the visualization in Figure 5 is drawn directly from `attn_output_weights.sum(dim=1) / num_heads` of the depth cross-attention layer, i.e. the attention weights averaged over heads? The attention maps I get from my own trained model look very different from yours.
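For concreteness, here is a minimal sketch of the computation described above. It assumes a plain `torch.nn.MultiheadAttention` layer standing in for the depth cross-attention; the tensor sizes, variable names, and the feature-map shape are hypothetical and are not taken from the MonoDETR code. Whether Figure 5 was actually produced this way is the open question.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical sizes for illustration only (not MonoDETR's configuration).
batch, num_queries, embed_dim, num_heads = 2, 5, 32, 4
feat_h, feat_w = 6, 10  # assumed spatial size of the depth feature map

# Stand-in for the depth cross-attention layer (an assumption).
attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
attn.eval()

queries = torch.randn(batch, num_queries, embed_dim)             # object queries
depth_tokens = torch.randn(batch, feat_h * feat_w, embed_dim)    # flattened depth features

with torch.no_grad():
    # average_attn_weights=False keeps one weight map per head:
    # shape (batch, num_heads, num_queries, feat_h * feat_w)
    _, attn_output_weights = attn(
        queries, depth_tokens, depth_tokens,
        need_weights=True, average_attn_weights=False,
    )

# Average over the head dimension, as in the expression from the question.
avg_weights = attn_output_weights.sum(dim=1) / num_heads

# Reshape each query's weights back to the 2D feature-map layout for plotting.
attn_maps = avg_weights.reshape(batch, num_queries, feat_h, feat_w)
```

Each `attn_maps[b, q]` can then be passed to something like `matplotlib.pyplot.imshow` to draw the map for one query. Differences in which layer, which query, or which normalization is used for plotting could each change how the picture looks.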

yangfan293, Apr 14 '23 15:04