yolov7_d2
yolov7_d2 copied to clipboard
What is the reason why Swin-T only outputs features 1,2,3 and not 0?
I saw from papers and architectures that YOLO only takes 3 feature maps (YOLOv4, 5, 6). Since Swin-T outputs four feature maps, why is that only three features are used here? IS there a way to use all of them or are they necessary?
you can specific any output features
Does that mean I can specify four output features? it seems like YOLO only takes three. is there a better way to use all features with YOLO without loss of information?
Was specifically referring to how to accomodate 4 features for YOLOv6 since original YOLOv6 only takes three feature maps