[MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection]

Open heitorrapela opened this issue 1 year ago • 1 comments

Hello, thanks for the nice list,

Please consider adding our recent work: https://arxiv.org/abs/2404.18849

We propose Mixed Patches (MiPa), combined with a patch-wise domain-agnostic module that learns a common representation of both modalities (RGB/infrared). The method is built on top of DINO (https://github.com/IDEA-Research/DINO).

We have a repository for the code, but the work is still under review, so we plan to release it soon here: https://github.com/heitorrapela/MiPa

heitorrapela avatar Jun 30 '24 20:06 heitorrapela

Email received, thank you.

hust-lidelong avatar Jun 30 '24 20:06 hust-lidelong

Thanks for your help. We have updated this project.

Yangzhangcst avatar Jul 05 '24 00:07 Yangzhangcst