ECCV2022-Papers-with-Code
ECCV2022-Papers-with-Code copied to clipboard
欢迎分享ECCV 2024 论文和代码 / Welcome to share the paper and code of ECCV 2024
[The format of the issue] Paper name/title: Project link: Paper link: Code link:
Domain: OCR Paper name/title: Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors Paper link: https://arxiv.org/pdf/2312.05286 Code link: https://github.com/SJTU-DeepVisionLab/FreeReal
Paper title: milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing Paper link: https://arxiv.org/abs/2306.17010 Code link: https://github.com/Toytiny/milliFlow/
Domain: MLLM Paper title: ControlCap: Controllable Region-level Captioning Paper link: https://arxiv.org/abs/2401.17910 Code link: https://github.com/callsys/ControlCap
Paper name: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries Project link: https://zhang-tao-whu.github.io/projects/DVIS_DAQ/ Paper link: https://arxiv.org/abs/2404.00086 Code link: https://github.com/zhang-tao-whu/DVIS_Plus Features: New SOTA on YTVIS19, YTVIS21 and OVIS datasets.
Paper name/title: 3D Small Object Detection with Dynamic Spatial Pruning Project link: https://xuxw98.github.io/DSPDet3D/ Paper link: https://arxiv.org/abs/2305.03716 Code link: https://github.com/xuxw98/DSPDet3D
field: Image Generation + Diffusion Models Paper name/title: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models Paper link: https://arxiv.org/abs/2404.07389 Code link: https://github.com/YasminZhang/EBAMA
[Medical Image, Medical Image Segmentation] Paper title: Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging Paper link: https://arxiv.org/abs/2311.16914 Code link: https://github.com/peirong26/Brain-ID
[Medical Image, Medical Image Segmentation] Paper name/title: ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image Project link: https://scribbleprompt.csail.mit.edu/ Paper link: https://arxiv.org/abs/2312.07381 Code link: https://github.com/halleewong/ScribblePrompt
[Video Generation] Paper title: VideoStudio: Generating Consistent-Content and Multi-Scene Videos Project link: https://vidstudio.github.io/ Code link: https://github.com/FuchenUSTC/VideoStudio
Paper title: 4D Contrastive Superflows are Dense 3D Representation Learners Paper link: https://arxiv.org/abs/2407.06190 Code link: https://github.com/Xiangxu-0103/SuperFlow
[Low level vision] Paper title: Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization Project link: https://kaminyou.com/Dense-Normalization/ Paper link: https://arxiv.org/abs/2407.04245 Code link: https://github.com/Kaminyou/Dense-Normalization
[3D Visual Grounding] Paper title: Multi-branch Collaborative Learning Network for 3D Visual Grounding Paper link: https://arxiv.org/abs/2407.05363v2 Code link: https://github.com/qzp2018/MCLN
[NeRF + Vision Transformers + Self-Supervised Learning] Paper name/title: NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields Project link: https://nerf-mae.github.io/ Paper link: https://arxiv.org/pdf/2404.01300 Code link: https://github.com/zubair-irshad/NeRF-MAE
Domain: OCR paper title: PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer paper link: https://arxiv.org/abs/2407.07764 code link: https://github.com/SJTU-DeepVisionLab/PosFormer
Domain: 3D Object Detection Paper name/title: Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection Paper link: https://arxiv.org/abs/2402.03634 Code link: https://github.com/LiewFeng/RayDN
Domain: low-level Image Compression paper title: Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation code link: https://github.com/qingshi9974/ECCV2024-AdpatICMH paper link: http://arxiv.org/abs/2407.09853
Domain: Interpretable-by-Design Models, Unsupervised Part Discovery Paper Title: PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers Code Link: https://github.com/ananthu-aniraj/pdiscoformer Paper Link: https://arxiv.org/abs/2407.04538
Paper name/title: C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition Paper link: https://arxiv.org/abs/2407.06113 Code link: https://github.com/RongchangLi/ZSCAR_C2C
Paper name/title: AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking Paper link: https://arxiv.org/abs/2407.06468 Code link: https://github.com/ricklisz/AnatoMask
Domain: Object Detection, DETR Paper name/title: Relation DETR: Exploring Explicit Position Relation Prior for Object Detection Paper link: https://arxiv.org/abs/2407.11699v1 Code link: https://github.com/xiuqhou/Relation-DETR Dataset link: https://huggingface.co/datasets/xiuqhou/SA-Det-100k
Domain: Medical Imaging, Fairness learning Paper name/title: FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification Project link: https://ophai.hms.harvard.edu/datasets/harvard-fairdomain20k Paper link: https://arxiv.org/abs/2407.08813 Dataset link: https://drive.google.com/drive/u/1/folders/1huH93JVeXMj9rK6p1OZRub868vv0UK0O Code link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairDomain
Domain: 视觉和语言(Vision-Language), 视频理解(Video Understanding), Zero-Shot Learning(零样本学习)
Paper name/title:SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders Project link: N/A Paper link: https://arxiv.org/abs/2407.13460 Code link: https://github.com/pha123661/SA-DVAE
Thanks for your amazing work!
ZIGMA: A DiT-style Zigzag Mamba Diffusion Model
-
Paper: https://arxiv.org/abs/2403.13802
-
Code: https://taohu.me/zigma/
[Diffusion Models] Paper name/title: Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation Paper link: https://arxiv.org/abs/2403.16394 Code link: https://github.com/zdxdsw/skewed_relations_T2I
Domain: Object Detection Paper name/title: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector Project link: http://yuqianfu.com/CDFSOD-benchmark/ Paper link: https://arxiv.org/pdf/2402.03094 Code link: https://github.com/lovelyqian/CDFSOD-benchmark
Semantic Segmentation/ Medical Image Segmentation Paper name/title: Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures Paper link: https://arxiv.org/abs/2407.14754 Code link: https://github.com/cbmi-group/FFM-Multi-Decoder-Network
3D Registration / Visual Localization Paper Name: SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Paper Link: https://arxiv.org/abs/2404.10527 Code link: https://github.com/fraunhoferhhi/spvloc Project Link: https://fraunhoferhhi.github.io/spvloc/
Domain: Low-level Vision Paper name/title: OneRestore: A Universal Restoration Framework for Composite Degradation Project link: https://gy65896.github.io/projects/ECCV2024_OneRestore Paper link: https://arxiv.org/abs/2407.04621 Code link: https://github.com/gy65896/OneRestore
Domain: Object Counting Paper name/title: Zero-shot Object Counting with Good Exemplars Paper link: https://arxiv.org/abs/2407.04948 Code link: https://github.com/HopooLinZ/VA-Count
Domain: real-time rendering / glossy object modeling Paper name/title:REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices Project link: https://xdimlab.github.io/REFRAME/ Paper link: https://arxiv.org/abs/2403.16481 Code link: https://github.com/MARVELOUSJI/REFRAME