ECCV2022-Papers-with-Code Welcome to share papers and code from ECCV 2024

[The format of the issue] Paper name/title: Project link: Paper link: Code link:

Jul 03 '24 07:07 amusi

Domain: OCR Paper name/title: Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors Paper link: https://arxiv.org/pdf/2312.05286 Code link: https://github.com/SJTU-DeepVisionLab/FreeReal

Jul 03 '24 08:07 TongkunGuan

Paper title: milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing Paper link: https://arxiv.org/abs/2306.17010 Code link: https://github.com/Toytiny/milliFlow/

Jul 03 '24 14:07 Toytiny

Domain: MLLM Paper title: ControlCap: Controllable Region-level Captioning Paper link: https://arxiv.org/abs/2401.17910 Code link: https://github.com/callsys/ControlCap

Jul 04 '24 01:07 callsys

Paper name: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries Project link: https://zhang-tao-whu.github.io/projects/DVIS_DAQ/ Paper link: https://arxiv.org/abs/2404.00086 Code link: https://github.com/zhang-tao-whu/DVIS_Plus Features: New SOTA on YTVIS19, YTVIS21 and OVIS datasets.

Jul 04 '24 03:07 zhang-tao-whu

Paper name/title: 3D Small Object Detection with Dynamic Spatial Pruning Project link: https://xuxw98.github.io/DSPDet3D/ Paper link: https://arxiv.org/abs/2305.03716 Code link: https://github.com/xuxw98/DSPDet3D

Jul 05 '24 16:07 xuxw98

field: Image Generation + Diffusion Models Paper name/title: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models Paper link: https://arxiv.org/abs/2404.07389 Code link: https://github.com/YasminZhang/EBAMA

Jul 07 '24 05:07 YasminZhang

[Medical Image, Medical Image Segmentation] Paper title: Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging Paper link: https://arxiv.org/abs/2311.16914 Code link: https://github.com/peirong26/Brain-ID

Jul 07 '24 14:07 peirong26

[Medical Image, Medical Image Segmentation] Paper name/title: ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image Project link: https://scribbleprompt.csail.mit.edu/ Paper link: https://arxiv.org/abs/2312.07381 Code link: https://github.com/halleewong/ScribblePrompt

Jul 08 '24 02:07 halleewong

[Video Generation] Paper title: VideoStudio: Generating Consistent-Content and Multi-Scene Videos Project link: https://vidstudio.github.io/ Code link: https://github.com/FuchenUSTC/VideoStudio

Jul 09 '24 07:07 FuchenUSTC

Paper title: 4D Contrastive Superflows are Dense 3D Representation Learners Paper link: https://arxiv.org/abs/2407.06190 Code link: https://github.com/Xiangxu-0103/SuperFlow

Jul 09 '24 08:07 Xiangxu-0103

[Low level vision] Paper title: Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization Project link: https://kaminyou.com/Dense-Normalization/ Paper link: https://arxiv.org/abs/2407.04245 Code link: https://github.com/Kaminyou/Dense-Normalization

Jul 10 '24 12:07 Kaminyou

[3D Visual Grounding] Paper title: Multi-branch Collaborative Learning Network for 3D Visual Grounding Paper link: https://arxiv.org/abs/2407.05363v2 Code link: https://github.com/qzp2018/MCLN

Jul 11 '24 14:07 qzp2018

[NeRF + Vision Transformers + Self-Supervised Learning] Paper name/title: NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields Project link: https://nerf-mae.github.io/ Paper link: https://arxiv.org/pdf/2404.01300 Code link: https://github.com/zubair-irshad/NeRF-MAE

Jul 11 '24 14:07 zubair-irshad

Domain: OCR paper title: PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer paper link: https://arxiv.org/abs/2407.07764 code link: https://github.com/SJTU-DeepVisionLab/PosFormer

Jul 12 '24 11:07 TongkunGuan

Domain: 3D Object Detection Paper name/title: Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection Paper link: https://arxiv.org/abs/2402.03634 Code link: https://github.com/LiewFeng/RayDN

Jul 15 '24 03:07 LiewFeng

Domain: low-level Image Compression paper title: Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation code link: https://github.com/qingshi9974/ECCV2024-AdpatICMH paper link: http://arxiv.org/abs/2407.09853

Jul 16 '24 04:07 qingshi9974

Domain: Interpretable-by-Design Models, Unsupervised Part Discovery Paper Title: PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers Code Link: https://github.com/ananthu-aniraj/pdiscoformer Paper Link: https://arxiv.org/abs/2407.04538

Jul 16 '24 07:07 ananthu-aniraj

Paper name/title: C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition Paper link: https://arxiv.org/abs/2407.06113 Code link: https://github.com/RongchangLi/ZSCAR_C2C

Jul 16 '24 11:07 RongchangLi

Paper name/title: AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking Paper link: https://arxiv.org/abs/2407.06468 Code link: https://github.com/ricklisz/AnatoMask

Jul 16 '24 21:07 ricklisz

Domain: Object Detection, DETR Paper name/title: Relation DETR: Exploring Explicit Position Relation Prior for Object Detection Paper link: https://arxiv.org/abs/2407.11699v1 Code link: https://github.com/xiuqhou/Relation-DETR Dataset link: https://huggingface.co/datasets/xiuqhou/SA-Det-100k

Jul 18 '24 07:07 xiuqhou

Domain: Medical Imaging, Fairness learning Paper name/title: FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification Project link: https://ophai.hms.harvard.edu/datasets/harvard-fairdomain20k Paper link: https://arxiv.org/abs/2407.08813 Dataset link: https://drive.google.com/drive/u/1/folders/1huH93JVeXMj9rK6p1OZRub868vv0UK0O Code link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairDomain

Jul 18 '24 20:07 tianyu0207

Domain: Vision-Language, Video Understanding, Zero-Shot Learning

Paper name/title: SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders Project link: N/A Paper link: https://arxiv.org/abs/2407.13460 Code link: https://github.com/pha123661/SA-DVAE

Thanks for your amazing work!

Jul 19 '24 06:07 pha123661

ZIGMA: A DiT-style Zigzag Mamba Diffusion Model

Paper: https://arxiv.org/abs/2403.13802
Code: https://taohu.me/zigma/

Jul 19 '24 20:07 dongzhuoyao

[Diffusion Models] Paper name/title: Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation Paper link: https://arxiv.org/abs/2403.16394 Code link: https://github.com/zdxdsw/skewed_relations_T2I

Jul 21 '24 02:07 YasminZhang

Domain: Object Detection Paper name/title: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector Project link: http://yuqianfu.com/CDFSOD-benchmark/ Paper link: https://arxiv.org/pdf/2402.03094 Code link: https://github.com/lovelyqian/CDFSOD-benchmark

Jul 23 '24 06:07 lovelyqian

Semantic Segmentation/ Medical Image Segmentation Paper name/title: Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures Paper link: https://arxiv.org/abs/2407.14754 Code link: https://github.com/cbmi-group/FFM-Multi-Decoder-Network

Jul 23 '24 09:07 WHU-YH-jx

3D Registration / Visual Localization Paper Name: SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Paper Link: https://arxiv.org/abs/2404.10527 Code link: https://github.com/fraunhoferhhi/spvloc Project Link: https://fraunhoferhhi.github.io/spvloc/

Jul 26 '24 08:07 gard-n

Domain: Low-level Vision Paper name/title: OneRestore: A Universal Restoration Framework for Composite Degradation Project link: https://gy65896.github.io/projects/ECCV2024_OneRestore Paper link: https://arxiv.org/abs/2407.04621 Code link: https://github.com/gy65896/OneRestore

Jul 27 '24 06:07 YuanGao-YG

Domain: Object Counting Paper name/title: Zero-shot Object Counting with Good Exemplars Paper link: https://arxiv.org/abs/2407.04948 Code link: https://github.com/HopooLinZ/VA-Count

Jul 30 '24 07:07 HopooLinZ

Domain: real-time rendering / glossy object modeling Paper name/title: REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices Project link: https://xdimlab.github.io/REFRAME/ Paper link: https://arxiv.org/abs/2403.16481 Code link: https://github.com/MARVELOUSJI/REFRAME

Aug 05 '24 07:08 MARVELOUSJI
