Video-to-Video Synthesis |
NIPS |
code |
5578 |
Deep Image Prior |
CVPR |
code |
3736 |
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation |
CVPR |
code |
3405 |
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network |
ECCV |
code |
2434 |
Learning to See in the Dark |
CVPR |
code |
2326 |
Glow: Generative Flow with Invertible 1x1 Convolutions |
NIPS |
code |
2088 |
Squeeze-and-Excitation Networks |
CVPR |
code |
1477 |
Efficient Neural Architecture Search via Parameters Sharing |
ICML |
code |
1382 |
Multimodal Unsupervised Image-to-image Translation |
ECCV |
code |
1296 |
Non-Local Neural Networks |
CVPR |
code |
992 |
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? |
CVPR |
code |
924 |
Single-Shot Refinement Neural Network for Object Detection |
CVPR |
code |
875 |
Image Generation From Scene Graphs |
CVPR |
code |
851 |
GANimation: Anatomically-aware Facial Animation from a Single Image |
ECCV |
code |
772 |
Simple Baselines for Human Pose Estimation and Tracking |
ECCV |
code |
752 |
Visualizing the Loss Landscape of Neural Nets |
NIPS |
code |
724 |
Detect-and-Track: Efficient Pose Estimation in Videos |
CVPR |
code |
650 |
Relation Networks for Object Detection |
CVPR |
code |
635 |
Generative Image Inpainting With Contextual Attention |
CVPR |
code |
609 |
PointCNN |
NIPS |
code |
607 |
Look at Boundary: A Boundary-Aware Face Alignment Algorithm |
CVPR |
code |
575 |
Pelee: A Real-Time Object Detection System on Mobile Devices |
NIPS |
code |
548 |
Distractor-aware Siamese Networks for Visual Object Tracking |
ECCV |
code |
545 |
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples |
ICML |
code |
535 |
Which Training Methods for GANs do actually Converge? |
ICML |
code |
520 |
End-to-End Recovery of Human Shape and Pose |
CVPR |
code |
502 |
Taskonomy: Disentangling Task Transfer Learning |
CVPR |
code |
502 |
Cascaded Pyramid Network for Multi-Person Pose Estimation |
CVPR |
code |
497 |
Neural 3D Mesh Renderer |
CVPR |
code |
489 |
Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs |
CVPR |
code |
489 |
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs |
CVPR |
code |
485 |
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric |
CVPR |
code |
447 |
Frustum PointNets for 3D Object Detection From RGB-D Data |
CVPR |
code |
434 |
The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks |
CVPR |
code |
416 |
ICNet for Real-Time Semantic Segmentation on High-Resolution Images |
ECCV |
code |
415 |
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume |
CVPR |
code |
398 |
Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++ |
CVPR |
code |
397 |
Gibson Env: Real-World Perception for Embodied Agents |
CVPR |
code |
385 |
Acquisition of Localization Confidence for Accurate Object Detection |
ECCV |
code |
384 |
Noise2Noise: Learning Image Restoration without Clean Data |
ICML |
code |
370 |
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation |
CVPR |
code |
359 |
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose |
CVPR |
code |
359 |
A Style-Aware Content Loss for Real-time HD Style Transfer |
ECCV |
code |
349 |
Soccer on Your Tabletop |
CVPR |
code |
338 |
Pyramid Stereo Matching Network |
CVPR |
code |
335 |
Neural Baby Talk |
CVPR |
code |
332 |
License Plate Detection and Recognition in Unconstrained Scenarios |
ECCV |
code |
326 |
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors |
CVPR |
code |
326 |
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images |
ECCV |
code |
323 |
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning |
CVPR |
code |
317 |
Fast End-to-End Trainable Guided Filter |
CVPR |
code |
312 |
Deep Clustering for Unsupervised Learning of Visual Features |
ECCV |
code |
302 |
Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs |
CVPR |
code |
294 |
Neural Relational Inference for Interacting Systems |
ICML |
code |
289 |
Adversarially Regularized Autoencoders |
ICML |
code |
282 |
Learning to Adapt Structured Output Space for Semantic Segmentation |
CVPR |
code |
280 |
Convolutional Neural Networks With Alternately Updated Clique |
CVPR |
code |
272 |
Learning to Segment Every Thing |
CVPR |
code |
269 |
Supervising Unsupervised Learning |
NIPS |
code |
262 |
LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation |
CVPR |
code |
261 |
Bilinear Attention Networks |
NIPS |
code |
258 |
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation |
ECCV |
code |
254 |
An intriguing failing of convolutional neural networks and the CoordConv solution |
NIPS |
code |
249 |
End-to-End Learning of Motion Representation for Video Understanding |
CVPR |
code |
238 |
Image Super-Resolution Using Very Deep Residual Channel Attention Networks |
ECCV |
code |
234 |
Iterative Visual Reasoning Beyond Convolutions |
CVPR |
code |
228 |
Semi-Parametric Image Synthesis |
CVPR |
code |
226 |
Compressed Video Action Recognition |
CVPR |
code |
225 |
Style Aggregated Network for Facial Landmark Detection |
CVPR |
code |
223 |
Pose-Robust Face Recognition via Deep Residual Equivariant Mapping |
CVPR |
code |
220 |
Multi-Content GAN for Few-Shot Font Style Transfer |
CVPR |
code |
218 |
GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models |
ICML |
code |
214 |
Referring Relationships |
CVPR |
code |
210 |
MoCoGAN: Decomposing Motion and Content for Video Generation |
CVPR |
code |
205 |
Latent Alignment and Variational Attention |
NIPS |
code |
204 |
LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image |
CVPR |
code |
202 |
Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs |
CVPR |
code |
197 |
An End-to-End TextSpotter With Explicit Alignment and Attention |
CVPR |
code |
195 |
DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks |
CVPR |
code |
189 |
SPLATNet: Sparse Lattice Networks for Point Cloud Processing |
CVPR |
code |
188 |
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image |
CVPR |
code |
186 |
Single View Stereo Matching |
CVPR |
code |
182 |
MegaDepth: Learning Single-View Depth Prediction From Internet Photos |
CVPR |
code |
181 |
ECO: Efficient Convolutional Network for Online Video Understanding |
ECCV |
code |
180 |
Unsupervised Feature Learning via Non-Parametric Instance Discrimination |
CVPR |
code |
180 |
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing |
CVPR |
code |
179 |
Video Based Reconstruction of 3D People Models |
CVPR |
code |
179 |
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks |
CVPR |
code |
178 |
Learning Category-Specific Mesh Reconstruction from Image Collections |
ECCV |
code |
176 |
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms |
NIPS |
code |
175 |
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation |
ECCV |
code |
175 |
Group Normalization |
ECCV |
code |
175 |
Real-Time Seamless Single Shot 6D Object Pose Prediction |
CVPR |
code |
174 |
MVSNet: Depth Inference for Unstructured Multi-view Stereo |
ECCV |
code |
174 |
Neural Motifs: Scene Graph Parsing With Global Context |
CVPR |
code |
171 |
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations |
CVPR |
code |
169 |
Optimizing Video Object Detection via a Scale-Time Lattice |
CVPR |
code |
168 |
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network |
ECCV |
code |
167 |
Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns |
CVPR |
code |
166 |
Weakly Supervised Instance Segmentation Using Class Peak Response |
CVPR |
code |
166 |
PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image |
CVPR |
code |
164 |
Residual Dense Network for Image Super-Resolution |
CVPR |
code |
163 |
Embodied Question Answering |
CVPR |
code |
162 |
Evolved Policy Gradients |
NIPS |
code |
160 |
Camera Style Adaptation for Person Re-Identification |
CVPR |
code |
159 |
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer |
CVPR |
code |
159 |
Scale-Recurrent Network for Deep Image Deblurring |
CVPR |
code |
159 |
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction |
CVPR |
code |
158 |
Relational recurrent neural networks |
NIPS |
code |
157 |
Densely Connected Pyramid Dehazing Network |
CVPR |
code |
155 |
Image Inpainting for Irregular Holes Using Partial Convolutions |
ECCV |
code |
153 |
SO-Net: Self-Organizing Network for Point Cloud Analysis |
CVPR |
code |
152 |
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling |
CVPR |
code |
152 |
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices |
CVPR |
code |
152 |
DenseASPP for Semantic Segmentation in Street Scenes |
CVPR |
code |
151 |
Facelet-Bank for Fast Portrait Manipulation |
CVPR |
code |
150 |
Self-Imitation Learning |
ICML |
code |
145 |
Graph R-CNN for Scene Graph Generation |
ECCV |
code |
144 |
A Closer Look at Spatiotemporal Convolutions for Action Recognition |
CVPR |
code |
143 |
Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation |
CVPR |
code |
143 |
Quantized Densely Connected U-Nets for Efficient Landmark Localization |
ECCV |
code |
143 |
Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining |
ECCV |
code |
142 |
Two-Stream Convolutional Networks for Dynamic Texture Synthesis |
CVPR |
code |
141 |
Integral Human Pose Regression |
ECCV |
code |
141 |
Adaptive Affinity Fields for Semantic Segmentation |
ECCV |
code |
141 |
LSTM Pose Machines |
CVPR |
code |
141 |
Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships |
CVPR |
code |
140 |
Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform |
CVPR |
code |
139 |
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification |
CVPR |
code |
137 |
Learning to Compare: Relation Network for Few-Shot Learning |
CVPR |
code |
135 |
CosFace: Large Margin Cosine Loss for Deep Face Recognition |
CVPR |
code |
135 |
Deep Depth Completion of a Single RGB-D Image |
CVPR |
code |
134 |
Deep Back-Projection Networks for Super-Resolution |
CVPR |
code |
132 |
Context Embedding Networks |
CVPR |
code |
131 |
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics |
CVPR |
code |
131 |
Perturbative Neural Networks |
CVPR |
code |
130 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis |
ICML |
code |
129 |
Fast and Accurate Online Video Object Segmentation via Tracking Parts |
CVPR |
code |
129 |
Nonlinear 3D Face Morphable Model |
CVPR |
code |
128 |
BodyNet: Volumetric Inference of 3D Human Body Shapes |
ECCV |
code |
126 |
3D-CODED: 3D Correspondences by Deep Deformation |
ECCV |
code |
125 |
DeepMVS: Learning Multi-View Stereopsis |
CVPR |
code |
125 |
Hierarchical Imitation and Reinforcement Learning |
ICML |
code |
124 |
Domain Adaptive Faster R-CNN for Object Detection in the Wild |
CVPR |
code |
123 |
L4: Practical loss-based stepsize adaptation for deep learning |
NIPS |
code |
123 |
A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts |
CVPR |
code |
122 |
Recurrent Relational Networks |
NIPS |
code |
121 |
Gated Path Planning Networks |
ICML |
code |
121 |
PSANet: Point-wise Spatial Attention Network for Scene Parsing |
ECCV |
code |
121 |
Rethinking Feature Distribution for Loss Functions in Image Classification |
CVPR |
code |
120 |
Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network |
CVPR |
code |
118 |
FOTS: Fast Oriented Text Spotting With a Unified Network |
CVPR |
code |
118 |
ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes |
ECCV |
code |
117 |
PU-Net: Point Cloud Upsampling Network |
CVPR |
code |
117 |
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning |
CVPR |
code |
117 |
Long-term Tracking in the Wild: a Benchmark |
ECCV |
code |
116 |
Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene |
CVPR |
code |
114 |
Repulsion Loss: Detecting Pedestrians in a Crowd |
CVPR |
code |
113 |
Unsupervised Attention-guided Image-to-Image Translation |
NIPS |
code |
110 |
Attention-based Deep Multiple Instance Learning |
ICML |
code |
109 |
Learning Blind Video Temporal Consistency |
ECCV |
code |
109 |
Noisy Natural Gradient as Variational Inference |
ICML |
code |
108 |
End-to-End Weakly-Supervised Semantic Alignment |
CVPR |
code |
106 |
Decoupled Networks |
CVPR |
code |
105 |
LiDAR-Video Driving Dataset: Learning Driving Policies Effectively |
CVPR |
code |
104 |
MAttNet: Modular Attention Network for Referring Expression Comprehension |
CVPR |
code |
104 |
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks |
ECCV |
code |
103 |
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors |
CVPR |
code |
100 |
Deep Mutual Learning |
CVPR |
code |
100 |
Macro-Micro Adversarial Network for Human Parsing |
ECCV |
code |
98 |
ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans |
CVPR |
code |
97 |
Learning Depth From Monocular Videos Using Direct Methods |
CVPR |
code |
97 |
VITON: An Image-Based Virtual Try-On Network |
CVPR |
code |
95 |
Cascade R-CNN: Delving Into High Quality Object Detection |
CVPR |
code |
93 |
Learning Human-Object Interactions by Graph Parsing Neural Networks |
ECCV |
code |
93 |
Future Frame Prediction for Anomaly Detection – A New Baseline |
CVPR |
code |
92 |
Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence |
ECCV |
code |
92 |
Tell Me Where to Look: Guided Attention Inference Network |
CVPR |
code |
91 |
Neural Kinematic Networks for Unsupervised Motion Retargetting |
CVPR |
code |
90 |
Learning SO(3) Equivariant Representations with Spherical CNNs |
ECCV |
code |
89 |
One-Shot Unsupervised Cross Domain Translation |
NIPS |
code |
89 |
Synthesizing Images of Humans in Unseen Poses |
CVPR |
code |
88 |
Depth-aware CNN for RGB-D Segmentation |
ECCV |
code |
88 |
Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights |
ECCV |
code |
88 |
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding |
CVPR |
code |
87 |
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes |
CVPR |
code |
87 |
Neural Arithmetic Logic Units |
NIPS |
code |
87 |
A PID Controller Approach for Stochastic Optimization of Deep Networks |
CVPR |
code |
87 |
VITAL: VIsual Tracking via Adversarial Learning |
CVPR |
code |
86 |
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking |
CVPR |
code |
86 |
Recurrent Pixel Embedding for Instance Grouping |
CVPR |
code |
85 |
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation |
CVPR |
code |
84 |
Multi-Scale Location-Aware Kernel Representation for Object Detection |
CVPR |
code |
84 |
Repeatability Is Not Enough: Learning Affine Regions via Discriminability |
ECCV |
code |
84 |
“Zero-Shot” Super-Resolution Using Deep Internal Learning |
CVPR |
code |
84 |
DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency |
ECCV |
code |
82 |
Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction |
CVPR |
code |
80 |
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation |
ECCV |
code |
78 |
Generalizing A Person Retrieval Model Hetero- and Homogeneously |
ECCV |
code |
78 |
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning |
CVPR |
code |
77 |
Pairwise Confusion for Fine-Grained Visual Classification |
ECCV |
code |
77 |
Learning to Reweight Examples for Robust Deep Learning |
ICML |
code |
76 |
Improving Generalization via Scalable Neighborhood Component Analysis |
ECCV |
code |
76 |
SparseMAP: Differentiable Sparse Structured Inference |
ICML |
code |
75 |
PDE-Net: Learning PDEs from Data |
ICML |
code |
75 |
Pose-Normalized Image Generation for Person Re-identification |
ECCV |
code |
75 |
Disentangled Person Image Generation |
CVPR |
code |
75 |
Learning to Navigate for Fine-grained Classification |
ECCV |
code |
74 |
Superpixel Sampling Networks |
ECCV |
code |
74 |
Shift-Net: Image Inpainting via Deep Feature Rearrangement |
ECCV |
code |
74 |
3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation |
ECCV |
code |
74 |
Ordinal Depth Supervision for 3D Human Pose Estimation |
CVPR |
code |
74 |
Path-Level Network Transformation for Efficient Architecture Search |
ICML |
code |
73 |
Diverse Image-to-Image Translation via Disentangled Representations |
ECCV |
code |
72 |
Visual Feature Attribution Using Wasserstein GANs |
CVPR |
code |
72 |
Real-World Anomaly Detection in Surveillance Videos |
CVPR |
code |
72 |
Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval |
CVPR |
code |
72 |
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image |
ECCV |
code |
72 |
Learning to Find Good Correspondences |
CVPR |
code |
72 |
Learning Less Is More - 6D Camera Localization via 3D Surface Regression |
CVPR |
code |
72 |
Object Level Visual Reasoning in Videos |
ECCV |
code |
71 |
Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing |
CVPR |
code |
71 |
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration |
CVPR |
code |
71 |
Fast and Accurate Single Image Super-Resolution via Information Distillation Network |
CVPR |
code |
71 |
Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present |
CVPR |
code |
70 |
Multi-Shot Pedestrian Re-Identification via Sequential Decision Making |
CVPR |
code |
70 |
PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition |
CVPR |
code |
69 |
Progressive Neural Architecture Search |
ECCV |
code |
68 |
Generative Neural Machine Translation |
NIPS |
code |
68 |
Learning Latent Super-Events to Detect Multiple Activities in Videos |
CVPR |
code |
67 |
Generate to Adapt: Aligning Domains Using Generative Adversarial Networks |
CVPR |
code |
67 |
Adversarial Feature Augmentation for Unsupervised Domain Adaptation |
CVPR |
code |
67 |
Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking |
CVPR |
code |
67 |
Pointwise Convolutional Neural Networks |
CVPR |
code |
67 |
Optimizing the Latent Space of Generative Networks |
ICML |
code |
66 |
Part-Aligned Bilinear Representations for Person Re-Identification |
ECCV |
code |
64 |
Geometry-Aware Learning of Maps for Camera Localization |
CVPR |
code |
63 |
Fighting Fake News: Image Splice Detection via Learned Self-Consistency |
ECCV |
code |
62 |
Isolating Sources of Disentanglement in Variational Autoencoders |
NIPS |
code |
62 |
Neural Program Synthesis from Diverse Demonstration Videos |
ICML |
code |
62 |
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation |
ECCV |
code |
61 |
Rotation-Sensitive Regression for Oriented Scene Text Detection |
CVPR |
code |
61 |
Human Semantic Parsing for Person Re-Identification |
CVPR |
code |
61 |
Unsupervised Discovery of Object Landmarks as Structural Representations |
CVPR |
code |
61 |
IQA: Visual Question Answering in Interactive Environments |
CVPR |
code |
60 |
Hierarchical Long-term Video Prediction without Supervision |
ICML |
code |
60 |
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency |
ECCV |
code |
60 |
Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning |
CVPR |
code |
59 |
Neural Style Transfer via Meta Networks |
CVPR |
code |
59 |
Frame-Recurrent Video Super-Resolution |
CVPR |
code |
58 |
PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction |
ECCV |
code |
57 |
CBAM: Convolutional Block Attention Module |
ECCV |
code |
57 |
Decorrelated Batch Normalization |
CVPR |
code |
57 |
Learning Conditioned Graph Structures for Interpretable Visual Question Answering |
NIPS |
code |
57 |
Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition |
ECCV |
code |
57 |
Leveraging Unlabeled Data for Crowd Counting by Learning to Rank |
CVPR |
code |
56 |
Deep Marching Cubes: Learning Explicit Surface Representations |
CVPR |
code |
56 |
Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation |
CVPR |
code |
56 |
LF-Net: Learning Local Features from Images |
NIPS |
code |
55 |
Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model |
ECCV |
code |
55 |
Discriminability Objective for Training Descriptive Captions |
CVPR |
code |
54 |
BlockDrop: Dynamic Inference Paths in Residual Networks |
CVPR |
code |
54 |
Conditional Probability Models for Deep Image Compression |
CVPR |
code |
54 |
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation |
CVPR |
code |
54 |
Learning towards Minimum Hyperspherical Energy |
NIPS |
code |
54 |
DeepVS: A Deep Learning Based Video Saliency Prediction Approach |
ECCV |
code |
53 |
Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting |
ECCV |
code |
52 |
Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation |
CVPR |
code |
52 |
Wasserstein Introspective Neural Networks |
CVPR |
code |
51 |
SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis |
CVPR |
code |
51 |
Self-produced Guidance for Weakly-supervised Object Localization |
ECCV |
code |
51 |
Measuring abstract reasoning in neural networks |
ICML |
code |
51 |
A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation |
NIPS |
code |
51 |
RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials |
CVPR |
code |
51 |
Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation |
ECCV |
code |
50 |
Efficient end-to-end learning for quantizable representations |
ICML |
code |
50 |
Visual Question Generation as Dual Task of Visual Question Answering |
CVPR |
code |
50 |
Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam |
ICML |
code |
49 |
Surface Networks |
CVPR |
code |
48 |
Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions |
ICML |
code |
48 |
Stacked Cross Attention for Image-Text Matching |
ECCV |
code |
48 |
Actor and Observer: Joint Modeling of First and Third-Person Videos |
CVPR |
code |
48 |
Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation |
CVPR |
code |
47 |
Learning-based Video Motion Magnification |
ECCV |
code |
47 |
Pose Partition Networks for Multi-Person Pose Estimation |
ECCV |
code |
47 |
Neural Autoregressive Flows |
ICML |
code |
47 |
Weakly- and Semi-Supervised Panoptic Segmentation |
ECCV |
code |
46 |
Video Re-localization |
ECCV |
code |
46 |
Real-time 'Actor-Critic' Tracking |
ECCV |
code |
46 |
Black-box Adversarial Attacks with Limited Queries and Information |
ICML |
code |
46 |
Hyperbolic Entailment Cones for Learning Hierarchical Embeddings |
ICML |
code |
46 |
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation |
CVPR |
code |
46 |
Differentiable Compositional Kernel Learning for Gaussian Processes |
ICML |
code |
45 |
Visualizing and Understanding Atari Agents |
ICML |
code |
45 |
Image Manipulation with Perceptual Discriminators |
ECCV |
code |
45 |
Learning Intrinsic Image Decomposition From Watching the World |
CVPR |
code |
45 |
Overcoming Catastrophic Forgetting with Hard Attention to the Task |
ICML |
code |
44 |
Learning Pose Specific Representations by Predicting Different Views |
CVPR |
code |
44 |
Zero-Shot Object Detection |
ECCV |
code |
43 |
Mean Field Multi-Agent Reinforcement Learning |
ICML |
code |
43 |
Partial Adversarial Domain Adaptation |
ECCV |
code |
43 |
Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation |
ECCV |
code |
43 |
Robust Classification With Convolutional Prototype Learning |
CVPR |
code |
43 |
SimplE Embedding for Link Prediction in Knowledge Graphs |
NIPS |
code |
42 |
PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning |
ICML |
code |
42 |
Learning to Blend Photos |
ECCV |
code |
42 |
Mask-Guided Contrastive Attention Model for Person Re-Identification |
CVPR |
code |
41 |
Link Prediction Based on Graph Neural Networks |
NIPS |
code |
41 |
Generalisation in humans and deep neural networks |
NIPS |
code |
41 |
Towards Binary-Valued Gates for Robust LSTM Training |
ICML |
code |
41 |
Multi-scale Residual Network for Image Super-Resolution |
ECCV |
code |
41 |
Fully Motion-Aware Network for Video Object Detection |
ECCV |
code |
41 |
Interpretable Convolutional Neural Networks |
CVPR |
code |
40 |
Generative Adversarial Perturbations |
CVPR |
code |
40 |
The Sound of Pixels |
ECCV |
code |
40 |
Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization |
CVPR |
code |
40 |
Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance |
ECCV |
code |
40 |
Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation |
NIPS |
code |
40 |
Learning Warped Guidance for Blind Face Restoration |
ECCV |
code |
39 |
Adversarial Complementary Learning for Weakly Supervised Object Localization |
CVPR |
code |
39 |
Learning Semantic Representations for Unsupervised Domain Adaptation |
ICML |
code |
39 |
Neural Architecture Search with Bayesian Optimisation and Optimal Transport |
NIPS |
code |
39 |
Mutual Information Neural Estimation |
ICML |
code |
39 |
NetGAN: Generating Graphs via Random Walks |
ICML |
code |
39 |
Learning to Evaluate Image Captioning |
CVPR |
code |
38 |
Hyperbolic Neural Networks |
NIPS |
code |
37 |
Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation |
ECCV |
code |
37 |
Adversarially Learned One-Class Classifier for Novelty Detection |
CVPR |
code |
37 |
Disentangling by Factorising |
ICML |
code |
37 |
Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples |
ICML |
code |
37 |
Tangent Convolutions for Dense Prediction in 3D |
CVPR |
code |
37 |
Few-Shot Image Recognition by Predicting Parameters From Activations |
CVPR |
code |
37 |
Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer |
CVPR |
code |
37 |
Generalizing to Unseen Domains via Adversarial Data Augmentation |
NIPS |
code |
36 |
SeGAN: Segmenting and Generating the Invisible |
CVPR |
code |
36 |
Graphical Generative Adversarial Networks |
NIPS |
code |
36 |
PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference |
CVPR |
code |
36 |
Gated Fusion Network for Single Image Dehazing |
CVPR |
code |
35 |
Neural Code Comprehension: A Learnable Representation of Code Semantics |
NIPS |
code |
35 |
Eye In-Painting With Exemplar Generative Adversarial Networks |
CVPR |
code |
35 |
Deep One-Class Classification |
ICML |
code |
34 |
Deep Regression Tracking with Shrinkage Loss |
ECCV |
code |
34 |
Deflecting Adversarial Attacks With Pixel Deflection |
CVPR |
code |
34 |
Learning Visual Question Answering by Bootstrapping Hard Attention |
ECCV |
code |
33 |
Human-Centric Indoor Scene Synthesis Using Stochastic Grammar |
CVPR |
code |
33 |
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering |
CVPR |
code |
33 |
CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise |
CVPR |
code |
33 |
Speaker-Follower Models for Vision-and-Language Navigation |
NIPS |
code |
33 |
Improving Shape Deformation in Unsupervised Image-to-Image Translation |
ECCV |
code |
33 |
Learning Single-View 3D Reconstruction with Limited Pose Supervision |
ECCV |
code |
33 |
3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data |
NIPS |
code |
33 |
Adversarial Logit Pairing |
NIPS |
code |
32 |
Attention in Convolutional LSTM for Gesture Recognition |
NIPS |
code |
32 |
Graph-Cut RANSAC |
CVPR |
code |
32 |
Neural Guided Constraint Logic Programming for Program Synthesis |
NIPS |
code |
32 |
Learning Dynamic Memory Networks for Object Tracking |
ECCV |
code |
32 |
GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints |
ECCV |
code |
32 |
A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks |
NIPS |
code |
32 |
Flow-Grounded Spatial-Temporal Video Prediction from Still Images |
ECCV |
code |
32 |
Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection |
ECCV |
code |
32 |
On the Robustness of Semantic Segmentation Models to Adversarial Attacks |
CVPR |
code |
31 |
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning |
CVPR |
code |
31 |
SketchyScene: Richly-Annotated Scene Sketches |
ECCV |
code |
31 |
Deep Randomized Ensembles for Metric Learning |
ECCV |
code |
30 |
Deep High Dynamic Range Imaging with Large Foreground Motions |
ECCV |
code |
30 |
Revisiting Video Saliency: A Large-Scale Benchmark and a New Model |
CVPR |
code |
30 |
Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning |
CVPR |
code |
30 |
Deep Model-Based 6D Pose Refinement in RGB |
ECCV |
code |
30 |
TOM-Net: Learning Transparent Object Matting From a Single Image |
CVPR |
code |
30 |
Quaternion Convolutional Neural Networks |
ECCV |
code |
30 |
Densely Connected Attention Propagation for Reading Comprehension |
NIPS |
code |
30 |
A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising |
ECCV |
code |
30 |
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings |
ICML |
code |
29 |
Video Rain Streak Removal by Multiscale Convolutional Sparse Coding |
CVPR |
code |
29 |
Recurrent Scene Parsing With Perspective Understanding in the Loop |
CVPR |
code |
29 |
Single Shot Scene Text Retrieval |
ECCV |
code |
29 |
Toward Characteristic-Preserving Image-based Virtual Try-On Network |
ECCV |
code |
29 |
Explainable Neural Computation via Stack Neural Module Networks |
ECCV |
code |
29 |
Exploring Disentangled Feature Representation Beyond Face Identification |
CVPR |
code |
29 |
Controllable Video Generation With Sparse Trajectories |
CVPR |
code |
28 |
Layer-structured 3D Scene Inference via View Synthesis |
ECCV |
code |
28 |
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation |
ECCV |
code |
28 |
PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection |
CVPR |
code |
28 |
Learning Rich Features for Image Manipulation Detection |
CVPR |
code |
27 |
Fast Video Object Segmentation by Reference-Guided Mask Propagation |
CVPR |
code |
27 |
3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration |
ECCV |
code |
27 |
Who Let the Dogs Out? Modeling Dog Behavior From Visual Data |
CVPR |
code |
27 |
EC-Net: an Edge-aware Point set Consolidation Network |
ECCV |
code |
27 |
Interpretable Intuitive Physics Model |
ECCV |
code |
27 |
Learning a Discriminative Feature Network for Semantic Segmentation |
CVPR |
code |
26 |
Partial Transfer Learning With Selective Adversarial Networks |
CVPR |
code |
26 |
Cross-Modal Deep Variational Hand Pose Estimation |
CVPR |
code |
26 |
Between-Class Learning for Image Classification |
CVPR |
code |
26 |
AON: Towards Arbitrarily-Oriented Text Recognition |
CVPR |
code |
26 |
Conditional Image-to-Image Translation |
CVPR |
code |
25 |
Learning Convolutional Networks for Content-Weighted Image Compression |
CVPR |
code |
25 |
Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification |
CVPR |
code |
25 |
Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries |
ECCV |
code |
25 |
CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation |
CVPR |
code |
25 |
Deep Texture Manifold for Ground Terrain Recognition |
CVPR |
code |
25 |
Audio-Visual Event Localization in Unconstrained Videos |
ECCV |
code |
25 |
First Order Generative Adversarial Networks |
ICML |
code |
25 |
Visual Coreference Resolution in Visual Dialog using Neural Module Networks |
ECCV |
code |
25 |
SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks |
CVPR |
code |
24 |
Deep Reinforcement Learning of Marked Temporal Point Processes |
NIPS |
code |
24 |
Explicit Inductive Bias for Transfer Learning with Convolutional Networks |
ICML |
code |
24 |
LEGO: Learning Edge With Geometry All at Once by Watching Videos |
CVPR |
code |
24 |
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes |
ECCV |
code |
24 |
Multi-Agent Diverse Generative Adversarial Networks |
CVPR |
code |
23 |
Face Aging With Identity-Preserved Conditional Generative Adversarial Networks |
CVPR |
code |
23 |
Learning to Separate Object Sounds by Watching Unlabeled Video |
ECCV |
code |
23 |
Exploiting the Potential of Standard Convolutional Autoencoders for Image Restoration by Evolutionary Search |
ICML |
code |
23 |
To Trust Or Not To Trust A Classifier |
NIPS |
code |
23 |
Im2Flow: Motion Hallucination From Static Images for Action Recognition |
CVPR |
code |
22 |
ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing |
CVPR |
code |
22 |
Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning |
CVPR |
code |
22 |
Anonymous Walk Embeddings |
ICML |
code |
22 |
Learning to Multitask |
NIPS |
code |
22 |
CondenseNet: An Efficient DenseNet Using Learned Group Convolutions |
CVPR |
code |
22 |
HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN |
CVPR |
code |
22 |
Hierarchical Relational Networks for Group Activity Recognition and Retrieval |
ECCV |
code |
22 |
Collaborative and Adversarial Network for Unsupervised Domain Adaptation |
CVPR |
code |
22 |
Geometry-Aware Scene Text Detection With Instance Transformation Network |
CVPR |
code |
22 |
Learning to Promote Saliency Detectors |
CVPR |
code |
21 |
CSGNet: Neural Shape Parser for Constructive Solid Geometry |
CVPR |
code |
21 |
Local Spectral Graph Convolution for Point Set Feature Learning |
ECCV |
code |
21 |
HiDDeN: Hiding Data with Deep Networks |
ECCV |
code |
21 |
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning |
CVPR |
code |
20 |
Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal |
CVPR |
code |
20 |
Fully-Convolutional Point Networks for Large-Scale Point Clouds |
ECCV |
code |
20 |
Learning Superpixels With Segmentation-Aware Affinity Loss |
CVPR |
code |
20 |
Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks |
CVPR |
code |
20 |
Crowd Counting With Deep Negative Correlation Learning |
CVPR |
code |
20 |
Dimensionality-Driven Learning with Noisy Labels |
ICML |
code |
20 |
Objects that Sound |
ECCV |
code |
20 |
Deep Expander Networks: Efficient Deep Networks from Graph Theory |
ECCV |
code |
19 |
Low-Shot Learning With Large-Scale Diffusion |
CVPR |
code |
19 |
Low-Shot Learning With Imprinted Weights |
CVPR |
code |
19 |
Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery |
CVPR |
code |
19 |
Learning Descriptor Networks for 3D Shape Synthesis and Analysis |
CVPR |
code |
19 |
Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders |
ECCV |
code |
19 |
CTAP: Complementary Temporal Action Proposal Generation |
ECCV |
code |
18 |
DVAE#: Discrete Variational Autoencoders with Relaxed Boltzmann Priors |
NIPS |
code |
18 |
Conditional Image-Text Embedding Networks |
ECCV |
code |
18 |
EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images |
CVPR |
code |
18 |
Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points |
CVPR |
code |
18 |
Bayesian Optimization of Combinatorial Structures |
ICML |
code |
18 |
FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis |
CVPR |
code |
18 |
Learning Type-Aware Embeddings for Fashion Compatibility |
ECCV |
code |
17 |
Sliced Wasserstein Distance for Learning Gaussian Mixture Models |
CVPR |
code |
17 |
Revisiting Deep Intrinsic Image Decompositions |
CVPR |
code |
17 |
A Spectral Approach to Gradient Estimation for Implicit Distributions |
ICML |
code |
17 |
Hierarchical Novelty Detection for Visual Object Recognition |
CVPR |
code |
17 |
Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies |
CVPR |
code |
17 |
Learning Generative ConvNets via Multi-Grid Modeling and Sampling |
CVPR |
code |
17 |
Learning 3D Shape Completion From Laser Scan Data With Weak Supervision |
CVPR |
code |
17 |
Triplet Loss in Siamese Network for Object Tracking |
ECCV |
code |
17 |
Adversarial Attack on Graph Structured Data |
ICML |
code |
17 |
Arbitrary Style Transfer With Deep Feature Reshuffle |
CVPR |
code |
17 |
Visual Question Reasoning on General Dependency Tree |
CVPR |
code |
17 |
Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition |
ECCV |
code |
16 |
Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks |
NIPS |
code |
16 |
Coded Sparse Matrix Multiplication |
ICML |
code |
16 |
Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment |
CVPR |
code |
16 |
Recovering 3D Planes from a Single Image via Convolutional Neural Networks |
ECCV |
code |
16 |
SegStereo: Exploiting Semantic Information for Disparity Estimation |
ECCV |
code |
16 |
Functional Gradient Boosting based on Residual Network Perception |
ICML |
code |
16 |
NAG: Network for Adversary Generation |
CVPR |
code |
16 |
Generative Probabilistic Novelty Detection with Adversarial Autoencoders |
NIPS |
code |
16 |
Hashing as Tie-Aware Learning to Rank |
CVPR |
code |
15 |
Pose Proposal Networks |
ECCV |
code |
15 |
Convolutional Sequence to Sequence Model for Human Dynamics |
CVPR |
code |
15 |
Joint Pose and Expression Modeling for Facial Expression Recognition |
CVPR |
code |
15 |
Grounding Referring Expressions in Images by Variational Context |
CVPR |
code |
15 |
Rethinking the Form of Latent States in Image Captioning |
ECCV |
code |
15 |
Open Set Domain Adaptation by Backpropagation |
ECCV |
code |
15 |
Neural Sign Language Translation |
CVPR |
code |
15 |
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters |
ECCV |
code |
15 |
Efficient Neural Audio Synthesis |
ICML |
code |
15 |
Deep Learning Under Privileged Information Using Heteroscedastic Dropout |
CVPR |
code |
14 |
Image Transformer |
ICML |
code |
14 |
Learning to Understand Image Blur |
CVPR |
code |
14 |
Learning and Using the Arrow of Time |
CVPR |
code |
14 |
Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints |
CVPR |
code |
14 |
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation |
ECCV |
code |
14 |
Multi-Scale Weighted Nuclear Norm Image Restoration |
CVPR |
code |
14 |
Synthesizing Robust Adversarial Examples |
ICML |
code |
13 |
Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data |
ECCV |
code |
13 |
Assessing Generative Models via Precision and Recall |
NIPS |
code |
13 |
Deep Diffeomorphic Transformer Networks |
CVPR |
code |
13 |
Learning by Asking Questions |
CVPR |
code |
13 |
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection |
CVPR |
code |
13 |
Variational Autoencoders for Deforming 3D Mesh Models |
CVPR |
code |
13 |
Min-Entropy Latent Model for Weakly Supervised Object Detection |
CVPR |
code |
13 |
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering |
CVPR |
code |
13 |
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace |
ICML |
code |
13 |
Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition |
CVPR |
code |
13 |
Finding Influential Training Samples for Gradient Boosted Decision Trees |
ICML |
code |
13 |
Gesture Recognition: Focus on the Hands |
CVPR |
code |
12 |
Cross-View Image Synthesis Using Conditional GANs |
CVPR |
code |
12 |
Joint Optimization Framework for Learning With Noisy Labels |
CVPR |
code |
12 |
Future Person Localization in First-Person Videos |
CVPR |
code |
12 |
AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos |
ECCV |
code |
12 |
Learning Transferable Architectures for Scalable Image Recognition |
CVPR |
code |
12 |
Clipped Action Policy Gradient |
ICML |
code |
12 |
Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation |
CVPR |
code |
12 |
Decouple Learning for Parameterized Image Operators |
ECCV |
code |
12 |
Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction |
ICML |
code |
12 |
Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models |
NIPS |
code |
12 |
AMNet: Memorability Estimation With Attention |
CVPR |
code |
12 |
Adversarial Time-to-Event Modeling |
ICML |
code |
12 |
Reversible Recurrent Neural Networks |
NIPS |
code |
12 |
Human Pose Estimation With Parsing Induced Learner |
CVPR |
code |
11 |
ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking |
ECCV |
code |
11 |
A Joint Sequence Fusion Model for Video Question Answering and Retrieval |
ECCV |
code |
11 |
Learning Face Age Progression: A Pyramid Architecture of GANs |
CVPR |
code |
11 |
Robust Physical-World Attacks on Deep Learning Visual Classification |
CVPR |
code |
11 |
High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach |
ICML |
code |
11 |
Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory |
ICML |
code |
11 |
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence |
CVPR |
code |
11 |
Accelerating Natural Gradient with Higher-Order Invariance |
ICML |
code |
11 |
Hierarchical Multi-Label Classification Networks |
ICML |
code |
11 |
Convolutional Image Captioning |
CVPR |
code |
11 |
Boosting Domain Adaptation by Discovering Latent Domains |
CVPR |
code |
11 |
Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks |
CVPR |
code |
10 |
PacGAN: The power of two samples in generative adversarial networks |
NIPS |
code |
10 |
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification |
CVPR |
code |
10 |
End-to-End Incremental Learning |
ECCV |
code |
10 |
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation |
CVPR |
code |
10 |
On GANs and GMMs |
NIPS |
code |
10 |
Salient Object Detection Driven by Fixation Prediction |
CVPR |
code |
9 |
Semantic Video Segmentation by Gated Recurrent Flow Propagation |
CVPR |
code |
9 |
Constraint-Aware Deep Neural Network Compression |
ECCV |
code |
9 |
Statistically-motivated Second-order Pooling |
ECCV |
code |
9 |
Excitation Backprop for RNNs |
CVPR |
code |
9 |
Analyzing Uncertainty in Neural Machine Translation |
ICML |
code |
9 |
Learning Dynamics of Linear Denoising Autoencoders |
ICML |
code |
9 |
Saliency Detection in 360° Videos |
ECCV |
code |
9 |
Density Adaptive Point Set Registration |
CVPR |
code |
9 |
Decoupled Parallel Backpropagation with Convergence Guarantee |
ICML |
code |
9 |
Classification from Pairwise Similarity and Unlabeled Data |
ICML |
code |
9 |
oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis |
ICML |
code |
9 |
Modeling Sparse Deviations for Compressed Sensing using Generative Models |
ICML |
code |
9 |
Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction |
CVPR |
code |
9 |
Towards Open-Set Identity Preserving Face Synthesis |
CVPR |
code |
9 |
Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras |
CVPR |
code |
8 |
BourGAN: Generative Networks with Metric Embeddings |
NIPS |
code |
8 |
Fast Information-theoretic Bayesian Optimisation |
ICML |
code |
8 |
Deep Variational Reinforcement Learning for POMDPs |
ICML |
code |
8 |
Specular-to-Diffuse Translation for Multi-View Reconstruction |
ECCV |
code |
8 |
Dynamic Conditional Networks for Few-Shot Learning |
ECCV |
code |
8 |
Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering |
CVPR |
code |
8 |
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs |
CVPR |
code |
8 |
Deep Defense: Training DNNs with Improved Adversarial Robustness |
NIPS |
code |
8 |
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations |
ICML |
code |
8 |
Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based Modeling |
ECCV |
code |
7 |
Non-metric Similarity Graphs for Maximum Inner Product Search |
NIPS |
code |
7 |
Towards Realistic Predictors |
ECCV |
code |
7 |
Deep Non-Blind Deconvolution via Generalized Low-Rank Approximation |
NIPS |
code |
7 |
Don’t Just Assume Look and Answer: Overcoming Priors for Visual Question Answering |
CVPR |
code |
7 |
Learning Dual Convolutional Neural Networks for Low-Level Vision |
CVPR |
code |
7 |
The Mirage of Action-Dependent Baselines in Reinforcement Learning |
ICML |
code |
7 |
DVQA: Understanding Data Visualizations via Question Answering |
CVPR |
code |
7 |
A Two-Step Disentanglement Method |
CVPR |
code |
7 |
Detecting and Correcting for Label Shift with Black Box Predictors |
ICML |
code |
7 |
Conditional Prior Networks for Optical Flow |
ECCV |
code |
7 |
Generative Adversarial Learning Towards Fast Weakly Supervised Detection |
CVPR |
code |
7 |
Adversarial Learning with Local Coordinate Coding |
ICML |
code |
7 |
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks |
CVPR |
code |
7 |
AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks |
CVPR |
code |
7 |
Learning to Explain: An Information-Theoretic Perspective on Model Interpretation |
ICML |
code |
7 |
Banach Wasserstein GAN |
NIPS |
code |
7 |
Gradually Updated Neural Networks for Large-Scale Image Recognition |
ICML |
code |
7 |
Learning Steady-States of Iterative Algorithms over Graphs |
ICML |
code |
7 |
Progressive Attention Guided Recurrent Network for Salient Object Detection |
CVPR |
code |
7 |
Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains |
CVPR |
code |
6 |
Unsupervised holistic image generation from key local patches |
ECCV |
code |
6 |
Inner Space Preserving Generative Pose Machine |
ECCV |
code |
6 |
Bilevel Programming for Hyperparameter Optimization and Meta-Learning |
ICML |
code |
6 |
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition |
CVPR |
code |
6 |
Breaking the Activation Function Bottleneck through Adaptive Parameterization |
NIPS |
code |
6 |
Ultra Large-Scale Feature Selection using Count-Sketches |
ICML |
code |
6 |
Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks |
CVPR |
code |
6 |
Orthogonally Decoupled Variational Gaussian Processes |
NIPS |
code |
6 |
Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design |
ICML |
code |
6 |
A Modulation Module for Multi-task Learning with Applications in Image Retrieval |
ECCV |
code |
6 |
A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos |
CVPR |
code |
6 |
Towards Effective Low-Bitwidth Convolutional Neural Networks |
CVPR |
code |
5 |
Disentangling Factors of Variation by Mixing Them |
CVPR |
code |
5 |
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior |
ECCV |
code |
5 |
Learning Longer-term Dependencies in RNNs with Auxiliary Losses |
ICML |
code |
5 |
Contour Knowledge Transfer for Salient Object Detection |
ECCV |
code |
5 |
HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning |
ECCV |
code |
5 |
Sidekick Policy Learning for Active Visual Exploration |
ECCV |
code |
5 |
Learning to Localize Sound Source in Visual Scenes |
CVPR |
code |
5 |
Neural Architecture Optimization |
NIPS |
code |
5 |
COLA: Decentralized Linear Learning |
NIPS |
code |
5 |
Diverse and Coherent Paragraph Generation from Images |
ECCV |
code |
5 |
DRACO: Byzantine-resilient Distributed Training via Redundant Gradients |
ICML |
code |
5 |
Inter and Intra Topic Structure Learning with Word Embeddings |
ICML |
code |
5 |
Estimating the Success of Unsupervised Image to Image Translation |
ECCV |
code |
5 |
Dynamic-Structured Semantic Propagation Network |
CVPR |
code |
5 |
The Description Length of Deep Learning models |
NIPS |
code |
5 |
Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving |
ECCV |
code |
5 |
Blind Justice: Fairness with Encrypted Sensitive Attributes |
ICML |
code |
5 |
Transfer Learning via Learning to Transfer |
ICML |
code |
5 |
Deepcode: Feedback Codes via Deep Learning |
NIPS |
code |
4 |
Configurable Markov Decision Processes |
ICML |
code |
4 |
A Framework for Evaluating 6-DOF Object Trackers |
ECCV |
code |
4 |
Differentially Private Database Release via Kernel Mean Embeddings |
ICML |
code |
4 |
Recognizing Human Actions as the Evolution of Pose Estimation Maps |
CVPR |
code |
4 |
Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images |
CVPR |
code |
4 |
DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map |
CVPR |
code |
4 |
Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification |
ECCV |
code |
4 |
Tracking Emerges by Colorizing Videos |
ECCV |
code |
4 |
Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes |
ECCV |
code |
4 |
Inference Suboptimality in Variational Autoencoders |
ICML |
code |
4 |
Black Box FDR |
ICML |
code |
4 |
Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence |
CVPR |
code |
4 |
Quadrature-based features for kernel approximation |
NIPS |
code |
4 |
Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking |
ECCV |
code |
4 |
Transferable Adversarial Perturbations |
ECCV |
code |
4 |
Single Image Water Hazard Detection using FCN with Reflection Attention Units |
ECCV |
code |
4 |
Multimodal Generative Models for Scalable Weakly-Supervised Learning |
NIPS |
code |
4 |
Importance Weighted Transfer of Samples in Reinforcement Learning |
ICML |
code |
3 |
Feature Generating Networks for Zero-Shot Learning |
CVPR |
code |
3 |
DICOD: Distributed Convolutional Coordinate Descent for Convolutional Sparse Coding |
ICML |
code |
3 |
CapProNet: Deep Feature Learning via Orthogonal Projections onto Capsule Subspaces |
NIPS |
code |
3 |
Bidirectional Retrieval Made Simple |
CVPR |
code |
3 |
Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages |
NIPS |
code |
3 |
A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping |
CVPR |
code |
3 |
Spatially-Adaptive Filter Units for Deep Neural Networks |
CVPR |
code |
3 |
Learning to Branch |
ICML |
code |
3 |
Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives |
NIPS |
code |
3 |
Lifelong Learning via Progressive Distillation and Retrospection |
ECCV |
code |
3 |
CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition |
CVPR |
code |
3 |
Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care |
ICML |
code |
3 |
Learning Answer Embeddings for Visual Question Answering |
CVPR |
code |
3 |
Information Constraints on Auto-Encoding Variational Bayes |
NIPS |
code |
3 |
Parallel Bayesian Network Structure Learning |
ICML |
code |
3 |
Ring Loss: Convex Feature Normalization for Face Recognition |
CVPR |
code |
3 |
Teaching Categories to Human Learners With Visual Explanations |
CVPR |
code |
3 |
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization |
ICML |
code |
3 |
Deep Burst Denoising |
ECCV |
code |
3 |
Convergent Tree Backup and Retrace with Function Approximation |
ICML |
code |
3 |
Gaze Prediction in Dynamic 360° Immersive Videos |
CVPR |
code |
3 |
Statistical Recurrent Models on Manifold valued Data |
NIPS |
code |
3 |
End-to-End Flow Correlation Tracking With Spatial-Temporal Attention |
CVPR |
code |
3 |