| Video-to-Video Synthesis |
NIPS |
code |
5578 |
| Deep Image Prior |
CVPR |
code |
3736 |
| StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation |
CVPR |
code |
3405 |
| Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network |
ECCV |
code |
2434 |
| Learning to See in the Dark |
CVPR |
code |
2326 |
| Glow: Generative Flow with Invertible 1x1 Convolutions |
NIPS |
code |
2088 |
| Squeeze-and-Excitation Networks |
CVPR |
code |
1477 |
| Efficient Neural Architecture Search via Parameters Sharing |
ICML |
code |
1382 |
| Multimodal Unsupervised Image-to-image Translation |
ECCV |
code |
1296 |
| Non-Local Neural Networks |
CVPR |
code |
992 |
| Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? |
CVPR |
code |
924 |
| Single-Shot Refinement Neural Network for Object Detection |
CVPR |
code |
875 |
| Image Generation From Scene Graphs |
CVPR |
code |
851 |
| GANimation: Anatomically-aware Facial Animation from a Single Image |
ECCV |
code |
772 |
| Simple Baselines for Human Pose Estimation and Tracking |
ECCV |
code |
752 |
| Visualizing the Loss Landscape of Neural Nets |
NIPS |
code |
724 |
| Detect-and-Track: Efficient Pose Estimation in Videos |
CVPR |
code |
650 |
| Relation Networks for Object Detection |
CVPR |
code |
635 |
| Generative Image Inpainting With Contextual Attention |
CVPR |
code |
609 |
| PointCNN |
NIPS |
code |
607 |
| Look at Boundary: A Boundary-Aware Face Alignment Algorithm |
CVPR |
code |
575 |
| Pelee: A Real-Time Object Detection System on Mobile Devices |
NIPS |
code |
548 |
| Distractor-aware Siamese Networks for Visual Object Tracking |
ECCV |
code |
545 |
| Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples |
ICML |
code |
535 |
| Which Training Methods for GANs do actually Converge? |
ICML |
code |
520 |
| End-to-End Recovery of Human Shape and Pose |
CVPR |
code |
502 |
| Taskonomy: Disentangling Task Transfer Learning |
CVPR |
code |
502 |
| Cascaded Pyramid Network for Multi-Person Pose Estimation |
CVPR |
code |
497 |
| Neural 3D Mesh Renderer |
CVPR |
code |
489 |
| Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs |
CVPR |
code |
489 |
| In-Place Activated BatchNorm for Memory-Optimized Training of DNNs |
CVPR |
code |
485 |
| The Unreasonable Effectiveness of Deep Features as a Perceptual Metric |
CVPR |
code |
447 |
| Frustum PointNets for 3D Object Detection From RGB-D Data |
CVPR |
code |
434 |
| The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks |
CVPR |
code |
416 |
| ICNet for Real-Time Semantic Segmentation on High-Resolution Images |
ECCV |
code |
415 |
| PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume |
CVPR |
code |
398 |
| Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++ |
CVPR |
code |
397 |
| Gibson Env: Real-World Perception for Embodied Agents |
CVPR |
code |
385 |
| Acquisition of Localization Confidence for Accurate Object Detection |
ECCV |
code |
384 |
| Noise2Noise: Learning Image Restoration without Clean Data |
ICML |
code |
370 |
| GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation |
CVPR |
code |
359 |
| GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose |
CVPR |
code |
359 |
| A Style-Aware Content Loss for Real-time HD Style Transfer |
ECCV |
code |
349 |
| Soccer on Your Tabletop |
CVPR |
code |
338 |
| Pyramid Stereo Matching Network |
CVPR |
code |
335 |
| Neural Baby Talk |
CVPR |
code |
332 |
| License Plate Detection and Recognition in Unconstrained Scenarios |
ECCV |
code |
326 |
| Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors |
CVPR |
code |
326 |
| Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images |
ECCV |
code |
323 |
| Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning |
CVPR |
code |
317 |
| Fast End-to-End Trainable Guided Filter |
CVPR |
code |
312 |
| Deep Clustering for Unsupervised Learning of Visual Features |
ECCV |
code |
302 |
| Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs |
CVPR |
code |
294 |
| Neural Relational Inference for Interacting Systems |
ICML |
code |
289 |
| Adversarially Regularized Autoencoders |
ICML |
code |
282 |
| Learning to Adapt Structured Output Space for Semantic Segmentation |
CVPR |
code |
280 |
| Convolutional Neural Networks With Alternately Updated Clique |
CVPR |
code |
272 |
| Learning to Segment Every Thing |
CVPR |
code |
269 |
| Supervising Unsupervised Learning |
NIPS |
code |
262 |
| LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation |
CVPR |
code |
261 |
| Bilinear Attention Networks |
NIPS |
code |
258 |
| ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation |
ECCV |
code |
254 |
| An intriguing failing of convolutional neural networks and the CoordConv solution |
NIPS |
code |
249 |
| End-to-End Learning of Motion Representation for Video Understanding |
CVPR |
code |
238 |
| Image Super-Resolution Using Very Deep Residual Channel Attention Networks |
ECCV |
code |
234 |
| Iterative Visual Reasoning Beyond Convolutions |
CVPR |
code |
228 |
| Semi-Parametric Image Synthesis |
CVPR |
code |
226 |
| Compressed Video Action Recognition |
CVPR |
code |
225 |
| Style Aggregated Network for Facial Landmark Detection |
CVPR |
code |
223 |
| Pose-Robust Face Recognition via Deep Residual Equivariant Mapping |
CVPR |
code |
220 |
| Multi-Content GAN for Few-Shot Font Style Transfer |
CVPR |
code |
218 |
| GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models |
ICML |
code |
214 |
| Referring Relationships |
CVPR |
code |
210 |
| MoCoGAN: Decomposing Motion and Content for Video Generation |
CVPR |
code |
205 |
| Latent Alignment and Variational Attention |
NIPS |
code |
204 |
| LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image |
CVPR |
code |
202 |
| Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs |
CVPR |
code |
197 |
| An End-to-End TextSpotter With Explicit Alignment and Attention |
CVPR |
code |
195 |
| DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks |
CVPR |
code |
189 |
| SPLATNet: Sparse Lattice Networks for Point Cloud Processing |
CVPR |
code |
188 |
| Attentive Generative Adversarial Network for Raindrop Removal From a Single Image |
CVPR |
code |
186 |
| Single View Stereo Matching |
CVPR |
code |
182 |
| MegaDepth: Learning Single-View Depth Prediction From Internet Photos |
CVPR |
code |
181 |
| ECO: Efficient Convolutional Network for Online Video Understanding |
ECCV |
code |
180 |
| Unsupervised Feature Learning via Non-Parametric Instance Discrimination |
CVPR |
code |
180 |
| ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing |
CVPR |
code |
179 |
| Video Based Reconstruction of 3D People Models |
CVPR |
code |
179 |
| Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks |
CVPR |
code |
178 |
| Learning Category-Specific Mesh Reconstruction from Image Collections |
ECCV |
code |
176 |
| Realistic Evaluation of Deep Semi-Supervised Learning Algorithms |
NIPS |
code |
175 |
| BSN: Boundary Sensitive Network for Temporal Action Proposal Generation |
ECCV |
code |
175 |
| Group Normalization |
ECCV |
code |
175 |
| Real-Time Seamless Single Shot 6D Object Pose Prediction |
CVPR |
code |
174 |
| MVSNet: Depth Inference for Unstructured Multi-view Stereo |
ECCV |
code |
174 |
| Neural Motifs: Scene Graph Parsing With Global Context |
CVPR |
code |
171 |
| Learning a Single Convolutional Super-Resolution Network for Multiple Degradations |
CVPR |
code |
169 |
| Optimizing Video Object Detection via a Scale-Time Lattice |
CVPR |
code |
168 |
| MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network |
ECCV |
code |
167 |
| Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns |
CVPR |
code |
166 |
| Weakly Supervised Instance Segmentation Using Class Peak Response |
CVPR |
code |
166 |
| PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image |
CVPR |
code |
164 |
| Residual Dense Network for Image Super-Resolution |
CVPR |
code |
163 |
| Embodied Question Answering |
CVPR |
code |
162 |
| Evolved Policy Gradients |
NIPS |
code |
160 |
| Camera Style Adaptation for Person Re-Identification |
CVPR |
code |
159 |
| Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer |
CVPR |
code |
159 |
| Scale-Recurrent Network for Deep Image Deblurring |
CVPR |
code |
159 |
| Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction |
CVPR |
code |
158 |
| Relational recurrent neural networks |
NIPS |
code |
157 |
| Densely Connected Pyramid Dehazing Network |
CVPR |
code |
155 |
| Image Inpainting for Irregular Holes Using Partial Convolutions |
ECCV |
code |
153 |
| SO-Net: Self-Organizing Network for Point Cloud Analysis |
CVPR |
code |
152 |
| Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling |
CVPR |
code |
152 |
| ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices |
CVPR |
code |
152 |
| DenseASPP for Semantic Segmentation in Street Scenes |
CVPR |
code |
151 |
| Facelet-Bank for Fast Portrait Manipulation |
CVPR |
code |
150 |
| Self-Imitation Learning |
ICML |
code |
145 |
| Graph R-CNN for Scene Graph Generation |
ECCV |
code |
144 |
| A Closer Look at Spatiotemporal Convolutions for Action Recognition |
CVPR |
code |
143 |
| Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation |
CVPR |
code |
143 |
| Quantized Densely Connected U-Nets for Efficient Landmark Localization |
ECCV |
code |
143 |
| Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining |
ECCV |
code |
142 |
| Two-Stream Convolutional Networks for Dynamic Texture Synthesis |
CVPR |
code |
141 |
| Integral Human Pose Regression |
ECCV |
code |
141 |
| Adaptive Affinity Fields for Semantic Segmentation |
ECCV |
code |
141 |
| LSTM Pose Machines |
CVPR |
code |
141 |
| Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships |
CVPR |
code |
140 |
| Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform |
CVPR |
code |
139 |
| Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification |
CVPR |
code |
137 |
| Learning to Compare: Relation Network for Few-Shot Learning |
CVPR |
code |
135 |
| CosFace: Large Margin Cosine Loss for Deep Face Recognition |
CVPR |
code |
135 |
| Deep Depth Completion of a Single RGB-D Image |
CVPR |
code |
134 |
| Deep Back-Projection Networks for Super-Resolution |
CVPR |
code |
132 |
| Context Embedding Networks |
CVPR |
code |
131 |
| Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics |
CVPR |
code |
131 |
| Perturbative Neural Networks |
CVPR |
code |
130 |
| Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis |
ICML |
code |
129 |
| Fast and Accurate Online Video Object Segmentation via Tracking Parts |
CVPR |
code |
129 |
| Nonlinear 3D Face Morphable Model |
CVPR |
code |
128 |
| BodyNet: Volumetric Inference of 3D Human Body Shapes |
ECCV |
code |
126 |
| 3D-CODED: 3D Correspondences by Deep Deformation |
ECCV |
code |
125 |
| DeepMVS: Learning Multi-View Stereopsis |
CVPR |
code |
125 |
| Hierarchical Imitation and Reinforcement Learning |
ICML |
code |
124 |
| Domain Adaptive Faster R-CNN for Object Detection in the Wild |
CVPR |
code |
123 |
| L4: Practical loss-based stepsize adaptation for deep learning |
NIPS |
code |
123 |
| A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts |
CVPR |
code |
122 |
| Recurrent Relational Networks |
NIPS |
code |
121 |
| Gated Path Planning Networks |
ICML |
code |
121 |
| PSANet: Point-wise Spatial Attention Network for Scene Parsing |
ECCV |
code |
121 |
| Rethinking Feature Distribution for Loss Functions in Image Classification |
CVPR |
code |
120 |
| Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network |
CVPR |
code |
118 |
| FOTS: Fast Oriented Text Spotting With a Unified Network |
CVPR |
code |
118 |
| ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes |
ECCV |
code |
117 |
| PU-Net: Point Cloud Upsampling Network |
CVPR |
code |
117 |
| PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning |
CVPR |
code |
117 |
| Long-term Tracking in the Wild: a Benchmark |
ECCV |
code |
116 |
| Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene |
CVPR |
code |
114 |
| Repulsion Loss: Detecting Pedestrians in a Crowd |
CVPR |
code |
113 |
| Unsupervised Attention-guided Image-to-Image Translation |
NIPS |
code |
110 |
| Attention-based Deep Multiple Instance Learning |
ICML |
code |
109 |
| Learning Blind Video Temporal Consistency |
ECCV |
code |
109 |
| Noisy Natural Gradient as Variational Inference |
ICML |
code |
108 |
| End-to-End Weakly-Supervised Semantic Alignment |
CVPR |
code |
106 |
| Decoupled Networks |
CVPR |
code |
105 |
| LiDAR-Video Driving Dataset: Learning Driving Policies Effectively |
CVPR |
code |
104 |
| MAttNet: Modular Attention Network for Referring Expression Comprehension |
CVPR |
code |
104 |
| LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks |
ECCV |
code |
103 |
| FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors |
CVPR |
code |
100 |
| Deep Mutual Learning |
CVPR |
code |
100 |
| Macro-Micro Adversarial Network for Human Parsing |
ECCV |
code |
98 |
| ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans |
CVPR |
code |
97 |
| Learning Depth From Monocular Videos Using Direct Methods |
CVPR |
code |
97 |
| VITON: An Image-Based Virtual Try-On Network |
CVPR |
code |
95 |
| Cascade R-CNN: Delving Into High Quality Object Detection |
CVPR |
code |
93 |
| Learning Human-Object Interactions by Graph Parsing Neural Networks |
ECCV |
code |
93 |
| Future Frame Prediction for Anomaly Detection – A New Baseline |
CVPR |
code |
92 |
| Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence |
ECCV |
code |
92 |
| Tell Me Where to Look: Guided Attention Inference Network |
CVPR |
code |
91 |
| Neural Kinematic Networks for Unsupervised Motion Retargetting |
CVPR |
code |
90 |
| Learning SO(3) Equivariant Representations with Spherical CNNs |
ECCV |
code |
89 |
| One-Shot Unsupervised Cross Domain Translation |
NIPS |
code |
89 |
| Synthesizing Images of Humans in Unseen Poses |
CVPR |
code |
88 |
| Depth-aware CNN for RGB-D Segmentation |
ECCV |
code |
88 |
| Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights |
ECCV |
code |
88 |
| Knowledge Aided Consistency for Weakly Supervised Phrase Grounding |
CVPR |
code |
87 |
| CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes |
CVPR |
code |
87 |
| Neural Arithmetic Logic Units |
NIPS |
code |
87 |
| A PID Controller Approach for Stochastic Optimization of Deep Networks |
CVPR |
code |
87 |
| VITAL: VIsual Tracking via Adversarial Learning |
CVPR |
code |
86 |
| Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking |
CVPR |
code |
86 |
| Recurrent Pixel Embedding for Instance Grouping |
CVPR |
code |
85 |
| SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation |
CVPR |
code |
84 |
| Multi-Scale Location-Aware Kernel Representation for Object Detection |
CVPR |
code |
84 |
| Repeatability Is Not Enough: Learning Affine Regions via Discriminability |
ECCV |
code |
84 |
| “Zero-Shot” Super-Resolution Using Deep Internal Learning |
CVPR |
code |
84 |
| DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency |
ECCV |
code |
82 |
| Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction |
CVPR |
code |
80 |
| Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation |
ECCV |
code |
78 |
| Generalizing A Person Retrieval Model Hetero- and Homogeneously |
ECCV |
code |
78 |
| Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning |
CVPR |
code |
77 |
| Pairwise Confusion for Fine-Grained Visual Classification |
ECCV |
code |
77 |
| Learning to Reweight Examples for Robust Deep Learning |
ICML |
code |
76 |
| Improving Generalization via Scalable Neighborhood Component Analysis |
ECCV |
code |
76 |
| SparseMAP: Differentiable Sparse Structured Inference |
ICML |
code |
75 |
| PDE-Net: Learning PDEs from Data |
ICML |
code |
75 |
| Pose-Normalized Image Generation for Person Re-identification |
ECCV |
code |
75 |
| Disentangled Person Image Generation |
CVPR |
code |
75 |
| Learning to Navigate for Fine-grained Classification |
ECCV |
code |
74 |
| Superpixel Sampling Networks |
ECCV |
code |
74 |
| Shift-Net: Image Inpainting via Deep Feature Rearrangement |
ECCV |
code |
74 |
| 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation |
ECCV |
code |
74 |
| Ordinal Depth Supervision for 3D Human Pose Estimation |
CVPR |
code |
74 |
| Path-Level Network Transformation for Efficient Architecture Search |
ICML |
code |
73 |
| Diverse Image-to-Image Translation via Disentangled Representations |
ECCV |
code |
72 |
| Visual Feature Attribution Using Wasserstein GANs |
CVPR |
code |
72 |
| Real-World Anomaly Detection in Surveillance Videos |
CVPR |
code |
72 |
| Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval |
CVPR |
code |
72 |
| Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image |
ECCV |
code |
72 |
| Learning to Find Good Correspondences |
CVPR |
code |
72 |
| Learning Less Is More - 6D Camera Localization via 3D Surface Regression |
CVPR |
code |
72 |
| Object Level Visual Reasoning in Videos |
ECCV |
code |
71 |
| Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing |
CVPR |
code |
71 |
| Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration |
CVPR |
code |
71 |
| Fast and Accurate Single Image Super-Resolution via Information Distillation Network |
CVPR |
code |
71 |
| Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present |
CVPR |
code |
70 |
| Multi-Shot Pedestrian Re-Identification via Sequential Decision Making |
CVPR |
code |
70 |
| PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition |
CVPR |
code |
69 |
| Progressive Neural Architecture Search |
ECCV |
code |
68 |
| Generative Neural Machine Translation |
NIPS |
code |
68 |
| Learning Latent Super-Events to Detect Multiple Activities in Videos |
CVPR |
code |
67 |
| Generate to Adapt: Aligning Domains Using Generative Adversarial Networks |
CVPR |
code |
67 |
| Adversarial Feature Augmentation for Unsupervised Domain Adaptation |
CVPR |
code |
67 |
| Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking |
CVPR |
code |
67 |
| Pointwise Convolutional Neural Networks |
CVPR |
code |
67 |
| Optimizing the Latent Space of Generative Networks |
ICML |
code |
66 |
| Part-Aligned Bilinear Representations for Person Re-Identification |
ECCV |
code |
64 |
| Geometry-Aware Learning of Maps for Camera Localization |
CVPR |
code |
63 |
| Fighting Fake News: Image Splice Detection via Learned Self-Consistency |
ECCV |
code |
62 |
| Isolating Sources of Disentanglement in Variational Autoencoders |
NIPS |
code |
62 |
| Neural Program Synthesis from Diverse Demonstration Videos |
ICML |
code |
62 |
| Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation |
ECCV |
code |
61 |
| Rotation-Sensitive Regression for Oriented Scene Text Detection |
CVPR |
code |
61 |
| Human Semantic Parsing for Person Re-Identification |
CVPR |
code |
61 |
| Unsupervised Discovery of Object Landmarks as Structural Representations |
CVPR |
code |
61 |
| IQA: Visual Question Answering in Interactive Environments |
CVPR |
code |
60 |
| Hierarchical Long-term Video Prediction without Supervision |
ICML |
code |
60 |
| Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency |
ECCV |
code |
60 |
| Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning |
CVPR |
code |
59 |
| Neural Style Transfer via Meta Networks |
CVPR |
code |
59 |
| Frame-Recurrent Video Super-Resolution |
CVPR |
code |
58 |
| PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction |
ECCV |
code |
57 |
| CBAM: Convolutional Block Attention Module |
ECCV |
code |
57 |
| Decorrelated Batch Normalization |
CVPR |
code |
57 |
| Learning Conditioned Graph Structures for Interpretable Visual Question Answering |
NIPS |
code |
57 |
| Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition |
ECCV |
code |
57 |
| Leveraging Unlabeled Data for Crowd Counting by Learning to Rank |
CVPR |
code |
56 |
| Deep Marching Cubes: Learning Explicit Surface Representations |
CVPR |
code |
56 |
| Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation |
CVPR |
code |
56 |
| LF-Net: Learning Local Features from Images |
NIPS |
code |
55 |
| Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model |
ECCV |
code |
55 |
| Discriminability Objective for Training Descriptive Captions |
CVPR |
code |
54 |
| BlockDrop: Dynamic Inference Paths in Residual Networks |
CVPR |
code |
54 |
| Conditional Probability Models for Deep Image Compression |
CVPR |
code |
54 |
| Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation |
CVPR |
code |
54 |
| Learning towards Minimum Hyperspherical Energy |
NIPS |
code |
54 |
| DeepVS: A Deep Learning Based Video Saliency Prediction Approach |
ECCV |
code |
53 |
| Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting |
ECCV |
code |
52 |
| Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation |
CVPR |
code |
52 |
| Wasserstein Introspective Neural Networks |
CVPR |
code |
51 |
| SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis |
CVPR |
code |
51 |
| Self-produced Guidance for Weakly-supervised Object Localization |
ECCV |
code |
51 |
| Measuring abstract reasoning in neural networks |
ICML |
code |
51 |
| A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation |
NIPS |
code |
51 |
| RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials |
CVPR |
code |
51 |
| Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation |
ECCV |
code |
50 |
| Efficient end-to-end learning for quantizable representations |
ICML |
code |
50 |
| Visual Question Generation as Dual Task of Visual Question Answering |
CVPR |
code |
50 |
| Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam |
ICML |
code |
49 |
| Surface Networks |
CVPR |
code |
48 |
| Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions |
ICML |
code |
48 |
| Stacked Cross Attention for Image-Text Matching |
ECCV |
code |
48 |
| Actor and Observer: Joint Modeling of First and Third-Person Videos |
CVPR |
code |
48 |
| Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation |
CVPR |
code |
47 |
| Learning-based Video Motion Magnification |
ECCV |
code |
47 |
| Pose Partition Networks for Multi-Person Pose Estimation |
ECCV |
code |
47 |
| Neural Autoregressive Flows |
ICML |
code |
47 |
| Weakly- and Semi-Supervised Panoptic Segmentation |
ECCV |
code |
46 |
| Video Re-localization |
ECCV |
code |
46 |
| Real-time 'Actor-Critic' Tracking |
ECCV |
code |
46 |
| Black-box Adversarial Attacks with Limited Queries and Information |
ICML |
code |
46 |
| Hyperbolic Entailment Cones for Learning Hierarchical Embeddings |
ICML |
code |
46 |
| Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation |
CVPR |
code |
46 |
| Differentiable Compositional Kernel Learning for Gaussian Processes |
ICML |
code |
45 |
| Visualizing and Understanding Atari Agents |
ICML |
code |
45 |
| Image Manipulation with Perceptual Discriminators |
ECCV |
code |
45 |
| Learning Intrinsic Image Decomposition From Watching the World |
CVPR |
code |
45 |
| Overcoming Catastrophic Forgetting with Hard Attention to the Task |
ICML |
code |
44 |
| Learning Pose Specific Representations by Predicting Different Views |
CVPR |
code |
44 |
| Zero-Shot Object Detection |
ECCV |
code |
43 |
| Mean Field Multi-Agent Reinforcement Learning |
ICML |
code |
43 |
| Partial Adversarial Domain Adaptation |
ECCV |
code |
43 |
| Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation |
ECCV |
code |
43 |
| Robust Classification With Convolutional Prototype Learning |
CVPR |
code |
43 |
| SimplE Embedding for Link Prediction in Knowledge Graphs |
NIPS |
code |
42 |
| PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning |
ICML |
code |
42 |
| Learning to Blend Photos |
ECCV |
code |
42 |
| Mask-Guided Contrastive Attention Model for Person Re-Identification |
CVPR |
code |
41 |
| Link Prediction Based on Graph Neural Networks |
NIPS |
code |
41 |
| Generalisation in humans and deep neural networks |
NIPS |
code |
41 |
| Towards Binary-Valued Gates for Robust LSTM Training |
ICML |
code |
41 |
| Multi-scale Residual Network for Image Super-Resolution |
ECCV |
code |
41 |
| Fully Motion-Aware Network for Video Object Detection |
ECCV |
code |
41 |
| Interpretable Convolutional Neural Networks |
CVPR |
code |
40 |
| Generative Adversarial Perturbations |
CVPR |
code |
40 |
| The Sound of Pixels |
ECCV |
code |
40 |
| Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization |
CVPR |
code |
40 |
| Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance |
ECCV |
code |
40 |
| Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation |
NIPS |
code |
40 |
| Learning Warped Guidance for Blind Face Restoration |
ECCV |
code |
39 |
| Adversarial Complementary Learning for Weakly Supervised Object Localization |
CVPR |
code |
39 |
| Learning Semantic Representations for Unsupervised Domain Adaptation |
ICML |
code |
39 |
| Neural Architecture Search with Bayesian Optimisation and Optimal Transport |
NIPS |
code |
39 |
| Mutual Information Neural Estimation |
ICML |
code |
39 |
| NetGAN: Generating Graphs via Random Walks |
ICML |
code |
39 |
| Learning to Evaluate Image Captioning |
CVPR |
code |
38 |
| Hyperbolic Neural Networks |
NIPS |
code |
37 |
| Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation |
ECCV |
code |
37 |
| Adversarially Learned One-Class Classifier for Novelty Detection |
CVPR |
code |
37 |
| Disentangling by Factorising |
ICML |
code |
37 |
| Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples |
ICML |
code |
37 |
| Tangent Convolutions for Dense Prediction in 3D |
CVPR |
code |
37 |
| Few-Shot Image Recognition by Predicting Parameters From Activations |
CVPR |
code |
37 |
| Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer |
CVPR |
code |
37 |
| Generalizing to Unseen Domains via Adversarial Data Augmentation |
NIPS |
code |
36 |
| SeGAN: Segmenting and Generating the Invisible |
CVPR |
code |
36 |
| Graphical Generative Adversarial Networks |
NIPS |
code |
36 |
| PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference |
CVPR |
code |
36 |
| Gated Fusion Network for Single Image Dehazing |
CVPR |
code |
35 |
| Neural Code Comprehension: A Learnable Representation of Code Semantics |
NIPS |
code |
35 |
| Eye In-Painting With Exemplar Generative Adversarial Networks |
CVPR |
code |
35 |
| Deep One-Class Classification |
ICML |
code |
34 |
| Deep Regression Tracking with Shrinkage Loss |
ECCV |
code |
34 |
| Deflecting Adversarial Attacks With Pixel Deflection |
CVPR |
code |
34 |
| Learning Visual Question Answering by Bootstrapping Hard Attention |
ECCV |
code |
33 |
| Human-Centric Indoor Scene Synthesis Using Stochastic Grammar |
CVPR |
code |
33 |
| Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering |
CVPR |
code |
33 |
| CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise |
CVPR |
code |
33 |
| Speaker-Follower Models for Vision-and-Language Navigation |
NIPS |
code |
33 |
| Improving Shape Deformation in Unsupervised Image-to-Image Translation |
ECCV |
code |
33 |
| Learning Single-View 3D Reconstruction with Limited Pose Supervision |
ECCV |
code |
33 |
| 3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data |
NIPS |
code |
33 |
| Adversarial Logit Pairing |
NIPS |
code |
32 |
| Attention in Convolutional LSTM for Gesture Recognition |
NIPS |
code |
32 |
| Graph-Cut RANSAC |
CVPR |
code |
32 |
| Neural Guided Constraint Logic Programming for Program Synthesis |
NIPS |
code |
32 |
| Learning Dynamic Memory Networks for Object Tracking |
ECCV |
code |
32 |
| GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints |
ECCV |
code |
32 |
| A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks |
NIPS |
code |
32 |
| Flow-Grounded Spatial-Temporal Video Prediction from Still Images |
ECCV |
code |
32 |
| Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection |
ECCV |
code |
32 |
| On the Robustness of Semantic Segmentation Models to Adversarial Attacks |
CVPR |
code |
31 |
| Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning |
CVPR |
code |
31 |
| SketchyScene: Richly-Annotated Scene Sketches |
ECCV |
code |
31 |
| Deep Randomized Ensembles for Metric Learning |
ECCV |
code |
30 |
| Deep High Dynamic Range Imaging with Large Foreground Motions |
ECCV |
code |
30 |
| Revisiting Video Saliency: A Large-Scale Benchmark and a New Model |
CVPR |
code |
30 |
| Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning |
CVPR |
code |
30 |
| Deep Model-Based 6D Pose Refinement in RGB |
ECCV |
code |
30 |
| TOM-Net: Learning Transparent Object Matting From a Single Image |
CVPR |
code |
30 |
| Quaternion Convolutional Neural Networks |
ECCV |
code |
30 |
| Densely Connected Attention Propagation for Reading Comprehension |
NIPS |
code |
30 |
| A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising |
ECCV |
code |
30 |
| Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings |
ICML |
code |
29 |
| Video Rain Streak Removal by Multiscale Convolutional Sparse Coding |
CVPR |
code |
29 |
| Recurrent Scene Parsing With Perspective Understanding in the Loop |
CVPR |
code |
29 |
| Single Shot Scene Text Retrieval |
ECCV |
code |
29 |
| Toward Characteristic-Preserving Image-based Virtual Try-On Network |
ECCV |
code |
29 |
| Explainable Neural Computation via Stack Neural Module Networks |
ECCV |
code |
29 |
| Exploring Disentangled Feature Representation Beyond Face Identification |
CVPR |
code |
29 |
| Controllable Video Generation With Sparse Trajectories |
CVPR |
code |
28 |
| Layer-structured 3D Scene Inference via View Synthesis |
ECCV |
code |
28 |
| Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation |
ECCV |
code |
28 |
| PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection |
CVPR |
code |
28 |
| Learning Rich Features for Image Manipulation Detection |
CVPR |
code |
27 |
| Fast Video Object Segmentation by Reference-Guided Mask Propagation |
CVPR |
code |
27 |
| 3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration |
ECCV |
code |
27 |
| Who Let the Dogs Out? Modeling Dog Behavior From Visual Data |
CVPR |
code |
27 |
| EC-Net: an Edge-aware Point set Consolidation Network |
ECCV |
code |
27 |
| Interpretable Intuitive Physics Model |
ECCV |
code |
27 |
| Learning a Discriminative Feature Network for Semantic Segmentation |
CVPR |
code |
26 |
| Partial Transfer Learning With Selective Adversarial Networks |
CVPR |
code |
26 |
| Cross-Modal Deep Variational Hand Pose Estimation |
CVPR |
code |
26 |
| Between-Class Learning for Image Classification |
CVPR |
code |
26 |
| AON: Towards Arbitrarily-Oriented Text Recognition |
CVPR |
code |
26 |
| Conditional Image-to-Image Translation |
CVPR |
code |
25 |
| Learning Convolutional Networks for Content-Weighted Image Compression |
CVPR |
code |
25 |
| Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification |
CVPR |
code |
25 |
| Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries |
ECCV |
code |
25 |
| CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation |
CVPR |
code |
25 |
| Deep Texture Manifold for Ground Terrain Recognition |
CVPR |
code |
25 |
| Audio-Visual Event Localization in Unconstrained Videos |
ECCV |
code |
25 |
| First Order Generative Adversarial Networks |
ICML |
code |
25 |
| Visual Coreference Resolution in Visual Dialog using Neural Module Networks |
ECCV |
code |
25 |
| SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks |
CVPR |
code |
24 |
| Deep Reinforcement Learning of Marked Temporal Point Processes |
NIPS |
code |
24 |
| Explicit Inductive Bias for Transfer Learning with Convolutional Networks |
ICML |
code |
24 |
| LEGO: Learning Edge With Geometry All at Once by Watching Videos |
CVPR |
code |
24 |
| Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes |
ECCV |
code |
24 |
| Multi-Agent Diverse Generative Adversarial Networks |
CVPR |
code |
23 |
| Face Aging With Identity-Preserved Conditional Generative Adversarial Networks |
CVPR |
code |
23 |
| Learning to Separate Object Sounds by Watching Unlabeled Video |
ECCV |
code |
23 |
| Exploiting the Potential of Standard Convolutional Autoencoders for Image Restoration by Evolutionary Search |
ICML |
code |
23 |
| To Trust Or Not To Trust A Classifier |
NIPS |
code |
23 |
| Im2Flow: Motion Hallucination From Static Images for Action Recognition |
CVPR |
code |
22 |
| ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing |
CVPR |
code |
22 |
| Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning |
CVPR |
code |
22 |
| Anonymous Walk Embeddings |
ICML |
code |
22 |
| Learning to Multitask |
NIPS |
code |
22 |
| CondenseNet: An Efficient DenseNet Using Learned Group Convolutions |
CVPR |
code |
22 |
| HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN |
CVPR |
code |
22 |
| Hierarchical Relational Networks for Group Activity Recognition and Retrieval |
ECCV |
code |
22 |
| Collaborative and Adversarial Network for Unsupervised Domain Adaptation |
CVPR |
code |
22 |
| Geometry-Aware Scene Text Detection With Instance Transformation Network |
CVPR |
code |
22 |
| Learning to Promote Saliency Detectors |
CVPR |
code |
21 |
| CSGNet: Neural Shape Parser for Constructive Solid Geometry |
CVPR |
code |
21 |
| Local Spectral Graph Convolution for Point Set Feature Learning |
ECCV |
code |
21 |
| HiDDeN: Hiding Data with Deep Networks |
ECCV |
code |
21 |
| GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning |
CVPR |
code |
20 |
| Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal |
CVPR |
code |
20 |
| Fully-Convolutional Point Networks for Large-Scale Point Clouds |
ECCV |
code |
20 |
| Learning Superpixels With Segmentation-Aware Affinity Loss |
CVPR |
code |
20 |
| Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks |
CVPR |
code |
20 |
| Crowd Counting With Deep Negative Correlation Learning |
CVPR |
code |
20 |
| Dimensionality-Driven Learning with Noisy Labels |
ICML |
code |
20 |
| Objects that Sound |
ECCV |
code |
20 |
| Deep Expander Networks: Efficient Deep Networks from Graph Theory |
ECCV |
code |
19 |
| Low-Shot Learning With Large-Scale Diffusion |
CVPR |
code |
19 |
| Low-Shot Learning With Imprinted Weights |
CVPR |
code |
19 |
| Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery |
CVPR |
code |
19 |
| Learning Descriptor Networks for 3D Shape Synthesis and Analysis |
CVPR |
code |
19 |
| Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders |
ECCV |
code |
19 |
| CTAP: Complementary Temporal Action Proposal Generation |
ECCV |
code |
18 |
| DVAE#: Discrete Variational Autoencoders with Relaxed Boltzmann Priors |
NIPS |
code |
18 |
| Conditional Image-Text Embedding Networks |
ECCV |
code |
18 |
| EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images |
CVPR |
code |
18 |
| Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points |
CVPR |
code |
18 |
| Bayesian Optimization of Combinatorial Structures |
ICML |
code |
18 |
| FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis |
CVPR |
code |
18 |
| Learning Type-Aware Embeddings for Fashion Compatibility |
ECCV |
code |
17 |
| Sliced Wasserstein Distance for Learning Gaussian Mixture Models |
CVPR |
code |
17 |
| Revisiting Deep Intrinsic Image Decompositions |
CVPR |
code |
17 |
| A Spectral Approach to Gradient Estimation for Implicit Distributions |
ICML |
code |
17 |
| Hierarchical Novelty Detection for Visual Object Recognition |
CVPR |
code |
17 |
| Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies |
CVPR |
code |
17 |
| Learning Generative ConvNets via Multi-Grid Modeling and Sampling |
CVPR |
code |
17 |
| Learning 3D Shape Completion From Laser Scan Data With Weak Supervision |
CVPR |
code |
17 |
| Triplet Loss in Siamese Network for Object Tracking |
ECCV |
code |
17 |
| Adversarial Attack on Graph Structured Data |
ICML |
code |
17 |
| Arbitrary Style Transfer With Deep Feature Reshuffle |
CVPR |
code |
17 |
| Visual Question Reasoning on General Dependency Tree |
CVPR |
code |
17 |
| Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition |
ECCV |
code |
16 |
| Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks |
NIPS |
code |
16 |
| Coded Sparse Matrix Multiplication |
ICML |
code |
16 |
| Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment |
CVPR |
code |
16 |
| Recovering 3D Planes from a Single Image via Convolutional Neural Networks |
ECCV |
code |
16 |
| SegStereo: Exploiting Semantic Information for Disparity Estimation |
ECCV |
code |
16 |
| Functional Gradient Boosting based on Residual Network Perception |
ICML |
code |
16 |
| NAG: Network for Adversary Generation |
CVPR |
code |
16 |
| Generative Probabilistic Novelty Detection with Adversarial Autoencoders |
NIPS |
code |
16 |
| Hashing as Tie-Aware Learning to Rank |
CVPR |
code |
15 |
| Pose Proposal Networks |
ECCV |
code |
15 |
| Convolutional Sequence to Sequence Model for Human Dynamics |
CVPR |
code |
15 |
| Joint Pose and Expression Modeling for Facial Expression Recognition |
CVPR |
code |
15 |
| Grounding Referring Expressions in Images by Variational Context |
CVPR |
code |
15 |
| Rethinking the Form of Latent States in Image Captioning |
ECCV |
code |
15 |
| Open Set Domain Adaptation by Backpropagation |
ECCV |
code |
15 |
| Neural Sign Language Translation |
CVPR |
code |
15 |
| SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters |
ECCV |
code |
15 |
| Efficient Neural Audio Synthesis |
ICML |
code |
15 |
| Deep Learning Under Privileged Information Using Heteroscedastic Dropout |
CVPR |
code |
14 |
| Image Transformer |
ICML |
code |
14 |
| Learning to Understand Image Blur |
CVPR |
code |
14 |
| Learning and Using the Arrow of Time |
CVPR |
code |
14 |
| Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints |
CVPR |
code |
14 |
| Learning to Forecast and Refine Residual Motion for Image-to-Video Generation |
ECCV |
code |
14 |
| Multi-Scale Weighted Nuclear Norm Image Restoration |
CVPR |
code |
14 |
| Synthesizing Robust Adversarial Examples |
ICML |
code |
13 |
| Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data |
ECCV |
code |
13 |
| Assessing Generative Models via Precision and Recall |
NIPS |
code |
13 |
| Deep Diffeomorphic Transformer Networks |
CVPR |
code |
13 |
| Learning by Asking Questions |
CVPR |
code |
13 |
| Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection |
CVPR |
code |
13 |
| Variational Autoencoders for Deforming 3D Mesh Models |
CVPR |
code |
13 |
| Min-Entropy Latent Model for Weakly Supervised Object Detection |
CVPR |
code |
13 |
| Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering |
CVPR |
code |
13 |
| Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace |
ICML |
code |
13 |
| Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition |
CVPR |
code |
13 |
| Finding Influential Training Samples for Gradient Boosted Decision Trees |
ICML |
code |
13 |
| Gesture Recognition: Focus on the Hands |
CVPR |
code |
12 |
| Cross-View Image Synthesis Using Conditional GANs |
CVPR |
code |
12 |
| Joint Optimization Framework for Learning With Noisy Labels |
CVPR |
code |
12 |
| Future Person Localization in First-Person Videos |
CVPR |
code |
12 |
| AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos |
ECCV |
code |
12 |
| Learning Transferable Architectures for Scalable Image Recognition |
CVPR |
code |
12 |
| Clipped Action Policy Gradient |
ICML |
code |
12 |
| Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation |
CVPR |
code |
12 |
| Decouple Learning for Parameterized Image Operators |
ECCV |
code |
12 |
| Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction |
ICML |
code |
12 |
| Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models |
NIPS |
code |
12 |
| AMNet: Memorability Estimation With Attention |
CVPR |
code |
12 |
| Adversarial Time-to-Event Modeling |
ICML |
code |
12 |
| Reversible Recurrent Neural Networks |
NIPS |
code |
12 |
| Human Pose Estimation With Parsing Induced Learner |
CVPR |
code |
11 |
| ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking |
ECCV |
code |
11 |
| A Joint Sequence Fusion Model for Video Question Answering and Retrieval |
ECCV |
code |
11 |
| Learning Face Age Progression: A Pyramid Architecture of GANs |
CVPR |
code |
11 |
| Robust Physical-World Attacks on Deep Learning Visual Classification |
CVPR |
code |
11 |
| High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach |
ICML |
code |
11 |
| Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory |
ICML |
code |
11 |
| Multimodal Explanations: Justifying Decisions and Pointing to the Evidence |
CVPR |
code |
11 |
| Accelerating Natural Gradient with Higher-Order Invariance |
ICML |
code |
11 |
| Hierarchical Multi-Label Classification Networks |
ICML |
code |
11 |
| Convolutional Image Captioning |
CVPR |
code |
11 |
| Boosting Domain Adaptation by Discovering Latent Domains |
CVPR |
code |
11 |
| Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks |
CVPR |
code |
10 |
| PacGAN: The power of two samples in generative adversarial networks |
NIPS |
code |
10 |
| Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification |
CVPR |
code |
10 |
| End-to-End Incremental Learning |
ECCV |
code |
10 |
| Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation |
CVPR |
code |
10 |
| On GANs and GMMs |
NIPS |
code |
10 |
| Salient Object Detection Driven by Fixation Prediction |
CVPR |
code |
9 |
| Semantic Video Segmentation by Gated Recurrent Flow Propagation |
CVPR |
code |
9 |
| Constraint-Aware Deep Neural Network Compression |
ECCV |
code |
9 |
| Statistically-motivated Second-order Pooling |
ECCV |
code |
9 |
| Excitation Backprop for RNNs |
CVPR |
code |
9 |
| Analyzing Uncertainty in Neural Machine Translation |
ICML |
code |
9 |
| Learning Dynamics of Linear Denoising Autoencoders |
ICML |
code |
9 |
| Saliency Detection in 360° Videos |
ECCV |
code |
9 |
| Density Adaptive Point Set Registration |
CVPR |
code |
9 |
| Decoupled Parallel Backpropagation with Convergence Guarantee |
ICML |
code |
9 |
| Classification from Pairwise Similarity and Unlabeled Data |
ICML |
code |
9 |
| oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis |
ICML |
code |
9 |
| Modeling Sparse Deviations for Compressed Sensing using Generative Models |
ICML |
code |
9 |
| Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction |
CVPR |
code |
9 |
| Towards Open-Set Identity Preserving Face Synthesis |
CVPR |
code |
9 |
| Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras |
CVPR |
code |
8 |
| BourGAN: Generative Networks with Metric Embeddings |
NIPS |
code |
8 |
| Fast Information-theoretic Bayesian Optimisation |
ICML |
code |
8 |
| Deep Variational Reinforcement Learning for POMDPs |
ICML |
code |
8 |
| Specular-to-Diffuse Translation for Multi-View Reconstruction |
ECCV |
code |
8 |
| Dynamic Conditional Networks for Few-Shot Learning |
ECCV |
code |
8 |
| Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering |
CVPR |
code |
8 |
| High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs |
CVPR |
code |
8 |
| Deep Defense: Training DNNs with Improved Adversarial Robustness |
NIPS |
code |
8 |
| Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations |
ICML |
code |
8 |
| Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based Modeling |
ECCV |
code |
7 |
| Non-metric Similarity Graphs for Maximum Inner Product Search |
NIPS |
code |
7 |
| Towards Realistic Predictors |
ECCV |
code |
7 |
| Deep Non-Blind Deconvolution via Generalized Low-Rank Approximation |
NIPS |
code |
7 |
| Don’t Just Assume Look and Answer: Overcoming Priors for Visual Question Answering |
CVPR |
code |
7 |
| Learning Dual Convolutional Neural Networks for Low-Level Vision |
CVPR |
code |
7 |
| The Mirage of Action-Dependent Baselines in Reinforcement Learning |
ICML |
code |
7 |
| DVQA: Understanding Data Visualizations via Question Answering |
CVPR |
code |
7 |
| A Two-Step Disentanglement Method |
CVPR |
code |
7 |
| Detecting and Correcting for Label Shift with Black Box Predictors |
ICML |
code |
7 |
| Conditional Prior Networks for Optical Flow |
ECCV |
code |
7 |
| Generative Adversarial Learning Towards Fast Weakly Supervised Detection |
CVPR |
code |
7 |
| Adversarial Learning with Local Coordinate Coding |
ICML |
code |
7 |
| Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks |
CVPR |
code |
7 |
| AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks |
CVPR |
code |
7 |
| Learning to Explain: An Information-Theoretic Perspective on Model Interpretation |
ICML |
code |
7 |
| Banach Wasserstein GAN |
NIPS |
code |
7 |
| Gradually Updated Neural Networks for Large-Scale Image Recognition |
ICML |
code |
7 |
| Learning Steady-States of Iterative Algorithms over Graphs |
ICML |
code |
7 |
| Progressive Attention Guided Recurrent Network for Salient Object Detection |
CVPR |
code |
7 |
| Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains |
CVPR |
code |
6 |
| Unsupervised holistic image generation from key local patches |
ECCV |
code |
6 |
| Inner Space Preserving Generative Pose Machine |
ECCV |
code |
6 |
| Bilevel Programming for Hyperparameter Optimization and Meta-Learning |
ICML |
code |
6 |
| Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition |
CVPR |
code |
6 |
| Breaking the Activation Function Bottleneck through Adaptive Parameterization |
NIPS |
code |
6 |
| Ultra Large-Scale Feature Selection using Count-Sketches |
ICML |
code |
6 |
| Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks |
CVPR |
code |
6 |
| Orthogonally Decoupled Variational Gaussian Processes |
NIPS |
code |
6 |
| Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design |
ICML |
code |
6 |
| A Modulation Module for Multi-task Learning with Applications in Image Retrieval |
ECCV |
code |
6 |
| A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos |
CVPR |
code |
6 |
| Towards Effective Low-Bitwidth Convolutional Neural Networks |
CVPR |
code |
5 |
| Disentangling Factors of Variation by Mixing Them |
CVPR |
code |
5 |
| Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior |
ECCV |
code |
5 |
| Learning Longer-term Dependencies in RNNs with Auxiliary Losses |
ICML |
code |
5 |
| Contour Knowledge Transfer for Salient Object Detection |
ECCV |
code |
5 |
| HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning |
ECCV |
code |
5 |
| Sidekick Policy Learning for Active Visual Exploration |
ECCV |
code |
5 |
| Learning to Localize Sound Source in Visual Scenes |
CVPR |
code |
5 |
| Neural Architecture Optimization |
NIPS |
code |
5 |
| COLA: Decentralized Linear Learning |
NIPS |
code |
5 |
| Diverse and Coherent Paragraph Generation from Images |
ECCV |
code |
5 |
| DRACO: Byzantine-resilient Distributed Training via Redundant Gradients |
ICML |
code |
5 |
| Inter and Intra Topic Structure Learning with Word Embeddings |
ICML |
code |
5 |
| Estimating the Success of Unsupervised Image to Image Translation |
ECCV |
code |
5 |
| Dynamic-Structured Semantic Propagation Network |
CVPR |
code |
5 |
| The Description Length of Deep Learning models |
NIPS |
code |
5 |
| Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving |
ECCV |
code |
5 |
| Blind Justice: Fairness with Encrypted Sensitive Attributes |
ICML |
code |
5 |
| Transfer Learning via Learning to Transfer |
ICML |
code |
5 |
| Deepcode: Feedback Codes via Deep Learning |
NIPS |
code |
4 |
| Configurable Markov Decision Processes |
ICML |
code |
4 |
| A Framework for Evaluating 6-DOF Object Trackers |
ECCV |
code |
4 |
| Differentially Private Database Release via Kernel Mean Embeddings |
ICML |
code |
4 |
| Recognizing Human Actions as the Evolution of Pose Estimation Maps |
CVPR |
code |
4 |
| Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images |
CVPR |
code |
4 |
| DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map |
CVPR |
code |
4 |
| Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification |
ECCV |
code |
4 |
| Tracking Emerges by Colorizing Videos |
ECCV |
code |
4 |
| Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes |
ECCV |
code |
4 |
| Inference Suboptimality in Variational Autoencoders |
ICML |
code |
4 |
| Black Box FDR |
ICML |
code |
4 |
| Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence |
CVPR |
code |
4 |
| Quadrature-based features for kernel approximation |
NIPS |
code |
4 |
| Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking |
ECCV |
code |
4 |
| Transferable Adversarial Perturbations |
ECCV |
code |
4 |
| Single Image Water Hazard Detection using FCN with Reflection Attention Units |
ECCV |
code |
4 |
| Multimodal Generative Models for Scalable Weakly-Supervised Learning |
NIPS |
code |
4 |
| Importance Weighted Transfer of Samples in Reinforcement Learning |
ICML |
code |
3 |
| Feature Generating Networks for Zero-Shot Learning |
CVPR |
code |
3 |
| DICOD: Distributed Convolutional Coordinate Descent for Convolutional Sparse Coding |
ICML |
code |
3 |
| CapProNet: Deep Feature Learning via Orthogonal Projections onto Capsule Subspaces |
NIPS |
code |
3 |
| Bidirectional Retrieval Made Simple |
CVPR |
code |
3 |
| Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages |
NIPS |
code |
3 |
| A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping |
CVPR |
code |
3 |
| Spatially-Adaptive Filter Units for Deep Neural Networks |
CVPR |
code |
3 |
| Learning to Branch |
ICML |
code |
3 |
| Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives |
NIPS |
code |
3 |
| Lifelong Learning via Progressive Distillation and Retrospection |
ECCV |
code |
3 |
| CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition |
CVPR |
code |
3 |
| Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care |
ICML |
code |
3 |
| Learning Answer Embeddings for Visual Question Answering |
CVPR |
code |
3 |
| Information Constraints on Auto-Encoding Variational Bayes |
NIPS |
code |
3 |
| Parallel Bayesian Network Structure Learning |
ICML |
code |
3 |
| Ring Loss: Convex Feature Normalization for Face Recognition |
CVPR |
code |
3 |
| Teaching Categories to Human Learners With Visual Explanations |
CVPR |
code |
3 |
| Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization |
ICML |
code |
3 |
| Deep Burst Denoising |
ECCV |
code |
3 |
| Convergent Tree Backup and Retrace with Function Approximation |
ICML |
code |
3 |
| Gaze Prediction in Dynamic 360° Immersive Videos |
CVPR |
code |
3 |
| Statistical Recurrent Models on Manifold valued Data |
NIPS |
code |
3 |
| End-to-End Flow Correlation Tracking With Spatial-Temporal Attention |
CVPR |
code |
3 |