HRNet-Applications-Collection
HRNet-Applications-Collection copied to clipboard
A collection of HRNet applications (Please feel freely add your applications if not included)
A collection of HRNet applications
(Please feel freely add your applications if not included)
Classification, segmentation and detection
ImageNet classification
Deep High-Resolution Representation Learning for Visual Recognition. Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, and Bin Xiao. TPAMI. 2020. code
Semantic segmentation
Deep High-Resolution Representation Learning for Visual Recognition. Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, and Bin Xiao. TPAMI. 2020. code
Hierarchical Multi-Scale Attention for Semantic Segmentation. Andrew Tao, Karan Sapra, and Bryan Catanzaro: CoRR abs/2005.10821 (2020). Rank #1, cityscapes benchmark
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation. John Lambert, Zhuang Liu, Ozan Sener, James Hays, and Vladlen Koltun. CVPR 2020. code
Object-Contextual Representations for Semantic Segmentation. Yuhui Yuan, Xilin Chen, Jingdong Wang. ECCV 2020. code, code
Disentangled Non-Local Neural Networks. Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu. CoRR abs/2006.06668 (2020)
Instance segmentation and panoptic segmentation
Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation. Bowen Cheng, Maxwell D. Collins, Yukun Zhu, Ting Liu, Thomas S. Huang, Hartwig Adam, Liang-Chieh Chen. CVPR 2020. The winner of Mapillary Vistas Panoptic Segmentation Task, COCO + Mapillary Joint Recognition Challenge Workshop at ICCV 2019. code
1st Place Solutions for OpenImage2019 - Object Detection and Instance Segmentation. Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang. CoRR abs/2003.07557 (2020)
Object detection
Deep High-Resolution Representation Learning for Visual Recognition. Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, and Bin Xiao. TPAMI. 2020. code on MMDetection, code on Mask RCNN, code with FCOS
1st Place Solutions of Waymo Open Dataset Challenge 2020: 2D Object Detection Track. Zehao Huang, Zehui Chen, Qiaofei Li, Hongkai Zhang and Naiyan Wang. CVPRW 2020.
CenterNet: Keypoint Triplets for Object Detection. Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, Qi Tian. ICCV 2019.
FCOS: Fully Convolutional One-Stage Object Detection. Zhi Tian, Chunhua Shen, Hao Chen, and Tong He. ICCV 2019. code
Human-centric vision
Human pose estimation
Deep High-Resolution Representation Learning for Human Pose Estimation. Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang. CVPR 2019. code video
Distribution-Aware Coordinate Representation for Human Pose Estimation. Feng Zhang, Xiatian Zhu, Hanbin Dai, Mao Ye, and Ce Zhu. CVPR 2020. code
The Devil Is in the Details: Delving Into Unbiased Data Processing for Human Pose Estimation. Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang. CVPR 2020. code
Video pose estimation
Learning Temporal Pose Estimation from Sparsely-Labeled Videos. Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani. NeurIPS 2019. code
3D human pose estimation
Cascaded deep monocular 3D human pose estimation with evolutionary training data. Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng. CoRR abs/2006.07778 (2020)
Motion Guided 3D Pose Estimation from Videos. Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin. CoRR abs/2004.13985 (2020)
Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild. Umar Iqbal, Pavlo Molchanov, and Jan Kautz. CVPR 2020
Pedestrian Detection
Pedestrian Detection: The Elephant In The Room. Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao. CoRR abs/2003.08799 (2020). code
Face alignment
Deep High-Resolution Representation Learning for Visual Recognition. Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, and Bin Xiao. TPAMI. 2020. code
Face recognition
FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition. Jing Yang, Adrian Bulat, Georgios Tzimiropoulos. AAAI 2020.
Sign language recognition
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition. Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li. AAAI 2020.
Multi-Object Tracking
A Simple Baseline for Multi-Object Tracking. Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu. CoRR abs/2004.01888 (2020). code
Fashion Image Retrieval
Which Is Plagiarism: Fashion Image Retrieval Based on Regional Representation for Design Protection. Yining Lang, Yuan He, Fan Yang, Jianfeng Dong, Hui Xue. CVPR 2020.
Fine-grained visual categorization
Semi-Supervised Recognition under a Noisy and Fine-grained Dataset. Cheng Cui, Zhi Ye, Yangxi Li, Xinjian Li, Min Yang, Kai Wei, Bing Dai, Yanmei Zhao, Zhongji Liu, Rong Pang. CoRR abs/2006.10702 (2020). code
Pretraing
Learning High-Resolution Domain-Specific Representations with a GAN Generator. Danil Galeev, Konstantin Sofiiuk, Danila Rukhovich, Mikhail Romanov, Olga Barinova, Anton Konushin. CoRR abs/2006.10451 (2020)
Table detection
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents. Devashish Prasad, Ayan Gadpal, Kshitij Kapadni, Manish Visave, Kavita Sultanpure. CVPRW 2020. code
Computational photography
Foreground-aware Semantic Representations for Image Harmonization. Konstantin Sofiiuk, Polina Popenova, Anton Konushin. CoRR abs/2006.00809 (2020). code
High-Resolution Network for Photorealistic Style Transfer. Ming Li, Chunyang Ye, Wei Li. CoRR abs/1904.11617 (2019). code
Progressive Image Inpainting with Full-Resolution Residual Network. Zongyu Guo, Zhibo Chen, Tao Yu, Jiale Chen, Sen Liu. ACM Multimedia 2019: 2496-2504. code
NTIRE 2019 Challenge on Image Enhancement: Methods and Results. CVPR Workshops 2019. The winner, the Mt.Stars team, adopted the HRNet.
6-DoF Pose Estimation
Neural Mesh Refiner for 6-DoF Pose Estimation. Di Wu, Yihao Chen, Xianbiao Qi, Yongjian Yu, Weixuan Chen, Rong Xiao. CoRR abs/2003.07561 (2020). code
Co-Segmentation
Deep Object Co-Segmentation via Spatial-Semantic Network Modulation. Kaihua Zhang, Jin Chen, Bo Liu, Qingshan Liu. AAAI 2020.
DL platform
Baidu PaddlePaddle. PaddleSeg model_zoo PaddleSeg HRNet tutorial
GLUON-CV HRNet