awesome-visual-question-answering icon indicating copy to clipboard operation
awesome-visual-question-answering copied to clipboard

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

Awesome Visual Question Answering:Awesome

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

Contributing

Please feel free to send me pull requests or email ([email protected]) to add links. Markdown format:

- [Paper Name](link) - Author 1 et al, **Conference Year**. [[code]](link)

Change Log

  • Mar.3rd,2019 The First version released.

Table of Contents

  • Contributing
  • Change Log
  • Table of Contents
  • Papers
    • Survey
    • 2022
      • ACL 2022
      • CVPR 2022
      • AAAI 2022
      • IJCAI 2022
    • 2021
      • NeurIPS 2021
      • EMNLP 2021
      • ICCV 2021
      • ACL 2021
      • SIGIR 2021
      • CVPR 2021
      • ICLR 2021
      • NAACL-HLT 2021
      • AAAI 2021
    • 2020
      • EMNLP 2020
      • NeurIPS 2020
      • ECCV 2020
      • CVPR 2020
      • ACL 2020
      • WACV 2020
      • AAAI 2020
    • 2019
      • ACL 2019
      • ICCV 2019
      • NeurIPS 2019
      • CVPR 2019
      • AAAI 2019
      • OTHER
    • 2018
      • NIPS 2018
      • AAAI 2018
      • IJCAI 2018
      • CVPR 2018
      • ACM MM 2018
      • ECCV 2018
      • OTHER
    • 2017-2015
      • OTHER
      • ICCV 2017
  • VQA Challenge Leaderboard
    • test-std 2018
    • test-std 2017
  • Licenses
  • Reference and Acknowledgement

Papers

Survey

2022

ACL 2022

CVPR 2022

AAAI 2022

IJCAI 2022

2021

NeurIPS 2021

EMNLP 2021

ICCV 2021

ACL 2021

SIGIR 2021

CVPR 2021

ICLR 2021

NAACL-HLT 2021

AAAI 2021

2020

EMNLP 2020

NeurIPS 2020

ECCV 2020

CVPR 2020

ACL 2020

WACV 2020

AAAI 2020

  • Multi‐Question Learning for Visual Question Answering - Chenyi Lei et al, AAAI 2020.
  • Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA - Badri N. Patro et al, AAAI 2020.
  • Overcoming Language Priors in VQA via Decomposed Linguistic Representations - Chenchen Jing et al, AAAI 2020.
  • Unified Vision-Language Pre-Training for Image Captioning and VQA - Luowei Zhou et al, AAAI 2020.
  • Re‐Attention for Visual Question Answering - Wenya Guo et al, AAAI 2020.
  • Divide and Conquer: Question­‐Guided Spatio­‐Temporal Contextual Attention for Video Question Answering - Jianwen Jiang et al, AAAI 2020.
  • Reasoning with Heterogeneous Graph Alignment for Video Question Answering - Pin Jiang et al, AAAI 2020.
  • Location­‐aware Graph Convolutional Networks for Video Question Answering - Deng Huang et al, AAAI 2020.
  • KnowIT VQA: Answering Knowledge­‐Based Questions about Videos - Noa Garcia et al, AAAI 2020.

2019

ACL 2019

ICCV 2019

NeurIPS 2019

CVPR 2019

AAAI 2019

OTHER

2018

NIPS 2018

AAAI 2018

IJCAI 2018

CVPR 2018

ACM MM 2018

ECCV 2018

OTHER

2017-2015

OTHER

Please check the other papers list from VQA area between 2017-2015 in awesome-vqa from JamesChuanggg, it seems that he hasn't maintained that project for a long time. Really appreciate for his work. I will merge his work to this list in the future.Stay tuned...

ICCV 2017

VQA Challenge Leaderboard

I will collect the leaderboard's implementations in the future.Stay tuned...

test-std 2018

test-std 2017

TextVQA

VQA-CP

Licenses

CC0

To the extent possible under law, Jokie Leung has waived all copyright and related or neighboring rights to this work.

Reference and Acknowledgement

Really appreciate for their contributions in this area.