Visual Question Answering
106 directly classified papers
Papers per year
2015: 1
2016: 7
2017: 3
2018: 11
2019: 7
2020: 20
2021: 11
2022: 11
2023: 11
2024: 9
2025: 15
Papers
Learning from Inside: Self-driven Siamese Sampling and Reasoning for Video Question Answering
NeurIPS 2021
MiniVQA - A resource to build your tailored VQA competition
NAACL 2021
QACE: Asking Questions to Evaluate an Image Caption
EMNLP 2021
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
EMNLP 2021
Towards Visual Question Answering on Pathology Images
ACL 2021
Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering
CVPR 2021
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules
CVPR 2021
VQA With No Questions-Answers Training
CVPR 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
Overcoming Language Priors in VQA via Decomposed Linguistic Representations
AAAI 2020
Multi-Question Learning for Visual Question Answering
AAAI 2020
KnowIT VQA: Answering Knowledge-Based Questions about Videos
AAAI 2020
Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries
AAAI 2020
Temporally Grounding Language Queries in Videos by Contextual Boundary-Aware Prediction
AAAI 2020
Exploring Weaknesses of VQA Models through Attribution Driven Insights
ACL 2020
ISAAQ - Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention
EMNLP 2020
STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question Answering
EMNLP 2020
Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering
AAAI 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA
AAAI 2020
Reasoning with Heterogeneous Graph Alignment for Video Question Answering
AAAI 2020
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue
AAAI 2020
Re-Attention for Visual Question Answering
AAAI 2020
Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing
CVPR 2020
Counterfactual Samples Synthesizing for Robust Visual Question Answering
CVPR 2020
Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA
CVPR 2020