Visual Question Answering
107 directly classified papers
Papers per year
2016: 2
2017: 5
2018: 8
2019: 12
2020: 14
2021: 7
2022: 5
2023: 12
2024: 20
2025: 22
Papers
Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering
EMNLP 2020
Language-Conditioned Feature Pyramids for Visual Selection Tasks
EMNLP 2020
Unsupervised Keyword Extraction for Full-Sentence VQA
EMNLP 2020
Reasoning Over History: Context Aware Visual Dialog
EMNLP 2020
Cross-Modality Relevance for Reasoning on Language and Vision
ACL 2020
Towards Task Understanding in Visual Settings
AAAI 2019
Differential Networks for Visual Question Answering
AAAI 2019
A Novel Framework for Robustness Analysis of Visual QA Models
AAAI 2019
Answer Them All! Toward Universal Visual Question Answering Models
CVPR 2019
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
CVPR 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
CVPR 2019
KVQA: Knowledge-Aware Visual Question Answering
AAAI 2019
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
AAAI 2019
The Meaning of “Most” for Visual Question Answering Models
ACL 2019
Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning
AAAI 2019
Dynamic Capsule Attention for Visual Question Answering
AAAI 2019
Learning by Abstraction: The Neural State Machine
NeurIPS 2019
TVQA: Localized, Compositional Video Question Answering
EMNLP 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
CVPR 2018
Visual Question Generation as Dual Task of Visual Question Answering
CVPR 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
CVPR 2018
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
CVPR 2018
VizWiz Grand Challenge: Answering Visual Questions From Blind People
CVPR 2018
Think Visually: Question Answering through Virtual Imagery
ACL 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
EMNLP 2018