Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Natural Language Processing
›
Applications
›
Visual Question Answering
219 directly classified papers
Papers per year
2016: 1
2017: 6
2018: 13
2019: 26
2020: 22
2021: 23
2022: 20
2023: 20
2024: 37
2025: 49
2026: 2
Papers
Visual Query Answering by Entity-Attribute Graph Matching and Reasoning
CVPR 2019
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
CVPR 2019
Explicit Bias Discovery in Visual Question Answering Models
CVPR 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
NIPS 2019
Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects
NAACL 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
NIPS 2019
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension
IJCNLP 2019
Fusion of Detected Objects in Text for Visual Question Answering
EMNLP 2019
Multi-grained Attention with Object-level Grounding for Visual Question Answering
ACL 2019
Generating Question Relevant Captions to Aid Visual Question Answering
ACL 2019
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
AAAI 2019
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
AAAI 2019
Phrase Grounding by Soft-Label Chain Conditional Random Field
IJCNLP 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
IJCNLP 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
EMNLP 2019
Structure Learning for Neural Module Networks
EMNLP 2019
Multi-Modality Latent Interaction Network for Visual Question Answering
ICCV 2019
Densely Connected Attention Flow for Visual Question Answering
IJCAI 2019
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence
CVPR 2019
Visual Question Answering as Reading Comprehension
CVPR 2019
Deep Modular Co-Attention Networks for Visual Question Answering
CVPR 2019
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model
CVPR 2019
Representing Movie Characters in Dialogues
CONLL 2019
Multi-Scale Visual Semantics Aggregation with Self-Attention for End-to-End Image-Text Matching
ACML 2019
Learning by Asking Questions
CVPR 2018
<
1
…
5
6
7
8
9
>