Visual Question Answering
106 directly classified papers
Papers per year
2015: 1
2016: 7
2017: 3
2018: 11
2019: 7
2020: 20
2021: 11
2022: 11
2023: 11
2024: 9
2025: 15
Papers
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
ACL 2023
HybridPrompt: Bridging Language Models and Human Priors in Prompt Tuning for Visual Question Answering
AAAI 2023
COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering
AAAI 2023
Modular Visual Question Answering via Code Generation
ACL 2023
Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer
AAAI 2023
Location-Aware Visual Question Generation with Lightweight Models
EMNLP 2023
LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
NeurIPS 2023
Learning Situation Hyper-Graphs for Video Question Answering
CVPR 2023
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language
ACL 2023
Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts
EMNLP 2023
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
CVPR 2022
Multi-VQG: Generating Engaging Questions for Multiple Images
EMNLP 2022
Multi-Modal Answer Validation for Knowledge-Based VQA
AAAI 2022
Explore Inter-contrast between Videos via Composition for Weakly Supervised Temporal Sentence Grounding
AAAI 2022
Measuring Compositional Consistency for Video Question Answering
CVPR 2022
Towards Video Text Visual Question Answering: Benchmark and Baseline
NeurIPS 2022
Query and Attention Augmentation for Knowledge-Based Explainable Reasoning
CVPR 2022
Maintaining Reasoning Consistency in Compositional Visual Question Answering
CVPR 2022
WebQA: Multihop and Multimodal QA
CVPR 2022
Dynamic Key-Value Memory Enhanced Multi-Step Graph Reasoning for Knowledge-Based Visual Question Answering
AAAI 2022
CLIP Models are Few-Shot Learners: Empirical Studies on VQA and Visual Entailment
ACL 2022
Towards Visual Question Answering on Pathology Images
ACL 2021
Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking
EMNLP 2021
MiniVQA - A resource to build your tailored VQA competition
NAACL 2021
Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering
CVPR 2021