Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Generation
Computer Vision
›
Generation
›
Image Captioning
781 directly classified papers
Papers per year
2003: 1
2008: 1
2011: 1
2012: 1
2013: 5
2014: 2
2015: 21
2016: 17
2017: 36
2018: 47
2019: 92
2020: 73
2021: 96
2022: 91
2023: 107
2024: 86
2025: 96
2026: 8
Papers
HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning
CVPR 2023
MetaCLUE: Towards Comprehensive Visual Metaphors Research
CVPR 2023
Learning from Children: Improving Image-Caption Pretraining via Curriculum
ACL 2023
Cross-Domain Image Captioning With Discriminative Finetuning
CVPR 2023
Boosting Radiology Report Generation by Infusing Comparison Prior
ACL 2023
Quality-agnostic Image Captioning to Safely Assist People with Vision Impairment
IJCAI 2023
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models
CONLL 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
ACL 2023
StFX NLP at SemEval-2023 Task 1: Multimodal Encoding-based Methods for Visual Word Sense Disambiguation
ACL 2023
Exploring the Impact of Vision Features in News Image Captioning
ACL 2023
Pragmatic Inference with a CLIP Listener for Contrastive Captioning
ACL 2023
Improving multimodal datasets with image captioning
NIPS 2023
FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing
ACL 2023
Exploring Diverse In-Context Configurations for Image Captioning
NIPS 2023
“Let’s not Quote out of Context”: Unified Vision-Language Pretraining for Context Assisted Image Captioning
ACL 2023
Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment
ACL 2023
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models
EMNLP 2023
Scalable 3D Captioning with Pretrained Models
NIPS 2023
IC3: Image Captioning by Committee Consensus
EMNLP 2023
Attractive Storyteller: Stylized Visual Storytelling with Unpaired Text
ACL 2023
Incorporating Unlikely Negative Cues for Distinctive Image Captioning
IJCAI 2023
Transferring General Multimodal Pretrained Models to Text Recognition
ACL 2023
JourneyDB: A Benchmark for Generative Image Understanding
NIPS 2023
InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
ACL 2023
Evaluating pragmatic abilities of image captioners on A3DS
ACL 2023
<
1
…
8
9
10
…
32
>