Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Natural Language Processing
›
Applications
›
Image Captioning
51 directly classified papers
Papers per year
2015: 3
2016: 2
2017: 2
2018: 4
2019: 6
2020: 6
2021: 11
2022: 5
2023: 5
2024: 3
2025: 4
Papers
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
CVPR 2025
VC4VG: Optimizing Video Captions for Text-to-Video Generation
EMNLP 2025
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
AAAI 2025
Semantic and Expressive Variations in Image Captions Across Languages
CVPR 2025
TAME-RD: Text Assisted Replication of Image Multi-Adjustments for Reverse Designing
ACL 2024
Describing Differences in Image Sets with Natural Language
CVPR 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
CVPR 2024
Semantic-Conditional Diffusion Networks for Image Captioning
CVPR 2023
SceneTrilogy: On Human Scene-Sketch and Its Complementarity With Photo and Text
CVPR 2023
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
CVPR 2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
EMNLP 2023
Evaluating pragmatic abilities of image captioners on A3DS
ACL 2023
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation
EMNLP 2022
Paraphrasing Is All You Need for Novel Object Captioning
NIPS 2022
Concadia: Towards Image-Based Text Generation with a Purpose
EMNLP 2022
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
CVPR 2022
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
EMNLP 2022
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words
CVPR 2021
Text Embedding Bank for Detailed Image Paragraph Captioning
AAAI 2021
Language Resource Efficient Learning for Captioning
EMNLP 2021
Connecting What To Say With Where To Look by Modeling Human Attention Traces
CVPR 2021
Consensus Graph Representation Learning for Better Grounded Image Captioning
AAAI 2021
Transitional Adaptation of Pretrained Models for Visual Storytelling
CVPR 2021
Dual-level Collaborative Transformer for Image Captioning
AAAI 2021
A Self-Boosting Framework for Automated Radiographic Report Generation
CVPR 2021
<
1
2
3
>