Meta Learning for Image Captioning

Nannan Li; Zhenzhong Chen; Shan Liu

AAAI 2019

Abstract

Reinforcement learning (RL) has shown its advantages in image captioning by directly optimizing a non-differentiable metric as the reward during learning. However, because of the reward hacking problem in RL, maximizing the reward may not lead to better caption quality, especially in terms of propositional content and distinctiveness. In this work, we propose to use a new learning method, meta learning, to utilize supervision from the ground truth while optimizing the reward function in RL. To improve the propositional content and the distinctiveness of the generated captions, the proposed model provides the global optimal solution by taking different gradient steps towards the supervision task and the reinforcement task simultaneously. Experimental results on MS COCO validate the effectiveness of our approach when compared with state-of-the-art methods.
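The abstract describes taking gradient steps towards a supervision task and a reinforcement task at the same time. Since only the abstract is available here, the following is a toy sketch of that general idea and not the authors' implementation. It assumes a first-order, MAML-style update and replaces both the cross-entropy loss and the reward objective with simple quadratic losses; the names `meta_step`, `SUPERVISION_TARGET`, and `REWARD_TARGET` are hypothetical.

```python
# Toy sketch (assumption: first-order MAML-style update on two stand-in tasks).
# Each "task" is a quadratic loss 0.5 * ||theta - target||^2, standing in for
# the supervision (ground-truth) objective and the reinforcement (reward) objective.

def grad_quadratic(theta, target):
    """Gradient of 0.5 * ||theta - target||^2 with respect to theta."""
    return [t - g for t, g in zip(theta, target)]


def meta_step(theta, targets, inner_lr=0.1, outer_lr=0.1):
    """One meta update: adapt to each task separately, then combine."""
    meta_grad = [0.0] * len(theta)
    for target in targets:
        # Inner step: move the shared parameters towards this task alone.
        g = grad_quadratic(theta, target)
        adapted = [t - inner_lr * gi for t, gi in zip(theta, g)]
        # First-order approximation: use the task gradient at the adapted point.
        g_adapted = grad_quadratic(adapted, target)
        meta_grad = [m + ga for m, ga in zip(meta_grad, g_adapted)]
    # Outer step: update the shared parameters with the summed task gradients.
    return [t - outer_lr * m for t, m in zip(theta, meta_grad)]


def train(theta, targets, steps=200):
    for _ in range(steps):
        theta = meta_step(theta, targets)
    return theta


# Hypothetical optima of the two stand-in objectives.
SUPERVISION_TARGET = [1.0, 0.0]
REWARD_TARGET = [0.0, 1.0]

theta = train([5.0, -3.0], [SUPERVISION_TARGET, REWARD_TARGET])
print(theta)
```

In this toy setting the shared parameters settle between the two task optima rather than collapsing onto either one, which loosely mirrors the stated goal of optimizing the reward without abandoning ground-truth supervision.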

Authors

Nannan Li, Zhenzhong Chen, Shan Liu

Topics

Artificial Intelligence > Learning Paradigms > Meta-Learning
Computer Vision > Generation > Image Captioning
Reinforcement Learning > Methods > Deep RL
Machine Learning > Learning Paradigms > Meta-Learning
Deep Learning > Learning Types > Meta-Learning
Natural Language Processing > Applications > Image Captioning

Keywords

meta learning, reinforcement learning, image captioning, reward function, reward hacking, gradient step, caption generation, non-differentiable metric, propositional content
