Explain-then-translate: an analysis on improving program translation with self-generated explanations

Zilu Tang; Mayank Agarwal; Alexander Shypula; Bailin Wang; Derry Wijaya; Jie Chen; Yoon Kim

EMNLP 2023

Abstract

This work explores the use of self-generated natural language explanations as an intermediate step for code-to-code translation with language models. Across three types of explanations and 19 programming languages constructed from the MultiPL-E dataset, we find the explanations to be particularly effective in the zero-shot case, improving performance by 12% on average. Improvements with natural language explanations are particularly pronounced on difficult programs. We release our dataset, code, and canonical solutions in all 19 languages.
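The two-step idea in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' released code: the function name `explain_then_translate`, the `generate` callable (any function that maps a prompt string to a model completion), and the prompt wording are all assumptions made for the example.

```python
from typing import Callable


def explain_then_translate(
    source_code: str,
    src_lang: str,
    tgt_lang: str,
    generate: Callable[[str], str],  # hypothetical: prompt in, completion out
) -> str:
    """Zero-shot translation with a self-generated explanation in between."""
    # Step 1: ask the model to explain the source program in natural language.
    # The prompt wording is an assumption, not the paper's template.
    explain_prompt = (
        f"Here is a {src_lang} program:\n\n{source_code}\n\n"
        f"Describe in plain English what this program does."
    )
    explanation = generate(explain_prompt)

    # Step 2: ask for the translation, conditioning on both the original
    # program and the explanation the model produced in step 1.
    translate_prompt = (
        f"Here is a {src_lang} program:\n\n{source_code}\n\n"
        f"Description of the program:\n{explanation}\n\n"
        f"Rewrite the program in {tgt_lang}."
    )
    return generate(translate_prompt)
```

Passing the model call in as a parameter keeps the sketch independent of any particular language-model API; a direct-translation baseline would simply skip step 1.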

Topics

Machine Learning > Learning Types > Zero-Shot Learning

Natural Language Processing > Generation > Text Generation

Natural Language Processing > Resources & Methods > Large Language Models

Keywords

zero-shot learning; language model; code translation; program translation; self-generated explanation
