2020 COLING COLING 2020

Pointing to Subwords for Generating Function Names in Source Code

Abstract

AbstractWe tackle the task of automatically generating a function name from source code. Existing generators face difficulties in generating low-frequency or out-of-vocabulary subwords. In this paper, we propose two strategies for copying low-frequency or out-of-vocabulary subwords in inputs. Our best performing model showed an improvement over the conventional method in terms of our modified F1 and accuracy on the Java-small and Java-large datasets.

🌱 Topic Pioneer — Code Generation
🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing
📈 Trend Setter — Code Generation
🧭 Keyword Pioneer — code tokenization
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Natural Language Processing, Reinforcement Learning, Speech & Audio