Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion

Hengzhi Pei; Jinman Zhao; Leonard Lausen; Sheng Zha; George Karypis

AAAI 2023

Abstract

Pretrained code language models have enabled great progress towards program synthesis. However, common approaches consider only in-file local context and thus miss information and constraints imposed by other parts of the codebase and its external dependencies. Existing code completion benchmarks also lack such context. To address these limitations, we curate a new dataset of permissively licensed Python packages that includes full projects and their dependencies, and we provide tools to extract non-local information with the help of program analyzers. We then focus on the task of function call argument completion, which requires predicting the arguments to function calls. We show that existing code completion models do not yield good results on our completion task. To better solve this task, we query a program analyzer for information relevant to a given function call and consider ways to provide the analyzer results to different code completion models during inference and training. Our experiments show that providing access to the function implementation and function usages greatly improves argument completion performance. Our ablation study provides further insight into how the different types of information available from the program analyzer, and the different ways of incorporating that information, affect model performance.
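As a rough illustration of the idea in the abstract, the sketch below gathers two kinds of context for a function call, the callee's implementation and its other usages, and prepends them to the local prefix that a completion model would continue. It is not the authors' pipeline: Python's standard `ast` module stands in for the program analyzer, the analysis is limited to a single source string rather than a full project with dependencies, and the names `collect_context`, `build_prompt`, and `PROJECT_SOURCE`, as well as the prompt layout, are hypothetical.

```python
import ast


def collect_context(source: str, func_name: str):
    """Return (implementation, usages) for func_name found in source.

    Assumption: a single-file stand-in for a real program analyzer.
    """
    tree = ast.parse(source)
    implementation = None
    usages = []
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name == func_name:
            # Source text of the function definition.
            implementation = ast.get_source_segment(source, node)
        elif isinstance(node, ast.Call):
            callee = node.func
            # Handle plain calls (f(...)) and attribute calls (obj.f(...)).
            if isinstance(callee, ast.Name):
                name = callee.id
            else:
                name = getattr(callee, "attr", None)
            if name == func_name:
                usages.append(ast.get_source_segment(source, node))
    return implementation, usages


def build_prompt(local_prefix: str, implementation, usages) -> str:
    """Concatenate analyzer results with the local prefix (hypothetical layout)."""
    sections = []
    if implementation:
        sections.append("# Implementation of the callee:\n" + implementation)
    if usages:
        sections.append("# Other usages of the callee:\n" + "\n".join(usages))
    sections.append(local_prefix)
    return "\n\n".join(sections)


# Toy project source; it is only parsed, never executed.
PROJECT_SOURCE = '''
def resize(image, width, height, keep_ratio=True):
    return (image, width, height, keep_ratio)

thumb = resize(photo, 64, 64)
banner = resize(photo, 800, 200, keep_ratio=False)
'''

implementation, usages = collect_context(PROJECT_SOURCE, "resize")
# The model would be asked to complete the arguments after "icon = resize(".
prompt = build_prompt("icon = resize(", implementation, usages)
print(prompt)
```

Placing the analyzer output before the local prefix keeps the text to be completed at the end of the prompt, which suits left-to-right completion models; the abstract mentions several ways of providing this information at inference and training time, and this layout is only one simple possibility.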

Topics

Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Large Language Models
Computer Science > Applications > Software Engineering
Deep Learning > Learning Types > Representation Learning
Deep Learning > Models > Language Models

Keywords

code completion; program synthesis; program analysis; code language model; argument completion; function call argument completion; non-local context; function call argument; program analyzer
