Ranking LLM-Generated Loop Invariants for Program Verification

Saikat Chakraborty; Shuvendu Lahiri; Sarah Fakhoury; Akash Lal; Madanlal Musuvathi; Aseem Rastogi; Aditya Senthilnathan; Rahul Sharma; Nikhil Swamy

2023 EMNLP EMNLP 2023

Ranking LLM-Generated Loop Invariants for Program Verification

Abstract

AbstractSynthesizing inductive loop invariants is fundamental to automating program verification. In this work we observe that Large Language Models (such as gpt-3.5 or gpt-4) are capable of synthesizing loop invariants for a class of programs in a 0-shot setting, yet require several samples to generate the correct invariants. This can lead to a large number a calls to a program verifier to establish an invariant. To address this issue, we propose a re-ranking approach for the generated results of LLMs. We have designed a ranker that can distinguish between correct inductive invariants and incorrect attempts based on the problem definition. The ranker is optimized as a contrastive ranker. Experimental results demonstrate that this re-ranking mechanism significantly improves the ranking of correct invariants among the generated candidates, leading to a notable reduction in the number of calls to a verifier.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Science and Deep Learning and Machine Learning

🧭 Keyword Pioneer — contrastive ranker

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Saikat Chakraborty , Shuvendu Lahiri , Sarah Fakhoury , Akash Lal , Madanlal Musuvathi , Aseem Rastogi , Aditya Senthilnathan , Rahul Sharma , Nikhil Swamy

Topics

Artificial Intelligence > Core AI > Foundation Models Machine Learning > Learning Types > Contrastive Learning Computer Science > Applications > Software Engineering Artificial Intelligence > Core AI > Reasoning Machine Learning > Learning Types > Ranking Deep Learning > Learning Types > Large Language Models

Keywords

program verification zero-shot setting contrastive ranking large language model loop invariant synthesis contrastive ranker synthesizing inductive loop invariant

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023