ContraCLM: Contrastive Learning For Causal Language Model

Nihal Jain; Dejiao Zhang; Wasi Uddin Ahmad; Zijian Wang; Feng Nan; Xiaopeng Li; Ming Tan; Ramesh Nallapati; Baishakhi Ray; Parminder Bhatia; Xiaofei Ma; Bing Xiang

ACL 2023

Abstract

Despite exciting progress in causal language models, the expressiveness of their representations is largely limited by poor discrimination ability. To remedy this issue, we present CONTRACLM, a novel contrastive learning framework that operates at both the token level and the sequence level. We assess CONTRACLM on a variety of downstream tasks. We show that CONTRACLM enhances the discrimination of representations and bridges the gap with encoder-only models, which makes causal language models better suited for tasks beyond language generation. Specifically, we attain a 44% relative improvement on the Semantic Textual Similarity tasks and a 34% relative improvement on Code-to-Code Search tasks. Furthermore, by improving the expressiveness of representations, CONTRACLM also boosts source code generation capability, with a 9% relative improvement in execution accuracy on the HumanEval benchmark.

Authors

Nihal Jain, Dejiao Zhang, Wasi Uddin Ahmad, Zijian Wang, Feng Nan, Xiaopeng Li, Ming Tan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Xiaofei Ma, Bing Xiang

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Learning Types > Contrastive Learning
Natural Language Processing > Generation > Language Modeling
Natural Language Processing > Resources & Methods > Language Modeling
Deep Learning > Techniques > Contrastive Learning
Deep Learning > Learning Types > Contrastive Learning
Deep Learning > Learning Types > Representation Learning

Keywords

representation learning, contrastive learning, code generation, semantic similarity, semantic textual similarity, causal language model

Related papers

History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling 2023

Efficient Transformers with Dynamic Token Pooling 2023

HHU at SemEval-2023 Task 3: An Adapter-based Approach for News Genre Classification 2023

NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as Data Augmentation Strategies for Detecting Persuasion Techniques 2023
