StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Ze Yang; Wei Wu; Can Xu; Xinnian Liang; Jiaqi Bai; Liran Wang; Wei Wang; Zhoujun Li

2020 EMNLP EMNLP 2020

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Abstract

AbstractGenerating responses following a desired style has great potentials to extend applications of open-domain dialogue systems, yet is refrained by lacking of parallel data for training. In this work, we explore the challenging task with pre-trained language models that have brought breakthrough to various natural language tasks. To this end, we introduce a KL loss and a style classifier to the fine-tuning step in order to steer response generation towards the target style in both a word-level and a sentence-level. Comprehensive empirical studies with two public datasets indicate that our model can significantly outperform state-of-the-art methods in terms of both style consistency and contextual coherence.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ze Yang , Wei Wu , Can Xu , Xinnian Liang , Jiaqi Bai , Liran Wang , Wei Wang , Zhoujun Li

Topics

Natural Language Processing > Generation > Dialogue Systems Natural Language Processing > Resources & Methods > Large Language Models Deep Learning > Models > Language Models Artificial Intelligence > Core AI > Natural Language Generation Artificial Intelligence > Core AI > Dialogue Systems

Keywords

style transfer response generation text generation pre-trained language model dialogue system style consistency

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020