Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems

Ivan Sekulic; Silvia Terragni; Victor Guimaraes; Nghia Khau; Bruna Guedes; Modestas Filipavicius; Andre Ferreira Manso; Roland Mathis

2024 EACL EACL 2024

Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems

Abstract

AbstractIn the realm of dialogue systems, user simulation techniques have emerged as a game-changer, redefining the evaluation and enhancement of task-oriented dialogue (TOD) systems. These methods are crucial for replicating real user interactions, enabling applications like synthetic data augmentation, error detection, and robust evaluation. However, existing approaches often rely on rigid rule-based methods or on annotated data. This paper introduces DAUS, a Domain-Aware User Simulator. Leveraging large language models, we fine-tune DAUS on real examples of task-oriented dialogues. Results on two relevant benchmarks showcase significant improvements in terms of user goal fulfillment. Notably, we have observed that fine-tuning enhances the simulator’s coherence with user goals, effectively mitigating hallucinations—a major source of inconsistencies in simulator responses.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ivan Sekulic , Silvia Terragni , Victor Guimaraes , Nghia Khau , Bruna Guedes , Modestas Filipavicius , Andre Ferreira Manso , Roland Mathis

Topics

Artificial Intelligence > Core AI > Agent Systems Natural Language Processing > Generation > Dialogue Systems Machine Learning > Learning Types > Transfer Learning Natural Language Processing > Applications > Dialogue Systems Artificial Intelligence > Core AI > Language

Keywords

task-oriented dialogue hallucination mitigation synthetic data augmentation dialogue system user simulator large language model user simulation

Download PDF

Related papers

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry 2024

PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation 2024

Overview of the Hate Speech Detection in Turkish and Arabic Tweets (HSD-2Lang) Shared Task at CASE 2024 2024

Evaluating In-Context Learning for Computational Literary Studies: A Case Study Based on the Automatic Recognition of Knowledge Transfer in German Drama 2024

Selam@DravidianLangTech 2024:Identifying Hate Speech and Offensive Language 2024