Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward

Mengyuan Yang; Mengying Zhu; Yan Wang; Linxun Chen; Yilei Zhao; Xiuyuan Wang; Bing Han; Xiaolin Zheng; Jianwei Yin

AAAI 2024

Abstract

Large language model-based explainable recommendation (LLM-based ER) systems can produce remarkably human-like explanations and have received wide attention from researchers. However, existing LLM-based ER systems suffer from three quality problems in their generated explanations: lack of personalization, inconsistency, and questionable explanation data. To address these problems, we propose a novel LLM-based ER model, LLM2ER, to serve as a backbone, and devise two explainable quality reward models for fine-tuning this backbone in a reinforcement learning paradigm. The resulting fine-tuned model, LLM2ER-EQR, generates personalized, informative, and consistent explanations even when trained on explanation datasets of questionable quality. Extensive experiments on three real-world datasets demonstrate that our model can generate fluent, diverse, informative, and highly personalized explanations.

Topics

Artificial Intelligence > Core AI > Foundation Models
Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Large Language Models
Data Science & Analytics > Applications > Recommender Systems
Machine Learning > Learning Types > Reinforcement Learning
Deep Learning > Models > Large Language Models

Keywords

model compression, reinforcement learning, reward modeling, natural language generation, text generation, language model fine-tuning, explainable recommendation, large language model
