Can Large Language Models Capture Dissenting Human Voices?

Noah Lee; Na Min An; James Thorne

2023 EMNLP EMNLP 2023

Can Large Language Models Capture Dissenting Human Voices?

Abstract

AbstractLarge language models (LLMs) have shown impressive achievements in solving a broad range of tasks. Augmented by instruction fine-tuning, LLMs have also been shown to generalize in zero-shot settings as well. However, whether LLMs closely align with the human disagreement distribution has not been well-studied, especially within the scope of natural language inference (NLI). In this paper, we evaluate the performance and alignment of LLM distribution with humans using two different techniques to estimate the multinomial distribution: Monte Carlo Estimation (MCE) and Log Probability Estimation (LPE). As a result, we show LLMs exhibit limited ability in solving NLI tasks and simultaneously fail to capture human disagreement distribution. The inference and human alignment performances plunge even further on data samples with high human disagreement levels, raising concerns about their natural language understanding (NLU) ability and their representativeness to a larger human population.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — dissenting voice

🐣 Hot Topic Early Bird — human alignment

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Noah Lee , Na Min An , James Thorne

Topics

Artificial Intelligence > Core AI > Human-AI Interaction Natural Language Processing > Resources & Methods > Natural Language Inference Artificial Intelligence > Core AI > Large Language Models Natural Language Processing > Applications > Natural Language Inference Machine Learning > Learning Types > Evaluation

Keywords

zero-shot learning natural language inference natural language understanding monte carlo estimation human alignment dissenting voice disagreement distribution

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023