Coding Open-Ended Responses using Pseudo Response Generation by Large Language Models

Yuki Zenimoto; Ryo Hasegawa; Takehito Utsuro; Masaharu Yoshioka; Noriko Kando

NAACL 2024

Abstract

Survey research using open-ended responses is an important method that contributes to the discovery of unknown issues and new needs. However, survey research generally requires time-consuming and costly manual data processing, which makes it difficult to analyze large datasets. To address this issue, we propose an LLM-based method to automate parts of the grounded theory approach (GTA), a representative approach to qualitative data analysis. We generated and annotated pseudo open-ended responses and used them as training data for the coding procedures of GTA. Through evaluations, we showed that models trained with pseudo open-ended responses are quite effective compared with those trained with manually annotated open-ended responses. We also demonstrate that the LLM-based approach is highly efficient and cost-saving compared to the human-based approach.
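The idea of training a coding model on pseudo responses can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `generate_pseudo_responses` is a hypothetical stub that returns canned sentences in place of a real LLM call, the codes `price` and `usability` are invented, and the small naive Bayes classifier is a stand-in, since the abstract does not say which models the authors trained.

```python
import math
from collections import Counter, defaultdict


def generate_pseudo_responses(code):
    """Hypothetical stand-in for an LLM call: returns canned (text, code) pairs."""
    canned = {
        "price": [
            "The product is too expensive for me.",
            "I wish the price were lower.",
            "It costs too much compared with rivals.",
        ],
        "usability": [
            "The menu is confusing to navigate.",
            "It is hard to find the settings screen.",
            "The app is difficult to use at first.",
        ],
    }
    return [(text, code) for text in canned[code]]


def tokenize(text):
    # Lowercase, split on whitespace, and strip simple punctuation.
    return [w.strip(".,!?").lower() for w in text.split() if w.strip(".,!?")]


def train_naive_bayes(examples):
    # Count tokens per code (label) and collect the vocabulary.
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return word_counts, label_counts, vocab


def predict(model, text):
    # Score each code with log prior + add-one smoothed log likelihoods.
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        score = math.log(label_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # ignore tokens never seen in training
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Build the pseudo training set, train, then code an unseen response.
pseudo_data = [
    ex for code in ("price", "usability") for ex in generate_pseudo_responses(code)
]
model = train_naive_bayes(pseudo_data)
label = predict(model, "The price is too expensive.")
print(label)
```

In a real setting the stub would be replaced by prompts to an LLM and the classifier by whatever model suits the data; the sketch only shows the flow from generated, labeled responses to a model that assigns codes to unseen responses.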

Topics

Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Large Language Models

Keywords

text generation, pseudo labeling, qualitative data analysis, survey research, large language model
