Findings from the First Shared Task on Automated Prediction of Difficulty and Response Time for Multiple-Choice Questions

Victoria Yaneva; Kai North; Peter Baldwin; Le An Ha; Saed Rezayi; Yiyun Zhou; Sagnik Ray Choudhury; Polina Harik; Brian Clauser

NAACL 2024

Abstract

This paper reports findings from the First Shared Task on Automated Prediction of Difficulty and Response Time for Multiple-Choice Questions. The task was organized as part of the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA’24), held in conjunction with NAACL 2024, and called upon the research community to contribute solutions to the problem of modeling difficulty and response time for clinical multiple-choice questions (MCQs). A set of 667 previously used and now retired MCQs from the United States Medical Licensing Examination (USMLE®) and their corresponding difficulties and mean response times were made available for experimentation. A total of 17 teams submitted solutions and 12 teams submitted system report papers describing their approaches. This paper summarizes the findings from the shared task and analyzes the main approaches proposed by the participants.

Topics

Machine Learning > Core Methods > Regression
Interdisciplinary > Social > Education

Keywords

machine learning; response time; difficulty prediction; educational assessment; multiple choice question
