On the Effectiveness of Prompt-Moderated LLMs for Math Tutoring at the Tertiary Level

Sebastian Steindl; Fabian Brunner; Nada Sissouno; Dominik Schwagerl; Florian Schöler-Niewiera; Ulrich Schäfer

2025 EMNLP EMNLP 2025

On the Effectiveness of Prompt-Moderated LLMs for Math Tutoring at the Tertiary Level

Abstract

AbstractLarge Language Models (LLMs) have been studied intensively in the context of education, yielding heterogeneous results. Nowadays, these models are also deployed in formal education institutes. While specialized models exist, using prompt-moderated LLMs is widespread. In this study, we therefore investigate the effectiveness of prompt-moderated LLMs for math tutoring at a tertiary-level. We conduct a three-phase study with students (N=49) first receiving a review of the topics, then solving exercises, and finally writing an exam. During the exercises, they are presented with different types of assistance. We analyze the effect of LLM usage on the students’ performance, their engagement with the LLM, and their conversation strategies. Our results show that the prompt-moderation had a negative influence when compared to an unmoderated LLM. However, when the assistance was removed again, both LLM groups performed better than the control group, contradicting concerns about shallow learning. We publish the annotated conversations as a dataset to foster future research.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Interdisciplinary

🧭 Keyword Pioneer — llm assistance

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sebastian Steindl , Fabian Brunner , Nada Sissouno , Dominik Schwagerl , Florian Schöler-Niewiera , Ulrich Schäfer

Topics

Artificial Intelligence > Core AI > Human-AI Interaction Artificial Intelligence > Learning Paradigms > Transfer Learning Artificial Intelligence > Core AI > Large Language Models Interdisciplinary > Education

Keywords

prompt engineering educational technology large language model student performance math tutoring higher education llm assistance

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025