ACL 2025
KazBench-KK: A Cultural-Knowledge Benchmark for Kazakh
Abstract
We introduce KazBench-KK, a comprehensive 7,111-question multiple-choice benchmark designed to assess large language models’ understanding of culturally grounded Kazakh knowledge. By combining expert-curated topics with LLM-assisted web mining, we create a diverse dataset spanning 17 culturally salient domains, including pastoral traditions, social hierarchies, and contemporary politics. Beyond evaluation, KazBench-KK serves as a practical tool for field linguists, enabling rapid lexical elicitation, glossing, and topic prioritization. Our benchmarking of various open-source LLMs reveals that reinforcement-tuned models outperform others, but smaller, domain-focused fine-tunes can rival larger models in specific cultural contexts.
Topics
Natural Language Processing > Applications > Machine Translation
Natural Language Processing > Applications > Text Classification
Natural Language Processing > Resources & Methods > Multilingual NLP
Interdisciplinary > Linguistics > Computational Linguistics
Deep Learning > Models > Large Language Models
Machine Learning > Optimization & Theory > Evaluation
Machine Learning > Learning Types > Reinforcement Learning from Human Feedback