2020
ACL
ACL 2020
MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs
Abstract
AbstractThe prosperity of Massive Open Online Courses (MOOCs) provides fodder for many NLP and AI research for education applications, e.g., course concept extraction, prerequisite relation discovery, etc. However, the publicly available datasets of MOOC are limited in size with few types of data, which hinders advanced models and novel attempts in related topics. Therefore, we present MOOCCube, a large-scale data repository of over 700 MOOC courses, 100k concepts, 8 million student behaviors with an external resource. Moreover, we conduct a prerequisite discovery task as an example application to show the potential of MOOCCube in facilitating relevant research. The data repository is now available at http://moocdata.cn/data/MOOCCube.
🌉
Interdisciplinary Bridge
— Data Science & Analytics and Interdisciplinary and Natural Language Processing
🧭
Keyword Pioneer
— education data mining
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Jifan Yu
,
Gan Luo
,
Tong Xiao
,
Qingyang Zhong
,
Yuquan Wang
,
Wenzheng Feng
,
Junyi Luo
,
Chenyu Wang
,
Lei Hou
,
Juanzi Li
,
Zhiyuan Liu
,
Jie Tang