2017
JMLR
JMLR 2017
Refinery: An Open Source Topic Modeling Web Platform
Abstract
We introduce Refinery, an open source platform for exploring large text document collections with topic models. Refinery is a standalone web application driven by a graphical interface, so it is usable by those without machine learning or programming expertise. Users can interactively organize articles by topic and also refine this organization with phrase-level analysis. Under the hood, we train Bayesian nonparametric topic models that can adapt model complexity to the provided data with scalable learning algorithms. The project website contains Python code and further documentation. [abs] [ pdf ][ bib ] [ code ] [ webpage ] © JMLR 2017. (edit, beta)
🌉
Interdisciplinary Bridge
— Data Science & Analytics and Machine Learning
🧭
Keyword Pioneer
— document organization
🐣
Hot Topic Early Bird
— text mining
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio