2011 AISTATS AISTATS 2011

Bridging the Language Gap: Topic Adaptation for Documents with Different Technicality

Abstract

The language-gap, for example between low-literacy laypersons and highly-technical experts, is a fundamental barrier for cross-domain knowledge transfer. This paper seeks to close the gap at the thematic level via topic adaptation, i.e., adjusting topical structures for cross-domain documents according to a domain factor such as technicality. We present a probabilistic model for this purpose based on joint modeling of topic and technicality. The proposed $\tau$LDA model explicitly encodes the interplay between topic and technicality hierarchies, providing an effective topic-bridge between lay and expert documents. We demonstrate the usefulness of $\tau$LDA with an application to consumer medical informatics.

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning and Natural Language Processing
📈 Trend Setter — Text Representation
🧭 Keyword Pioneer — cross-domain transfer
🐣 Hot Topic Early Bird — domain adaptation
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio