2023 ACL ACL 2023

HyHTM: Hyperbolic Geometry-based Hierarchical Topic Model

Abstract

AbstractHierarchical Topic Models (HTMs) are useful for discovering topic hierarchies in a collection of documents. However, traditional HTMs often produce hierarchies where lower-level topics are unrelated and not specific enough to their higher-level topics. Additionally, these methods can be computationally expensive. We present HyHTM - a Hyperbolic geometry-based Hierarchical Topic Model - that addresses these limitations by incorporating hierarchical information from hyperbolic geometry to explicitly model hierarchies in topic models. Experimental results with four baselines show that HyHTM can better attend to parent-child relationships among topics. HyHTM produces coherent topic hierarchies that specialize in granularity from generic higher-level topics to specific lower-level topics. Further, our model is significantly faster and leaves a much smaller memory footprint than our best-performing baseline. We have made the source code for our algorithm publicly accessible.

🧭 Keyword Pioneer — parent-child relationship
🐝 Cross-Pollinator — Data Science & Analytics, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing
🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Mathematics & Optimization