2009
NIPS
NeurIPS 2009
Streaming Pointwise Mutual Information
Abstract
Recent work has led to the ability to perform space efficient, approximate counting over large vocabularies in a streaming context. Motivated by the existence of data structures of this type, we explore the computation of associativity scores, other- wise known as pointwise mutual information (PMI), in a streaming context. We give theoretical bounds showing the impracticality of perfect online PMI compu- tation, and detail an algorithm with high expected accuracy. Experiments on news articles show our approach gives high accuracy on real world data.
🌉
Interdisciplinary Bridge
— Data Science & Analytics and Mathematics & Optimization
📈
Trend Setter
— Data Mining
🧭
Keyword Pioneer
— pointwise mutual information
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Data Science & Analytics, Machine Learning, Mathematics & Optimization, Natural Language Processing
Authors
Topics
Natural Language Processing > Applications > Information Retrieval
Natural Language Processing > Resources & Methods > Text Representation
Data Science & Analytics > Methods > Data Mining
Mathematics & Optimization > Mathematics > Information Theory
Mathematics & Optimization > Optimization > Stochastic Methods