2013 ICML ICML 2013

Copy or Coincidence? A Model for Detecting Social Influence and Duplication Events

Abstract

In this paper, we analyze the task of inferring rare links between pairs of entities that seem too similar to have occurred by chance. Variations of this task appear in such diverse areas as social network analysis, security, fraud detection, and entity resolution. To address the task in a general form, we propose a simple, flexible mixture model in which most entities are generated independently from a distribution but a small number of pairs are constrained to be similar. We predict the true pairs using a likelihood ratio that trades off the entitiesโ€™ similarity with their rarity. This method always outperforms using only similarity; however, with certain parameter settings, similarity turns out to be surprisingly competitive. Using real data, we apply the model to detect twins given their birth weights and to re-identify cell phone users based on distinctive usage patterns.

โ“ The Questioner
๐Ÿš€ Conference Pioneer โ€” ICML 2013
๐ŸŒ‰ Interdisciplinary Bridge โ€” Data Science & Analytics and Machine Learning
๐Ÿงญ Keyword Pioneer โ€” entity resolution
๐Ÿฃ Hot Topic Early Bird โ€” social network
๐Ÿ Cross-Pollinator โ€” Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio