Diffusion Source Identification on Networks with Statistical Confidence

Quinlan E Dawkins; Tianxi Li; Haifeng Xu

2021 ICML ICML 2021

Diffusion Source Identification on Networks with Statistical Confidence

Abstract

Diffusion source identification on networks is a problem of fundamental importance in a broad class of applications, including controlling the spreading of rumors on social media, identifying a computer virus over cyber networks, or identifying the disease center during epidemiology. Though this problem has received significant recent attention, most known approaches are well-studied in only very restrictive settings and lack theoretical guarantees for more realistic networks. We introduce a statistical framework for the study of this problem and develop a confidence set inference approach inspired by hypothesis testing. Our method efficiently produces a small subset of nodes, which provably covers the source node with any pre-specified confidence level without restrictive assumptions on network structures. To our knowledge, this is the first diffusion source identification method with a practically useful theoretical guarantee on general networks. We demonstrate our approach via extensive synthetic experiments on well-known random network models, a large data set of real-world networks as well as a mobility network between cities concerning the COVID-19 spreading in January 2020.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Data Science & Analytics and Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — diffusion source identification

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Quinlan E Dawkins , Tianxi Li , Haifeng Xu

Topics

Artificial Intelligence > Core AI > Causal Inference Machine Learning > Optimization & Theory > Statistical Learning Data Science & Analytics > Applications > Disease Surveillance Mathematics & Optimization > Mathematics > Graph Theory Machine Learning > Core Methods > Graphical Models Machine Learning > Optimization & Theory > Statistics Mathematics & Optimization > Probability > Stochastic Processes

Keywords

network analysis hypothesis testing network diffusion graphical model confidence set diffusion source identification source detection confidence set inference

Download PDF

Related papers

GRAND: Graph Neural Diffusion 2021

Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits 2021

Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation 2021

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution 2021

Dataset Dynamics via Gradient Flows in Probability Space 2021