Where Does This Data Come From? Enhanced Source Inference Attacks in Federated Learning

Haiyang Chen; Xiaolong Xu; Xiang Zhu; Xiaokang Zhou; Fei Dai; Yansong Gao; Xiao Chen; Shuo Wang; Hongsheng Hu

2025 IJCAI IJCAI 2025

Where Does This Data Come From? Enhanced Source Inference Attacks in Federated Learning

Abstract

Federated learning (FL) enables collaborative model training without exposing raw data, offering a privacy-aware alternative to centralized learning. However, FL remains vulnerable to various privacy attacks that exploit shared model updates, including membership inference, property inference, and gradient inversion. Source inference attacks further threaten FL by identifying which client contributed a specific training sample, posing severe risks to user and institutional privacy. Existing source inference attacks mainly assume passive adversaries and overlook more realistic scenarios where the server actively manipulates the training process. In this paper, we present an enhanced source inference attack that demonstrates how a malicious server can amplify behavioral differences between clients to more accurately infer data origin. Our approach introduces active training manipulation and data augmentation to expose client-specific patterns. Experimental results across five representative FL algorithms and multiple datasets show that our method significantly outperforms prior passive attacks. These findings reveal a deeper level of privacy vulnerability in FL and call for stronger defense mechanisms under active threat models.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🧭 Keyword Pioneer — source inference attack

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Haiyang Chen , Xiaolong Xu , Xiang Zhu , Xiaokang Zhou , Fei Dai , Yansong Gao , Xiao Chen , Shuo Wang , Hongsheng Hu

Topics

Artificial Intelligence > Core AI > AI Safety Machine Learning > Application Areas > Privacy Machine Learning > Learning Types > Federated Learning Security & Privacy > Privacy Machine Learning > Learning Paradigms > Federated Learning Machine Learning > Learning Types > Privacy

Keywords

federated learning privacy attack membership inference gradient inversion source inference attack active training manipulation

Download PDF

Related papers

Learning Advanced Self-Attention for Linear Transformers in the Singular Value Domain 2025

Responsibility Anticipation and Attribution in LTLf 2025

Argument-based Multi-Issue Negotiation 2025

Online Resource Sharing: Better Robust Guarantees via Randomized Strategies 2025

Equitable Mechanism Design for Facility Location 2025