REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu; Zhilu Wang; Weichao Zhou; Yixuan Wang; Jiameng Fan; Chao Huang; Qi Zhu; Xin Chen; Wenchao Li

2024 AAAI AAAI 2024

REGLO: Provable Neural Network Repair for Global Robustness Properties

Abstract

Abstract We present REGLO, a novel methodology for repairing pretrained neural networks to satisfy global robustness and individual fairness properties. A neural network is said to be globally robust with respect to a given input region if and only if all the input points in the region are locally robust. This notion of global robustness also captures the notion of individual fairness as a special case. We prove that any counterexample to a global robustness property must exhibit a corresponding large gradient. For ReLU networks, this result allows us to efficiently identify the linear regions that violate a given global robustness property. By formulating and solving a suitable robust convex optimization problem, REGLO then computes a minimal weight change that will provably repair these violating linear regions.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Feisi Fu , Zhilu Wang , Weichao Zhou , Yixuan Wang , Jiameng Fan , Chao Huang , Qi Zhu , Xin Chen , Wenchao Li

Topics

Artificial Intelligence > Core AI > AI Safety Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Fairness Deep Learning > Architectures > Neural Networks Deep Learning > Optimization & Theory > Optimization

Keywords

convex optimization global robustness relu network individual fairness neural network repair

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024