Discovering Agents (Abstract Reprint)

Zachary Kenton; Ramana Kumar; Sebastian Farquhar; Jonathan Richens; Matt MacDermott; Tom Everitt

2024 AAAI AAAI 2024

Discovering Agents (Abstract Reprint)

Abstract

Abstract Causal models of agents have been used to analyse the safety aspects of machine learning systems. But identifying agents is non-trivial – often the causal model is just assumed by the modeller without much justification – and modelling failures can lead to mistakes in the safety analysis. This paper proposes the first formal causal definition of agents – roughly that agents are systems that would adapt their policy if their actions influenced the world in a different way. From this we derive the first causal discovery algorithm for discovering the presence of agents from empirical data, given a set of variables and under certain assumptions. We also provide algorithms for translating between causal models and game-theoretic influence diagrams. We demonstrate our approach by resolving some previous confusions caused by incorrect causal modelling of agents.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Knowledge & Reasoning and Machine Learning

🧭 Keyword Pioneer — game-theoretic influence diagram

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zachary Kenton , Ramana Kumar , Sebastian Farquhar , Jonathan Richens , Matt MacDermott , Tom Everitt

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Core AI > AI Safety Artificial Intelligence > Core AI > Causal Inference Machine Learning > Optimization & Theory > Learning Theory Knowledge & Reasoning > Reasoning > Causal Inference Artificial Intelligence > Core AI > Game Theory

Keywords

causal inference game theory causal discovery policy learning ai safety causal model agent system influence diagram causal modeling game-theoretic influence diagram

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024