Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Avinash Kori; Francesco Locatello; Ainkaran Santhirasekaram; Francesca Toni; Ben Glocker; Fabio De Sousa Ribeiro

2024 NIPS NeurIPS 2024

Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Abstract

Learning modular object-centric representations is said to be crucial for systematic generalization. Existing methods show promising object-binding capabilities empirically, but theoretical identifiability guarantees remain relatively underdeveloped. Understanding when object-centric representations can theoretically be identified is important for scaling slot-based methods to high-dimensional images with correctness guarantees. To that end, we propose a probabilistic slot-attention algorithm that imposes an aggregate mixture prior over object-centric slot representations, thereby providing slot identifiability guarantees without supervision, up to an equivalence relation. We provide empirical verification of our theoretical identifiability result using both simple 2-dimensional data and high-resolution imaging datasets.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

Authors

Avinash Kori , Francesco Locatello , Ainkaran Santhirasekaram , Francesca Toni , Ben Glocker , Fabio De Sousa Ribeiro

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Deep Learning > Learning Types > Self-Supervised Learning Artificial Intelligence > Core AI > Computer Vision Deep Learning > Learning Types > Representation Learning Deep Learning > Learning Types > Multimodal Learning

Keywords

unsupervised learning representation learning probabilistic modeling generative model mixture model object-centric learning object-centric representation slot attention identifiability guarantee mixture prior

Download PDF

Related papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers 2024

Training for Stable Explanation for Free 2024

NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks 2024

Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence 2024