Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

Oren Kraus; Kian Kenyon-Dean; Saber Saberian; Maryam Fallah; Peter McLean; Jess Leung; Vasudev Sharma; Ayla Khan; Jia Balakrishnan; Safiye Celik; Dominique Beaini; Maciej Sypetkowski; Chi Vicky Cheng; Kristen Morse; Maureen Makes; Ben Mabey; Berton Earnshaw

2024 CVPR CVPR 2024

Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

Abstract

Featurizing microscopy images for use in biological research remains a significant challenge especially for large-scale experiments spanning millions of images. This work explores the scaling properties of weakly supervised classifiers and self-supervised masked autoencoders (MAEs) when training with increasingly larger model backbones and microscopy datasets. Our results show that ViT-based MAEs outperform weakly supervised classifiers on a variety of tasks achieving as much as a 11.5% relative improvement when recalling known biological relationships curated from public databases. Additionally we develop a new channel-agnostic MAE architecture (CA-MAE) that allows for inputting images of different numbers and orders of channels at inference time. We demonstrate that CA-MAEs effectively generalize by inferring and evaluating on a microscopy image dataset (JUMP-CP) generated under different experimental conditions with a different channel structure than our pretraining data (RPI-93M). Our findings motivate continued research into scaling self-supervised learning on microscopy data in order to create powerful foundation models of cellular biology that have the potential to catalyze advancements in drug discovery and beyond.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Healthcare & Medicine and Machine Learning

🧭 Keyword Pioneer — cellular biology

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Oren Kraus , Kian Kenyon-Dean , Saber Saberian , Maryam Fallah , Peter McLean , Jess Leung , Vasudev Sharma , Ayla Khan , Jia Balakrishnan , Safiye Celik , Dominique Beaini , Maciej Sypetkowski , Chi Vicky Cheng , Kristen Morse , Maureen Makes , Ben Mabey , Berton Earnshaw

Topics

Machine Learning > Learning Types > Self-Supervised Learning Deep Learning > Architectures > Transformers Computer Vision > Domain-Specific > Medical Imaging Healthcare & Medicine > Research > Bioinformatics Deep Learning > Techniques > Self-Supervised Learning Deep Learning > Learning Types > Self-Supervised Learning Deep Learning > Models > Foundation Models

Keywords

vision transformer self-supervised learning foundation model masked autoencoder weakly supervised microscopy image microscopy imaging cellular biology

Download PDF

Related papers

DUSt3R: Geometric 3D Vision Made Easy 2024

Bezier Everywhere All at Once: Learning Drivable Lanes as Bezier Graphs 2024

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows 2024

Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization 2024

DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models 2024