Cross-Subject Mind Decoding from Inaccurate Representations

Yangyang Xu; Bangzhen Liu; Wenqi Shao; Yong Du; Shengfeng He; Tingting Zhu

ICCV 2025

Abstract

Decoding stimulus images from fMRI signals has advanced with pre-trained generative models. However, existing methods struggle with cross-subject mappings due to cognitive variability and subject-specific differences. This challenge arises from sequential errors, where unidirectional mappings generate partially inaccurate representations that, when fed into diffusion models, accumulate errors and degrade reconstruction fidelity. To address this, we propose the Bidirectional Autoencoder Intertwining framework for accurate mind representation prediction. Our approach unifies multiple subjects through a Subject Bias Modulation Module while leveraging bidirectional mapping to better capture data distributions for precise representation prediction. To further enhance fidelity when decoding representations into stimulus images, we introduce a Semantic Refinement Module to improve semantic representations and a Visual Coherence Module to mitigate the effects of inaccurate visual representations. Integrated with ControlNet and Stable Diffusion, our method outperforms state-of-the-art approaches on benchmark datasets in both qualitative and quantitative evaluations. Moreover, our framework exhibits strong adaptability to new subjects with minimal training samples.
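
The idea of pairing a per-subject modulation with a bidirectional mapping can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the linear maps, the scale-and-shift form of the modulation, the array sizes, and all names (`modulate`, `demodulate`, `bidirectional_loss`) are hypothetical and are not taken from the paper, which does not specify its architecture in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes chosen for illustration only (not from the paper).
N_SUBJECTS, N_VOXELS, N_LATENT = 3, 32, 8

# Shared linear maps, hypothetical stand-ins for learned networks.
W_fwd = rng.normal(scale=0.1, size=(N_VOXELS, N_LATENT))  # fMRI -> representation
W_bwd = rng.normal(scale=0.1, size=(N_LATENT, N_VOXELS))  # representation -> fMRI

# One scale/shift pair per subject: an assumed, very simple form of
# subject-specific modulation into a shared space.
subject_scale = np.ones((N_SUBJECTS, N_VOXELS))
subject_shift = np.zeros((N_SUBJECTS, N_VOXELS))


def modulate(fmri, subject_id):
    """Map one subject's fMRI signal into the shared space."""
    return fmri * subject_scale[subject_id] + subject_shift[subject_id]


def demodulate(fmri_shared, subject_id):
    """Invert modulate(): map shared-space signal back to the subject."""
    return (fmri_shared - subject_shift[subject_id]) / subject_scale[subject_id]


def bidirectional_loss(fmri, target_rep, subject_id):
    """Sum of mean squared errors in both mapping directions."""
    # Forward direction: fMRI -> predicted representation.
    pred_rep = modulate(fmri, subject_id) @ W_fwd
    forward_err = np.mean((pred_rep - target_rep) ** 2)
    # Backward direction: representation -> predicted fMRI.
    pred_fmri = demodulate(target_rep @ W_bwd, subject_id)
    backward_err = np.mean((pred_fmri - fmri) ** 2)
    return forward_err + backward_err, forward_err, backward_err


# Random placeholder data: 4 trials from subject 1.
fmri = rng.normal(size=(4, N_VOXELS))
target_rep = rng.normal(size=(4, N_LATENT))
total, fwd, bwd = bidirectional_loss(fmri, target_rep, subject_id=1)
```

In this sketch the backward term penalizes representations that cannot be mapped back to the measured signal, which is one plausible reading of how a bidirectional objective constrains the forward prediction.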

Topics

Machine Learning > Application Areas > Domain Adaptation
Deep Learning > Architectures > Autoencoders
Deep Learning > Models > Diffusion Models
Computer Vision > Generation > Image Generation
Healthcare & Medicine > Clinical > Medical AI

Keywords

image reconstruction, semantic representation, brain decoding, diffusion model, fMRI signal, bidirectional autoencoder, cross-subject mapping, semantic refinement, mind decoding
