Cross-Domain and Cross-Dimension Learning for Image-to-Graph Transformers

Alexander H. Berger; Laurin Lux; Suprosanna Shit; Ivan Ezhov; Georgios Kaissis; Martin J. Menten; Daniel Rueckert; Johannes C. Paetzold

WACV 2025

Abstract

Direct image-to-graph transformation is a challenging task that solves object detection and relationship prediction in a single model. Due to this task's complexity, large training datasets are rare in many domains, which makes training deep-learning methods challenging. This data sparsity necessitates transfer learning strategies akin to the state of the art in general computer vision. In this work, we introduce a set of methods enabling cross-domain and cross-dimension learning for image-to-graph transformers. We propose (1) a regularized edge sampling loss to effectively learn object relations in multiple domains with different numbers of edges, (2) a domain adaptation framework for image-to-graph transformers that aligns image- and graph-level features from different domains, and (3) a projection function that allows using 2D data for training 3D transformers. We demonstrate our method's utility in cross-domain and cross-dimension experiments, where we utilize labeled data from 2D road networks for simultaneous learning in vastly different target domains. Our method consistently outperforms standard transfer learning and self-supervised pretraining on challenging benchmarks, such as retinal or whole-brain vessel graph extraction.
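
The abstract does not say how the projection function in (3) is defined. As a rough illustration of the general idea only, the sketch below assumes the simplest possible lifting: the 2D image becomes one slice of an otherwise empty 3D volume, and each 2D graph node receives the z coordinate of that slice. The function names, the normalized [0, 1] coordinate convention, and the zero-filled volume are all assumptions made for this example and are not taken from the paper.

```python
from typing import List, Tuple

Node2D = Tuple[float, float]
Node3D = Tuple[float, float, float]
Edge = Tuple[int, int]


def lift_image_to_volume(
    image: List[List[float]], depth: int, z_index: int
) -> List[List[List[float]]]:
    """Embed a 2D image as one slice of an otherwise zero-filled 3D volume.

    Hypothetical helper: the paper's actual projection may differ.
    """
    if not 0 <= z_index < depth:
        raise ValueError("z_index must lie inside the volume")
    height, width = len(image), len(image[0])
    # Volume is indexed as volume[z][y][x].
    volume = [[[0.0] * width for _ in range(height)] for _ in range(depth)]
    volume[z_index] = [list(row) for row in image]
    return volume


def lift_graph_to_3d(
    nodes: List[Node2D], edges: List[Edge], depth: int, z_index: int
) -> Tuple[List[Node3D], List[Edge]]:
    """Give every normalized 2D node the z coordinate of the chosen slice.

    Edges only reference node indices, so they carry over unchanged.
    """
    z = (z_index + 0.5) / depth  # slice centre in normalized coordinates
    nodes_3d = [(x, y, z) for x, y in nodes]
    return nodes_3d, list(edges)


# Toy example: a 2x2 image with two nodes joined by one edge.
image = [[0.0, 1.0], [1.0, 0.0]]
nodes = [(0.25, 0.75), (0.75, 0.25)]
edges = [(0, 1)]

volume = lift_image_to_volume(image, depth=4, z_index=1)
nodes_3d, edges_3d = lift_graph_to_3d(nodes, edges, depth=4, z_index=1)
```

Under these assumptions, a 3D model can consume the lifted pair (`volume`, `nodes_3d`, `edges_3d`) in the same format as native 3D samples, which is the property the abstract attributes to the projection function.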

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning
Machine Learning > Application Areas > Domain Adaptation
Deep Learning > Architectures > Transformers
Computer Vision > Analysis > Object Detection
Computer Vision > Domain-Specific > Autonomous Driving
Machine Learning > Learning Types > Transfer Learning
Machine Learning > Learning Types > Domain Adaptation

Keywords

transfer learning, domain adaptation, object detection, relation prediction, graph transformer, graph neural network, image-to-graph transformation

Related papers

Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration 2025

ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting 2025

Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation 2025

Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach 2025

Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video 2025