2024 ECCV ECCV 2024

SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding