Research Explorer
Scene Understanding
1887 directly classified papers
Papers per year
2006: 14
2007: 12
2008: 12
2009: 20
2010: 14
2011: 13
2012: 13
2013: 108
2014: 43
2015: 83
2016: 42
2017: 61
2018: 58
2019: 138
2020: 128
2021: 197
2022: 132
2023: 222
2024: 243
2025: 287
2026: 47
Papers
Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing
ICCV 2025
Less is More: Empowering GUI Agent with Context-Aware Simplification
ICCV 2025
OURO: A Self-Bootstrapped Framework for Enhancing Multimodal Scene Understanding
ICCV 2025
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
ICCV 2025
Supercharging Floorplan Localization with Semantic Rays
ICCV 2025
Fine-Grained Perception in Panoramic Scenes: A Novel Task, Dataset, and Method for Object Importance Ranking
AAAI 2025
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
ICCV 2025
VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition
ICCV 2025
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset
AAAI 2025
Transparent Vision: A Theory of Hierarchical Invariant Representations
ICCV 2025
PanSt3R: Multi-view Consistent Panoptic Segmentation
ICCV 2025
SANPO: A Scene Understanding Accessibility and Human Navigation Dataset
WACV 2025
PlaneRAS: Learning Planar Primitives for 3D Plane Recovery
ICCV 2025
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
ICCV 2025
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning
ICCV 2025
Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
WACV 2025
Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation
ICCV 2025
ChartCap: Mitigating Hallucination of Dense Chart Captioning
ICCV 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding
NAACL 2025
TaiwanVQA: A Benchmark for Visual Question Answering for Taiwanese Daily Life
COLING 2025
Trial-Oriented Visual Rearrangement
ICCV 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
ICCV 2025
MAVias: Mitigate any Visual Bias
ICCV 2025
Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation
WACV 2025
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models
CVPR 2025