Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Models
Deep Learning
›
Models
›
Foundation Models
259 directly classified papers
Papers per year
2021: 5
2022: 13
2023: 23
2024: 104
2025: 109
2026: 5
Papers
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
WACV 2025
Transferring Foundation Models for Generalizable Robotic Manipulation
WACV 2025
AnomalyDINO: Boosting Patch-Based Few-Shot Anomaly Detection with DINOv2
WACV 2025
Towards Real-Time Open-Vocabulary Video Instance Segmentation
WACV 2025
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2
JMLR 2025
Physics-Guided Foundation Model for Scientific Discovery: An Application to Aquatic Science
AAAI 2025
Unified Multimodal Understanding via Byte-Pair Visual Encoding
ICCV 2025
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
ICCV 2025
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
ICCV 2025
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing
ICCV 2025
FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration
ICCV 2025
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
ICCV 2025
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration
ICCV 2025
DexVLG: Dexterous Vision-Language-Grasp Model at Scale
ICCV 2025
FLARE: A Framework for Stellar Flare Forecasting Using Stellar Physical Properties and Historical Records
IJCAI 2025
Generalizable Object Re-Identification via Visual In-Context Prompting
ICCV 2025
Detect Anything 3D in the Wild
ICCV 2025
Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection
ICCV 2025
Find Any Part in 3D
ICCV 2025
SAM4D: Segment Anything in Camera and LiDAR Streams
ICCV 2025
Scaling Laws for Native Multimodal Models
ICCV 2025
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
ICCV 2025
Enhancing Prompt Generation with Adaptive Refinement for Camouflaged Object Detection
ICCV 2025
Towards Foundational Models for Single-Chip Radar
ICCV 2025
<
1
2
3
4
5
…
11
>