Papers
2,121 papers found
Multi-Scale Contrastive Learning for Complex Scene Generation
Hanbit Lee, Youna Kim, Sang-goo Lee
Intention-Conditioned Long-Term Human Egocentric Action Anticipation
Esteve Valls Mascaró, Hyemin Ahn, Dongheui Lee
Textual Alchemy: CoFormer for Scene Text Understanding
Gayatri Deshmukh, Onkar Susladkar, Dhruv Makwana et al.
Leveraging Synthetic Data To Learn Video Stabilization Under Adverse Conditions
Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino et al.
Understanding Dark Scenes by Contrasting Multi-Modal Observations
Xiaoyu Dong, Naoto Yokoya
Multi-Modal Gaze Following in Conversational Scenarios
Yuqi Hou, Zhongqun Zhang, Nora Horanyi et al.
PMVC: Promoting Multi-View Consistency for 3D Scene Reconstruction
Chushan Zhang, Jinguang Tong, Tao Jun Lin et al.
FacadeNet: Conditional Facade Synthesis via Selective Editing
Yiangos Georgiou, Marios Loizou, Tom Kelly et al.
Robust Object Detection in Challenging Weather Conditions
Himanshu Gupta, Oleksandr Kotlyar, Henrik Andreasson et al.
Robust Learning via Conditional Prevalence Adjustment
Minh Nguyen, Alan Q. Wang, Heejong Kim et al.
INCODE: Implicit Neural Conditioning With Prior Knowledge Embeddings
Amirhossein Kazerouni, Reza Azad, Alireza Hosseini et al.
Controlling Virtual Try-On Pipeline Through Rendering Policies
Kedan Li, Jeffrey Zhang, Shao-Yu Chang et al.
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images
Zongshang Pang, Yuta Nakashima, Mayu Otani et al.
Decomposed Distribution Matching in Dataset Condensation
Sahar Rahimi Malakshan, Mohammad Saeed Ebrahimi Saadabadi, Ali Dabouei et al.
Tumor Synthesis Conditioned on Radiomics
Jonghun Kim, Inye Na, Eun Sook Ko et al.
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
Lingdong Kong, Xiang Xu, Jun Cen et al.
Scene-LLM: Extending Language Model for 3D Visual Reasoning
Rao Fu, Jingyu Liu, Xilun Chen et al.
Situational Scene Graph for Structured Human-Centric Situation Understanding
Chinthani Sugandhika, Chen Li, Deepu Rajan et al.
Cross-Aligned Fusion for Multimodal Understanding
Abhishek Rajora, Shubham Gupta, Suman Kundu
Tuned Contrastive Learning
Chaitanya Animesh, Manmohan Chandraker
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios
Vittorio Pipoli, Federico Bolelli, Sara Sarto et al.
Rubric-Constrained Figure Skating Scoring
Arushi Rai, Adriana Kovashka
START: Spatial and Textual Learning for Chart Understanding
Zhuoming Liu, Xiaofeng Gao, Feiyang Niu et al.
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Ruiyuan Gao, Kai Chen, Zhihao Li et al.
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang et al.