Papers
310 papers found
3D Hand Pose Estimation in Everyday Egocentric Images
Aditya Prakash, Ruisen Tu, Matthew Chang et al.
3D Reconstruction of Objects in Hands without Real World 3D Supervision
Aditya Prakash, Matthew Chang, Matthew Jin et al.
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.
A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis
Kai Katsumata, Duc Minh Vo, Hideki Nakayama
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang, Yaqing Wang, Caiwen Ding et al.
Adapting to Shifting Correlations with Unlabeled Data Calibration
Minh Nguyen, Alan Q Wang, Heejong Kim et al.
Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction
Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.
Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Noam Elata, Tomer Michaeli, Michael Elad
AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale
Keenon Werling, Janelle M Kaneda, Tian Tan et al.
A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control
Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.
Adversarial Diffusion Distillation
Axel Sauer, Dominik Lorenz, Andreas Blattmann et al.
Adversarial Robustification via Text-to-Image Diffusion Models
Daewon Choi, Jongheon Jeong, Huiwon Jang et al.
Affine steerers for structured keypoint description
Georg Bökman, Johan Edstedt, Michael Felsberg et al.
"A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation"
Riccardo Fogliato, Pratik Patil, Mathew Monfort et al.
A high-quality robust diffusion framework for corrupted dataset
Quan Dao, Binh Ta, Tung Pham et al.
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Rining Wu, Feixiang Zhou, Ziwei Yin et al.
A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis
Xiang Liu, Zhaoxiang Liu, Huan Hu et al.
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan, Mengping Yang, Luozheng Qin et al.
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Liang Chen, Haozhe Zhao, Tianyu Liu et al.
Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos
Remy Sabathier, David Novotny, Niloy Mitra
Augmented Neural Fine-tuning for Efficient Backdoor Purification
Nazmul Karim, Abdullah Al Arafat, Umar Khalid et al.
BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling
Cheng Peng, Yutao Tang, Yifan Zhou et al.
Benchmarking Spurious Bias in Few-Shot Image Classifiers
Guangtao Zheng, Wenqian Ye, Aidong Zhang
Benchmarking the Robustness of Cross-view Geo-localization Models
Qingwang Zhang, Yingying Zhu
Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning
Xinyuan Gao, Songlin Dong, Yuhang He et al.