Papers
2,737 papers found
Munich to Dubai: How far is it for Semantic Segmentation?
Shyam Nandan Rai, Vineeth N Balasubramanian, Anbumani Subramanian et al.
Intro and Recap Detection for Movies and TV Series
Xiang Hao, Kripa Chettiar, Ben Cheung et al.
Attention-Based Spatial Guidance for Image-to-Image Translation
Yu Lin, Yigong Wang, Yifan Li et al.
HyperCon: Image-to-Video Model Transfer for Video-to-Video Translation Tasks
Ryan Szeto, Mostafa El-Khamy, Jungwon Lee et al.
Autonomous Tracking for Volumetric Video Sequences
Matthew Moynihan, Susana Ruano, Rafael Pages et al.
Towards Enhancing Fine-Grained Details for Image Matting
Chang Liu, Henghui Ding, Xudong Jiang
edge-SR: Super-Resolution for the Masses
Pablo Navarrete Michelini, Yunhua Lu, Xingqun Jiang
Time-Space Transformers for Video Panoptic Segmentation
Andra Petrovai, Sergiu Nedevschi
TA-Net: Topology-Aware Network for Gland Segmentation
Haotian Wang, Min Xian, Aleksandar Vakanski
Dynamic Re-Weighting for Long-Tailed Semi-Supervised Learning
Hanyu Peng, Weiguo Pian, Mingming Sun et al.
Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse Conditions
David Brüggemann, Christos Sakaridis, Prune Truong et al.
CTrGAN: Cycle Transformers GAN for Gait Transfer
Shahar Mahpod, Noam Gaash, Hay Hoffman et al.
End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes
Khanh Quoc Dinh, Kwang Pyo Choi
Image-Text Pre-Training for Logo Recognition
Mark Hubenthal, Suren Kumar
Holistic Interaction Transformer Network for Action Detection
Gueter Josmy Faure, Min-Hung Chen, Shang-Hong Lai
Back to MLP: A Simple Baseline for Human Motion Prediction
Wen Guo, Yuming Du, Xi Shen et al.
A Quality Aware Sample-to-Sample Comparison for Face Recognition
Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Zafari et al.
Modeling Stroke Mask for End-to-End Text Erasing
Xiangcheng Du, Zhao Zhou, Yingbin Zheng et al.
Mutual Learning for Long-Tailed Recognition
Changhwa Park, Junho Yim, Eunji Jun
TriPlaneNet: An Encoder for EG3D Inversion
Ananta R. Bhattarai, Matthias Nießner, Artem Sevastopolsky
Embedding Task Structure for Action Detection
Michael Peven, Gregory D. Hager
Shape From Shading for Robotic Manipulation
Arkadeep Narayan Chaudhury, Leonid Keselman, Christopher G. Atkeson
Bag of Tricks for Fully Test-Time Adaptation
Saypraseuth Mounsaveng, Florent Chiaroni, Malik Boudiaf et al.
TriCoLo: Trimodal Contrastive Loss for Text To Shape Retrieval
Yue Ruan, Han-Hung Lee, Yiming Zhang et al.
TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation
Paul Grimal, Hervé Le Borgne, Olivier Ferret et al.