Papers
3,399 papers found
DeepPatent: Large Scale Patent Drawing Recognition and Retrieval
Michal Kucer, Diane Oyen, Juan Castorena et al.
Single Image Object Counting and Localizing Using Active-Learning
Inbar Huberman-Spiegelglas, Raanan Fattal
Composite Learning for Robust and Effective Dense Predictions
Menelaos Kanakis, Thomas E. Huang, David Brüggemann et al.
AFPSNet: Multi-Class Part Parsing Based on Scaled Attention and Feature Fusion
Njuod Alsudays, Jing Wu, Yu-Kun Lai et al.
DRAMA: Joint Risk Localization and Captioning in Driving
Srikanth Malla, Chiho Choi, Isht Dwivedi et al.
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping
Felix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez et al.
PatchDropout: Economizing Vision Transformers Using Patch Dropout
Yue Liu, Christos Matsoukas, Fredrik Strand et al.
Tracking Growth and Decay of Plant Roots in Minirhizotron Images
Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas et al.
Motif Mining: Finding and Summarizing Remixed Image Content
William Theisen, Daniel Gonzalez Cedre, Zachariah Carmichael et al.
Pik-Fix: Restoring and Colorizing Old Photos
Runsheng Xu, Zhengzhong Tu, Yuanqi Du et al.
PatchZero: Defending Against Adversarial Patch Attacks by Detecting and Zeroing the Patch
Ke Xu, Yao Xiao, Zhaoheng Zheng et al.
Compact and Optimal Deep Learning With Recurrent Parameter Generators
Jiayun Wang, Yubei Chen, Stella X. Yu et al.
LAVA: Label-Efficient Visual Learning and Adaptation
Islam Nassar, Munawar Hayat, Ehsan Abbasnejad et al.
SONGs: Self-Organizing Neural Graphs
Łukasz Struski, Tomasz Danel, Marek Śmieja et al.
Patch-Based Selection and Refinement for Early Object Detection
Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo et al.
Token Fusion: Bridging the Gap Between Token Pruning and Token Merging
Minchul Kim, Shangqian Gao, Yen-Chang Hsu et al.
Taming Normalizing Flows
Shimon Malnick, Shai Avidan, Ohad Fried
RGB-D Mapping and Tracking in a Plenoxel Radiance Field
Andreas L. Teigen, Yeonsoo Park, Annette Stahl et al.
Learning To Recognize Occluded and Small Objects With Partial Inputs
Hasib Zunair, A. Ben Hamza
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
Cheng Yang, Rui Xu, Ye Guo et al.
Synthesizing Anyone, Anywhere, in Any Pose
Håkon Hukkelås, Frank Lindseth
P2D: Plug and Play Discriminator for Accelerating GAN Frameworks
Min Jin Chong, Krishna Kumar Singh, Yijun Li et al.
DiffBody: Diffusion-Based Pose and Shape Editing of Human Images
Yuta Okuyama, Yuki Endo, Yoshihiro Kanamori
Segment Anything, From Space?
Simiao Ren, Francesco Luzi, Saad Lahrichi et al.
StyleAvatar: Stylizing Animatable Head Avatars
Juan C. Pérez, Thu Nguyen-Phuoc, Chen Cao et al.