2023
CVPR
CVPR 2023
Category Query Learning for Human-Object Interaction Classification
Abstract
Unlike most previous HOI methods that focus on learning better human-object features, we propose a novel and complementary approach called category query learning. Such queries are explicitly associated to interaction categories, converted to image specific category representation via a transformer decoder, and learnt via an auxiliary image-level classification task. This idea is motivated by an earlier multi-label image classification method, but is for the first time applied for the challenging human-object interaction classification task. Our method is simple, general and effective. It is validated on three representative HOI baselines and achieves new state-of-the-art results on two benchmarks.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning
🧭
Keyword Pioneer
— category query
🐣
Hot Topic Early Bird
— visual understanding
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Classification
Deep Learning > Architectures > Transformers
Computer Vision > Analysis > Action Recognition
Computer Vision > Analysis > Object Detection
Machine Learning > Learning Types > Multi-Task Learning
Artificial Intelligence > Core AI > Computer Vision
Artificial Intelligence > Core AI > Multi-Modal Learning