Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis

Yang Jiao; Yan Gao; Jingjing Meng; Jin Shang; Yi Sun

2023 CVPR CVPR 2023

Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis

Abstract

Fashion representation learning involves the analysis and understanding of various visual elements at different granularities and the interactions among them. Existing works often learn fine-grained fashion representations at the attribute-level without considering their relationships and inter-dependencies across different classes. In this work, we propose to learn an attribute and class specific fashion representation duet to better model such attribute relationships and inter-dependencies by leveraging prior knowledge about the taxonomy of fashion attributes and classes. Through two sub-networks for the attributes and classes, respectively, our proposed an embedding network progressively learn and refine the visual representation of a fashion image to improve its robustness for fashion retrieval. A multi-granularity loss consisting of attribute-level and class-level losses is proposed to introduce appropriate inductive bias to learn across different granularities of the fashion representations. Experimental results on three benchmark datasets demonstrate the effectiveness of our method, which outperforms the state-of-the-art methods with a large margin.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — multi-granularity loss

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yang Jiao , Yan Gao , Jingjing Meng , Jin Shang , Yi Sun

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Core Methods > Metric Learning Computer Vision > Analysis > Object Detection Computer Vision > Analysis > Semantic Segmentation Deep Learning > Learning Types > Representation Learning Computer Vision > Analysis > Image Classification

Keywords

representation learning metric learning embedding learning fine-grained classification attribute recognition fashion analysis fashion retrieval multi-granularity learning multi-granularity loss fashion image analysis fine-grained representation learning

Download PDF

Related papers

CORA: Adapting CLIP for Open-Vocabulary Detection With Region Prompting and Anchor Pre-Matching 2023

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars 2023

Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos 2023

Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement 2023

EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata 2023