A Unified and Interpretable Emotion Representation and Expression Generation

Reni Paskaleva; Mykyta Holubakha; Andela Ilic; Saman Motamed; Luc Van Gool; Danda Paudel

2024 CVPR CVPR 2024

A Unified and Interpretable Emotion Representation and Expression Generation

Abstract

Canonical emotions such as happy sad and fear are easy to understand and annotate. However emotions are often compound e.g. happily surprised and can be mapped to the action units (AUs) used for expressing emotions and trivially to the canonical ones. Intuitively emotions are continuous as represented by the arousal-valence (AV) model. An interpretable unification of these four modalities --namely Canonical Compound AUs and AV-- is highly desirable for a better representation and understanding of emotions. However such unification remains to be unknown in the current literature. In this work we propose an interpretable and unified emotion model referred as C2A2. We also develop a method that leverages labels of the non-unified models to annotate the novel unified one. Finally we modify the text-conditional diffusion models to understand continuous numbers which are then used to generate continuous expressions using our unified emotion model. Through quantitative and qualitative experiments we show that our generated images are rich and capture subtle expressions. Our work allows a fine-grained generation of expressions in conjunction with other textual inputs and offers a new label space for emotions at the same time.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Interdisciplinary and Machine Learning

🧭 Keyword Pioneer — canonical emotion

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Reni Paskaleva , Mykyta Holubakha , Andela Ilic , Saman Motamed , Luc Van Gool , Danda Paudel

Topics

Machine Learning > Core Methods > Representation Learning Deep Learning > Models > Diffusion Models Computer Vision > Generation > Image Generation Interdisciplinary > Social > Affective Computing

Keywords

image generation emotion recognition diffusion model facial expression continuous emotion action unit expression generation emotion representation canonical emotion

Download PDF

Related papers

DUSt3R: Geometric 3D Vision Made Easy 2024

Bezier Everywhere All at Once: Learning Drivable Lanes as Bezier Graphs 2024

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows 2024

Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization 2024

DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models 2024