RePFormer: Refinement Pyramid Transformer for Robust Facial Landmark Detection

Jinpeng Li; Haibo Jin; Shengcai Liao; Ling Shao; Pheng-Ann Heng

2022 IJCAI IJCAI 2022

RePFormer: Refinement Pyramid Transformer for Robust Facial Landmark Detection

Abstract

This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection. Most facial landmark detectors focus on learning representative image features. However, these CNN-based feature representations are not robust enough to handle complex real-world scenarios due to ignoring the internal structure of landmarks, as well as the relations between landmarks and context. In this work, we formulate the facial landmark detection task as refining landmark queries along pyramid memories. Specifically, a pyramid transformer head (PTH) is introduced to build both homologous relations among landmarks and heterologous relations between landmarks and cross-scale contexts. Besides, a dynamic landmark refinement (DLR) module is designed to decompose the landmark regression into an end-to-end refinement procedure, where the dynamically aggregated queries are transformed to residual coordinates predictions. Extensive experimental results on four facial landmark detection benchmarks and their various subsets demonstrate the superior performance and high robustness of our framework.

🧭 Keyword Pioneer — facial landmark detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

Authors

Jinpeng Li , Haibo Jin , Shengcai Liao , Ling Shao , Pheng-Ann Heng

Topics

Deep Learning > Architectures > Transformers Computer Vision > Analysis > Face Recognition Computer Vision > Analysis > Human Pose Estimation Deep Learning > Learning Types > Representation Learning

Keywords

transformer architecture pose estimation human pose estimation facial landmark detection face alignment keypoint detection pyramid transformer query refinement refinement pyramid landmark regression

Download PDF

Related papers

Better Collective Decisions via Uncertainty Reduction 2022

Mixed Strategies for Security Games with General Defending Requirements 2022

Achieving Envy-Freeness with Limited Subsidies under Dichotomous Valuations 2022

Distortion in Voting with Top-t Preferences 2022

Let’s Agree to Agree: Targeting Consensus for Incomplete Preferences through Majority Dynamics 2022