Noisy label facial expression recognition via face-specific label distribution learning

Hyunuk Shin, Bokyeung Lee, Bonhwa Ku, Hanseok Ko

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

The uncertainty in Facial expression recognition (FER) data is caused by factors such as ambiguous facial expressions, low-resolution facial images, the subjectivity of the annotator or subject of expression, and compound expression. FER has been steadily evolving with the advent of deep learning, but the label inconsistency problem due to the uncertainty in large FER datasets is one of the challenges in FER. Noisy labels in the label inconsistency problem adversely affect emotion recognition results. In this paper, we propose a Face-Specific Label Distribution Learning (FSLDL) method, which encourages deep networks to predict the actual emotion distribution of the input itself, rather than the noisy single label. Under the assumption that the emotion distribution of a specific sample is similar to the emotion distribution of the augmented sample from a different viewpoint, we generate a new target label distribution using facial expression-specific augmented samples. In order to generate a sophisticated target label distribution, the importance weights of the augmented samples are extracted and multiplied by each predicted emotion distribution. In addition, we compensate for the lack of information in the target label distribution by calculating the uncertainty of the provided label, and use it for model training. Finally, we make a more robust model by adding a rank regularization loss function for the importance weights and a discriminative loss function for the feature vectors. Representative experiments demonstrated that FSLDL outperforms the state-of-the-art on FER datasets such as RAF-DB, AffectNet, and SFEW. In addition, we demonstrated the effectiveness of the proposed method through various noisy label injection experiments.

Original languageEnglish
Article number104901
JournalImage and Vision Computing
Volume143
DOIs
Publication statusPublished - 2024 Mar

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

Keywords

  • 0000
  • 1111
  • Ambiguity
  • Emotion recognition
  • Facial expression recognition (FER)
  • Label distribution learning (LDL)
  • Noisy label
  • Uncertainty

ASJC Scopus subject areas

  • Signal Processing
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Noisy label facial expression recognition via face-specific label distribution learning'. Together they form a unique fingerprint.

Cite this