GSMC: A Global-Local Scalable Multi-task Contrastive Learning Framework

  • Yongqi Huang
  • , Feng Liu*
  • , Aimin Zhou*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Due to the ambiguity of facial expressions, the subjectivity of annotators, and the low quality of images, noisy labels pose a significant challenge in the field of facial expression recognition. We address the noise label problem based on the principle of contrastive learning. Through CAM visualization, we found that when noisy labels are present, the model can make correct predictions due to its inability to memorize noisy labels from different data augmentation perspectives. Inspired by this observation, we propose the GSMC method, which enforces model consistency between the predictions of the original image and its augmented versions. Specifically, we divide the consistency requirement into two different tasks. The first task uses attention map consistency, emphasizing preventing the model from memorizing noise labels. The second task employs self-distillation learning, where the consistency is achieved through the predictions of student and self-distillation networks, enabling the model to learn more robust label distributions. Our framework, based on ResNet18, achieves a 90.54% accuracy on the RAF-DB dataset, achieving State-Of-The-Art performance. The code will be available at https://github.com/ECNU-Cross-Innovation-Lab/GSMC.

Original languageEnglish
Title of host publicationAdvances in Computer Graphics - 41st Computer Graphics International Conference, CGI 2024, Proceedings
EditorsNadia Magnenat-Thalmann, Jinman Kim, Bin Sheng, Zhigang Deng, Daniel Thalmann, Ping Li
PublisherSpringer Science and Business Media Deutschland GmbH
Pages297-308
Number of pages12
ISBN (Print)9783031818059
DOIs
StatePublished - 2025
Event41st Computer Graphics International Conference, CGI 2024 - Geneva, Switzerland
Duration: 1 Jul 20245 Jul 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15338 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference41st Computer Graphics International Conference, CGI 2024
Country/TerritorySwitzerland
CityGeneva
Period1/07/245/07/24

Keywords

  • attention consistency
  • computational perception
  • contrastive learning
  • emotion recognition
  • noisy labels

Fingerprint

Dive into the research topics of 'GSMC: A Global-Local Scalable Multi-task Contrastive Learning Framework'. Together they form a unique fingerprint.

Cite this