Feature Enhancement Module Based on Class-Centric Loss for Fine-Grained Visual Classification

  • Daohui Wang
  • He Xinyu
  • Shujing Lyu
  • Wei Tian
  • Yue Lu*
  • *Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

We propose a novel feature enhancement module designed for fine-grained visual classification tasks, which can be seamlessly integrated into various backbone architectures, including both convolutional neural network (CNN)-based and Transformer-based networks. The plug-and-play module outputs pixel-level feature maps and performs a weighted fusion of filtered features to enhance fine-grained feature representation. We introduce a class-centric loss function that optimizes the alignment of samples with their target class centers by pulling them toward the center of the target class while simultaneously pushing them away from the centers of the most visually similar nontarget classes. Soft labels are employed to mitigate overfitting, helping the model generalize to unseen examples. Our approach consistently delivers significant improvements in accuracy across various mainstream backbone architectures, underscoring its versatility and robustness. Furthermore, it achieves the highest accuracy on the NABirds (NAB) dataset and on our proprietary lock cylinder dataset.
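
The pull/push idea and the soft labels described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: squared Euclidean distance, a hinge with a hypothetical `margin` parameter, the nearest nontarget center as a stand-in for the "most visually similar" class, and uniform label smoothing with a hypothetical `eps`. The function names are also hypothetical and are not taken from the paper.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def class_centric_loss(feature, centers, target, margin=1.0):
    """Illustrative pull/push loss for one sample (not the paper's exact loss).

    feature: list of floats, the sample embedding
    centers: list of class-center vectors, one per class
    target:  index of the ground-truth class
    margin:  assumed hinge margin for the push term
    """
    # Pull term: distance to the target class center.
    pull = squared_distance(feature, centers[target])

    # Push term: the nearest nontarget center is used here as a proxy for
    # the most visually similar nontarget class (an assumption).
    nearest_other = min(
        squared_distance(feature, c)
        for k, c in enumerate(centers)
        if k != target
    )
    push = max(0.0, margin - nearest_other)

    return pull + push


def smooth_labels(target, num_classes, eps=0.1):
    """Uniform label smoothing: one common way to build soft labels."""
    off_value = eps / (num_classes - 1)
    return [1.0 - eps if k == target else off_value for k in range(num_classes)]


if __name__ == "__main__":
    centers = [[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]]
    print(class_centric_loss([0.2, 0.0], centers, target=0))
    print(smooth_labels(1, num_classes=3))
```

In this sketch a sample that sits on its own class center, and at least `margin` away from every other center, incurs zero loss; a sample drifting toward a neighboring center is penalized by both terms. In a real training setup the centers would typically be learned or updated from batch statistics, which the sketch leaves out.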

Keywords

  • Class center
  • Transformer
  • convolutional neural network (CNN)
  • fine-grained visual classification (FGVC)
  • soft label
