跳到主要导航 跳到搜索 跳到主要内容

CFRL: Coarse-Fine Decoupled Representation Learning For Long-Tailed Recognition

  • Yiran Song
  • , Qianyu Zhou
  • , Kun Hu
  • , Lizhuang Ma
  • , Xuequan Lu
  • Shanghai Jiao Tong University
  • University of Sydney
  • La Trobe University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Data often faces a severe class imbalance issue in the real world, meaning that the number of instances within classes varies greatly, following a long-tailed distribution. In this case, the direct application of supervised learning yields poor performance. Existing long-tailed recognition (LTR) methods often heavily rely on the label information to enhance tail classes' accuracy at the expense of head class by an image-level end-to-end resampling strategy to address data distribution imbalance. Nevertheless, they neglect label bias, which can severely affect the LTR model's accuracy. In this paper, we propose a novel approach, namely Coarse-Fine Decoupled Representation Learning (CFRL) for LTR. Our core idea is to decouple data representations from the classifier and decompose representation learning into two stages: image-level and patch-level. Specifically, in the image-level stage, we leverage unsupervised learning on image-level information to reduce the impact of label bias caused by imbalanced datasets. In the patch-level stage, we introduce patch-level rotation augmentation as negative samples, forcing the model to acquire more comprehensive information. Our theoretical and empirical analyses demonstrate that the approach does not sacrifice the accuracy of head classes while significantly reducing the overfitting of tail classes, improving both of them. We showcase state-of-the-art results on CIFAR, ImageNet, and iNaturalist datasets. Furthermore, we illustrate that this training methodology can be combined with various existing Long-Tailed Recognition (LTR) methods, further enhancing their performance.

源语言英语
主期刊名Proceedings of the 6th ACM International Conference on Multimedia in Asia, MMAsia 2024
出版商Association for Computing Machinery, Inc
ISBN(电子版)9798400712739
DOI
出版状态已出版 - 28 12月 2024
已对外发布
活动6th ACM International Conference on Multimedia in Asia, MMAsia 2024 - Auckland, 新西兰
期限: 3 12月 20246 12月 2024

出版系列

姓名Proceedings of the 6th ACM International Conference on Multimedia in Asia, MMAsia 2024

会议

会议6th ACM International Conference on Multimedia in Asia, MMAsia 2024
国家/地区新西兰
Auckland
时期3/12/246/12/24

指纹

探究 'CFRL: Coarse-Fine Decoupled Representation Learning For Long-Tailed Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此