跳到主要导航 跳到搜索 跳到主要内容

Domain Generalization via Discrete Codebook Learning

  • Shaocong Long
  • , Qianyu Zhou
  • , Xikun Jiang
  • , Chenhao Ying*
  • , Lizhuang Ma
  • , Yuan Luo*
  • *此作品的通讯作者
  • Shanghai Jiao Tong University
  • Jilin University
  • University of Copenhagen

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Domain generalization (DG) strives to address distribution shifts across diverse environments to enhance model's generalizability. Current DG approaches are confined to acquiring robust representations with continuous features, specifically training at the pixel level. However, this DG paradigm may struggle to mitigate distribution gaps in dealing with a large space of continuous features, rendering it susceptible to pixel details that exhibit spurious correlations or noise. In this paper, we first theoretically demonstrate that the domain gaps in continuous representation learning can be reduced by the discretization process. Based on this inspiring finding, we introduce a novel learning paradigm for DG, termed Discrete Domain Generalization (DDG). DDG proposes to use a codebook to quantize the feature map into discrete codewords, aligning semantic-equivalent information in a shared discrete representation space that prioritizes semantic-level information over pixel-level intricacies. By learning at the semantic level, DDG diminishes the number of latent features, optimizing the utilization of the representation space and alleviating the risks associated with the wide-ranging space of continuous features. Extensive experiments across widely employed benchmarks in DG demonstrate DDG's superior performance compared to state-of-the-art approaches, underscoring its potential to reduce the distribution gaps and enhance the model's generalizability.

源语言英语
主期刊名2025 IEEE International Conference on Multimedia and Expo
主期刊副标题Journey to the Center of Machine Imagination, ICME 2025 - Conference Proceedings
出版商IEEE Computer Society
ISBN(电子版)9798331594954
DOI
出版状态已出版 - 2025
已对外发布
活动2025 IEEE International Conference on Multimedia and Expo, ICME 2025 - Nantes, 法国
期限: 30 6月 20254 7月 2025

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2025 IEEE International Conference on Multimedia and Expo, ICME 2025
国家/地区法国
Nantes
时期30/06/254/07/25

指纹

探究 'Domain Generalization via Discrete Codebook Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此