跳到主要导航 跳到搜索 跳到主要内容

Learning Dictionary for Visual Attention

  • East China Normal University
  • Chinese Academy of Sciences

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, the attention mechanism has shown outstanding competence in capturing global structure information and long-range relationships within data, thus enhancing the performance of deep vision models on various computer vision tasks. In this work, we propose a novel alternative dictionary learning-based attention (Dic-Attn) module, which models this issue as a decomposition and reconstruction problem with the sparsity prior, inspired by sparse coding in the human visual perception system. The proposed Dic-Attn module decomposes the input into a dictionary and corresponding sparse representations, allowing for the disentanglement of underlying nonlinear structural information in visual data and the reconstruction of an attention embedding. By applying transformation operations in the spatial and channel domains, the module dynamically selects the dictionary’s atoms and sparse representations. Finally, the updated dictionary and sparse representations capture the global contextual information and reconstruct the attention maps. The proposed Dic-Attn module is designed with plug-and-play compatibility and facilitates integration into deep attention encoders. Our approach offers an intuitive and elegant means to exploit the discriminative information from data, promoting visual attention construction. Extensive experimental results on various computer vision tasks, e.g., image and point cloud classification, validate that our method achieves promising performance, and shows a strong competitive comparison with state-of-the-art attention methods.

源语言英语
主期刊名Advances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
编辑A. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
出版商Neural information processing systems foundation
ISBN(电子版)9781713899921
出版状态已出版 - 2023
活动37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, 美国
期限: 10 12月 202316 12月 2023

出版系列

姓名Advances in Neural Information Processing Systems
36
ISSN(印刷版)1049-5258

会议

会议37th Conference on Neural Information Processing Systems, NeurIPS 2023
国家/地区美国
New Orleans
时期10/12/2316/12/23

指纹

探究 'Learning Dictionary for Visual Attention' 的科研主题。它们共同构成独一无二的指纹。

引用此