
Multiscale Brain-Like Neural Network for Saliency Prediction on Omnidirectional Images

  • Dandan Zhu
  • Yongqing Chen
  • Defang Zhao
  • Yucheng Zhu
  • Qiangqiang Zhou*
  • Guangtao Zhai
  • Xiaokang Yang

*Corresponding author for this work

  • Shanghai Jiao Tong University
  • Hainan University
  • Tongji University
  • Jiangxi Normal University

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Current top-performing saliency prediction methods for omnidirectional images (ODIs) depend on deep feedforward convolutional neural networks (CNNs), benefiting from their powerful multiscale representation ability. Although these methods achieve strong performance on the saliency prediction task, they have the following limitations: 1) deep feedforward CNNs are difficult to map to the ventral stream structure of the brain's visual system because of their vast number of layers and their lack of biologically important connections, such as recurrence; and 2) most deep feedforward CNNs represent multiscale features in a layerwise manner. To tackle these issues, models that can learn multiscale features yet share similarities with the human brain are needed. In this article, we propose a novel multiscale brain-like network (MBN) model to predict the saliency of head fixations on ODIs. Specifically, our proposed model consists of two major modules: 1) a brain-like CORnet-S module and 2) a multiscale feature extraction module. The CORnet-S module is a lightweight backbone network with four anatomically mapped areas (V1, V2, V4, and IT), and it can simulate the visual processing mechanism of the ventral visual stream in the human brain. The multiscale feature extraction module is inspired by the multiscale structure of the brain; it represents multiscale features at a granular level and increases the range of receptive fields for each network layer. Extensive experiments and ablation studies conducted on two major benchmarks demonstrate the superiority of the proposed MBN model over state-of-the-art methods.
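
The abstract does not say how the multiscale feature extraction module is built. As a rough illustration of what representing features "at a granular level" could mean, the sketch below assumes a Res2Net-style hierarchical split: channels are divided into groups, and each group is filtered after being summed with the previous group's output, so later groups see a wider receptive field inside a single layer. This design is an assumption, not the authors' confirmed method, and the function names (`conv3`, `hierarchical_multiscale`, `receptive_field`) are hypothetical. A 1-D, 3-tap box filter stands in for a 3x3 convolution.

```python
def conv3(signal):
    """3-tap box filter with zero padding (a stand-in for a 3x3 convolution)."""
    padded = [0.0] + list(signal) + [0.0]
    return [padded[i] + padded[i + 1] + padded[i + 2] for i in range(len(signal))]


def hierarchical_multiscale(groups):
    """Process channel groups hierarchically (assumed Res2Net-style design).

    The first group passes through unchanged. Each later group is summed with
    the previous group's filtered output before being filtered itself, so the
    receptive field grows from one group to the next within a single layer.
    """
    outputs = [list(groups[0])]
    previous = None
    for group in groups[1:]:
        if previous is None:
            mixed = list(group)
        else:
            mixed = [g + p for g, p in zip(group, previous)]
        previous = conv3(mixed)
        outputs.append(previous)
    return outputs


def receptive_field(response):
    """Width of an impulse response, counted as its nonzero positions."""
    return sum(1 for value in response if value != 0)


# Feed the same centered impulse into four channel groups and measure how far
# it spreads in each group's output.
impulse = [0.0] * 9
impulse[4] = 1.0
outputs = hierarchical_multiscale([impulse] * 4)
widths = [receptive_field(out) for out in outputs]
print(widths)
```

In this toy setting, the spread of the impulse widens from group to group, which is the sense in which a single layer can cover several receptive-field sizes rather than relying only on depth.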

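Original language: English
Pages (from-to): 507-518
Number of pages: 12
Journal: IEEE Transactions on Cognitive and Developmental Systems
Volume: 14
Issue number: 2
DOI
Publication status: Published - 1 Jun 2022
Externally published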
