Multiscale Brain-Like Neural Network for Saliency Prediction on Omnidirectional Images

  • Dandan Zhu
  • , Yongqing Chen
  • , Defang Zhao
  • , Yucheng Zhu
  • , Qiangqiang Zhou*
  • , Guangtao Zhai
  • , Xiaokang Yang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

Current top-performing saliency prediction methods of omnidirectional images (ODIs) depend on deep feedforward convolutional neural networks (CNNs), benefiting from their powerful multiscale representation ability. Although these methods adopt deep feedforward CNNs to achieve superb performance in saliency prediction task, they have the following limitations: 1) these deep feedforward CNNs are difficult to map to ventral stream structure of the brain visual system due to their vast number of layers and missing biologically important connections, such as recurrence and 2) most deep feedforward CNNs represent the multiscale features in a layerwise manner. To tackle these issues, models that could learn multiscale features yet share the similarities with human brain are needed. In this article, we propose a novel multiscale brain-like network (MBN) model to predict saliency of head fixations on ODIs. Specifically, our proposed model consists of two major modules: 1) a brain-like CORnet-S module and 2) a multiscale feature extraction module. The CORnet-S module is a lightweight backbone network with four anatomically mapped areas (V1, V2, V4, and IT) and it can simulate the visual processing mechanism of ventral visual stream in the human brain. The multiscale feature extraction module is inspired by the multiscale brain structure, which represents multiscale features at a granular level and increases the range of receptive fields for each network layer. Extensive experiments and ablation studies conducted on two major benchmarks demonstrate the superiority of the proposed MBN model over the state-of-the-art methods.

Original languageEnglish
Pages (from-to)507-518
Number of pages12
JournalIEEE Transactions on Cognitive and Developmental Systems
Volume14
Issue number2
DOIs
StatePublished - 1 Jun 2022
Externally publishedYes

Keywords

  • Brain-like neural network
  • lightweight model
  • multiscale features
  • omnidirectional image
  • saliency prediction

Fingerprint

Dive into the research topics of 'Multiscale Brain-Like Neural Network for Saliency Prediction on Omnidirectional Images'. Together they form a unique fingerprint.

Cite this