跳到主要导航 跳到搜索 跳到主要内容

Saliency prediction on omnidirectional images with brain-like shallow neural network

  • Dandan Zhu*
  • , Yongqing Chen
  • , Xiongkuo Min
  • , Defang Zhao
  • , Yucheng Zhu
  • , Qiangqiang Zhou
  • , Xiaokang Yang
  • , Tian Han
  • *此作品的通讯作者
  • Shanghai Jiao Tong University
  • Hainan Air Traffic Management Sub-Bureau
  • Tongji University
  • Shanghai Business School
  • Stevens Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Deep feedforward convolutional neural networks (CNNs) perform well in the saliency prediction of omnidirectional images (ODIs), and have become the leading class of candidate models of the visual processing mechanism in the primate ventral stream. These CNNs have evolved from shallow network architecture to extremely deep and branching architecture to achieve superb performance in various vision tasks, yet it is unclear how brain-like they are. In particular, these deep feedforward CNNs are difficult to mapping to ventral stream structure of the brain visual system due to their vast number of layers and missing biologically-important connections, such as recurrence. To tackle this issue, some brain-like shallow neural networks are introduced. In this paper, we propose a novel brain-like network model for saliency prediction of head fixations on ODIs. Specifically, our proposed model consists of three modules: a CORnet-S module, a template feature extraction module and a ranking attention module (RAM). The CORnetS module is a lightweight artificial neural network (ANN) with four anatomically mapped areas (V1, V2, V4 and IT) and it can simulate the visual processing mechanism of ventral visual stream in the human brain. The template features extraction module is introduced to extract attention maps of ODIs and provide guidance for the feature ranking in the following RAM module. The RAM module is used to rank and select features that are important for fine-grained saliency prediction. Extensive experiments have validated the effectiveness of the proposed model in predicting saliency maps of ODIs, and the proposed model outperforms other state-of-the-art methods with similar scale.

源语言英语
主期刊名Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
出版商Institute of Electrical and Electronics Engineers Inc.
1665-1671
页数7
ISBN(电子版)9781728188089
DOI
出版状态已出版 - 2020
已对外发布
活动25th International Conference on Pattern Recognition, ICPR 2020 - Virtual, Online, 意大利
期限: 10 1月 202115 1月 2021

出版系列

姓名Proceedings - International Conference on Pattern Recognition
ISSN(印刷版)1051-4651

会议

会议25th International Conference on Pattern Recognition, ICPR 2020
国家/地区意大利
Virtual, Online
时期10/01/2115/01/21

指纹

探究 'Saliency prediction on omnidirectional images with brain-like shallow neural network' 的科研主题。它们共同构成独一无二的指纹。

引用此