跳到主要导航 跳到搜索 跳到主要内容

Human Attention Based Movie Summarization: Dataset and Baseline Model

  • Defang Zhao
  • , Dandan Zhu*
  • , Xiongkuo Min
  • , Jiaomin Yue
  • , Kaiwei Zhang
  • , Qiangqiang Zhou
  • , Guangtao Zhai
  • , Xiaokang Yang
  • *此作品的通讯作者
  • CloudWalk Technology
  • Donghua University
  • Shanghai Jiao Tong University
  • Jiangxi Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The movie summarization model can automatically edit a condensed and succinct version of the movie by selecting the keyframes. Previous works mainly resort to hand-crafted heuristics and most of them are unsupervised. Supervised movie summarization is a new research field and, there is currently no publicly suitable dataset available. Moreover, existing works only focus on the movies themselves while neglecting the audiences, who have the most say in which part of the movie is more attractive. To deal with the aforementioned limitations, we establish a human attention based movie summarization dataset Movie50. Specifically, we explore the human attention variations when watching videos and have the following findings: (1) The attention of humans is concentrated when watching keyframes. (2) The attention of humans is distracted when watching non-keyframes. Inspired by these findings, we collect the eye fixations of 20 participants when watching 50 movies and propose a novel human attention based annotation pipeline. In addition, we introduce A/V-MSNet, an audiovisual neural network that takes advantage of spatio-temporal visual and auditory information to better model human attention as well as exploit more plentiful information. Extensive experiments demonstrate the superiority of the proposed method.

源语言英语
主期刊名ICME 2022 - IEEE International Conference on Multimedia and Expo 2022, Proceedings
出版商IEEE Computer Society
ISBN(电子版)9781665485630
DOI
出版状态已出版 - 2022
已对外发布
活动2022 IEEE International Conference on Multimedia and Expo, ICME 2022 - Taipei, 中国台湾
期限: 18 7月 202222 7月 2022

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
2022-July
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2022 IEEE International Conference on Multimedia and Expo, ICME 2022
国家/地区中国台湾
Taipei
时期18/07/2222/07/22

指纹

探究 'Human Attention Based Movie Summarization: Dataset and Baseline Model' 的科研主题。它们共同构成独一无二的指纹。

引用此