跳到主要导航 跳到搜索 跳到主要内容

Dual focus attention network for video emotion recognition

  • Haonan Qiu
  • , Liang He
  • , Feng Wang*
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Video emotion recognition is a challenging task due to complex scenes and various forms of emotion expression. Most existing works focus on fusing multiple features over the whole video clips. According to our observations, given a long video clip, the emotion is usually presented by only several actions/objects in a few short snippets, and the meaningful cues are buried in the noisy background. When human judging the emotion in videos, we first find the informative clips and then closely look for emotional cues in the frames. In this paper, we propose Dual Focus Attention Network to mimic this process. First, three kinds of features including action, object, and scene are extracted from videos. Second, Two attention modules are used to focus on the visual features of the videos from temporal and spatial dimensions respectively. With our dual focus attention network, we can effectively discover the most emotional frames along the time dimension and the most emotional visual cues in each frame. Our experiments conducted on two widely used datasets Ekman and VideoEmotion show that our proposed approach outperforms the existing approaches.

源语言英语
主期刊名2020 IEEE International Conference on Multimedia and Expo, ICME 2020
出版商IEEE Computer Society
ISBN(电子版)9781728113319
DOI
出版状态已出版 - 7月 2020
活动2020 IEEE International Conference on Multimedia and Expo, ICME 2020 - London, 英国
期限: 6 7月 202010 7月 2020

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
2020-July
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2020 IEEE International Conference on Multimedia and Expo, ICME 2020
国家/地区英国
London
时期6/07/2010/07/20

指纹

探究 'Dual focus attention network for video emotion recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此