跳到主要导航 跳到搜索 跳到主要内容

Multi-scale 2D Representation Learning for weakly-supervised moment retrieval

  • CAS - Institute of Automation
  • University of Chinese Academy of Sciences
  • Horizon Robotics Inc.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Video moment retrieval aims to search the moment most relevant to a given language query. However, most existing methods in this community often require temporal boundary annotations which are expensive and time-consuming to label. Hence weakly supervised methods have been put forward recently by only using coarse video-level label. Despite effectiveness, these methods usually process moment candidates independently, while ignoring a critical issue that the natural temporal dependencies between candidates in different temporal scales. To cope with this issue, we propose a Multi-scale 2D Representation Learning method for weakly supervised video moment retrieval. Specifically, we first construct a two-dimensional map for each temporal scale to capture the temporal dependencies between candidates. Two dimensions in this map indicate the start and end time points of these candidates. Then, we select top-K candidates from each scale-varied map with a learnable convolutional neural network. With a newly designed Moments Evaluation Module, we obtain the alignment scores of the selected candidates. At last, the similarity between captions and language query is served as supervision for further training the candidates' selector. Experiments on two benchmark datasets Charades-STA and ActivityNet Captions demonstrate that our approach achieves superior performance to state-of-the-art results.

源语言英语
主期刊名Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
出版商Institute of Electrical and Electronics Engineers Inc.
8616-8623
页数8
ISBN(电子版)9781728188089
DOI
出版状态已出版 - 2020
已对外发布
活动25th International Conference on Pattern Recognition, ICPR 2020 - Virtual, Online, 意大利
期限: 10 1月 202115 1月 2021

出版系列

姓名Proceedings - International Conference on Pattern Recognition
ISSN(印刷版)1051-4651

会议

会议25th International Conference on Pattern Recognition, ICPR 2020
国家/地区意大利
Virtual, Online
时期10/01/2115/01/21

指纹

探究 'Multi-scale 2D Representation Learning for weakly-supervised moment retrieval' 的科研主题。它们共同构成独一无二的指纹。

引用此