Content-based video relevance prediction with multi-view multi-level deep interest network

  • Zeyuan Chen*
  • , Kai Xu
  • , Wei Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

This paper presents our solution for the Hulu Content-Based Video Relevance Prediction (CBVRP) challenge, which focuses on cold-start videos as candidates. The keys to success of this prediction scenario are to learn effective user and video representations. To this end, we develop a multi-view multi-level deep interest network (MMDIN), which involves a multi-level deep interest network to learn user and video representations in a single-view, and a late fusion technique to integrate their multi-view representations corresponding to different types of video features. Through the above manner, the cold-start video prediction could be handled well with representations through their past interaction behaviors with videos and video representations based on their multiple types of content profiles.

Original languageEnglish
Title of host publicationMM 2019 - Proceedings of the 27th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery, Inc
Pages2607-2611
Number of pages5
ISBN (Electronic)9781450368896
DOIs
StatePublished - 15 Oct 2019
Event27th ACM International Conference on Multimedia, MM 2019 - Nice, France
Duration: 21 Oct 201925 Oct 2019

Publication series

NameMM 2019 - Proceedings of the 27th ACM International Conference on Multimedia

Conference

Conference27th ACM International Conference on Multimedia, MM 2019
Country/TerritoryFrance
CityNice
Period21/10/1925/10/19

Keywords

  • Content-based video recommendation
  • Deep learning
  • Sequence modeling

Fingerprint

Dive into the research topics of 'Content-based video relevance prediction with multi-view multi-level deep interest network'. Together they form a unique fingerprint.

Cite this