跳到主要导航 跳到搜索 跳到主要内容

Dynamical textures modeling via joint video dictionary learning

  • Xian Wei
  • , Yuanxiang Li*
  • , Hao Shen
  • , Fang Chen
  • , Martin Kleinsteuber
  • , Zhongfeng Wang
  • *此作品的通讯作者
  • Shanghai Jiao Tong University
  • Technical University of Munich
  • Mercateo AG
  • Nanjing University

科研成果: 期刊稿件文章同行评审

摘要

Video representation is an important and challenging task in the computer vision community. In this paper, we consider the problem of modeling and classifying video sequences of dynamic scenes which could be modeled in a dynamic textures (DTs) framework. At first, we assume that image frames of a moving scene can be modeled as a Markov random process. We propose a sparse coding framework, named joint video dictionary learning (JVDL), to model a video adaptively. By treating the sparse coefficients of image frames over a learned dictionary as the underlying 'states', we learn an efficient and robust linear transition matrix between two adjacent frames of sparse events in time series. Hence, a dynamic scene sequence is represented by an appropriate transition matrix associated with a dictionary. In order to ensure the stability of JVDL, we impose several constraints on such transition matrix and dictionary. The developed framework is able to capture the dynamics of a moving scene by exploring both the sparse properties and the temporal correlations of consecutive video frames. Moreover, such learned JVDL parameters can be used for various DT applications, such as DT synthesis and recognition. Experimental results demonstrate the strong competitiveness of the proposed JVDL approach in comparison with the state-of-the-art video representation methods. Especially, it performs significantly better in dealing with DT synthesis and recognition on heavily corrupted data.

源语言英语
文章编号7893795
页(从-至)2929-2943
页数15
期刊IEEE Transactions on Image Processing
26
6
DOI
出版状态已出版 - 6月 2017
已对外发布

指纹

探究 'Dynamical textures modeling via joint video dictionary learning' 的科研主题。它们共同构成独一无二的指纹。

引用此