Animal Detection from Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification

Zhi Zhang, Zhihai He, Guitao Cao, Wenming Cao

Research output: Contribution to journalArticlepeer-review

111 Scopus citations

Abstract

In this paper, we consider the animal object detection and segmentation from wildlife monitoring videos captured by motion-triggered cameras, called camera-traps. For these types of videos, existing approaches often suffer from low detection rates due to low contrast between the foreground animals and the cluttered background, as well as high false positive rates due to the dynamic background. To address this issue, we first develop a new approach to generate animal object region proposals using multilevel graph cut in the spatiotemporal domain. We then develop a cross-frame temporal patch verification method to determine if these region proposals are true animals or background patches. We construct an efficient feature description for animal detection using joint deep learning and histogram of oriented gradient features encoded with Fisher vectors. Our extensive experimental results and performance comparisons over a diverse set of challenging camera-trap data demonstrate that the proposed spatiotemporal object proposal and patch verification framework outperforms the state-of-the-art methods, including the recent Faster-RCNN method, on animal object detection accuracy by up to 4.5%.

Original languageEnglish
Article number7523423
Pages (from-to)2079-2092
Number of pages14
JournalIEEE Transactions on Multimedia
Volume18
Issue number10
DOIs
StatePublished - Oct 2016

Keywords

  • Background modeling
  • Object verification
  • camera-trap images
  • graph cut
  • object proposal

Fingerprint

Dive into the research topics of 'Animal Detection from Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification'. Together they form a unique fingerprint.

Cite this