Task-Driven Progressive Part Localization for Fine-Grained Object Recognition

  • Chen Huang
  • , Zhihai He
  • , Guitao Cao*
  • , Wenming Cao
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

35 Scopus citations

Abstract

The problem of fine-grained object recognition is very challenging due to the subtle visual differences between different object categories. In this paper, we propose a task-driven progressive part localization (TPPL) approach for fine-grained object recognition. Most existing methods follow a two-step approach that first detects salient object parts to suppress the interference from background scenes and then classifies objects based on features extracted from these regions. The part detector and object classifier are often independently designed and trained. In this paper, our major finding is that the part detector should be jointly designed and progressively refined with the object classifier so that the detected regions can provide the most distinctive features for final object recognition. Specifically, we develop a part-based SPP-net (Part-SPP) as our baseline part detector. We then establish a TPPL framework, which takes the predicted boxes of Part-SPP as an initial guess, and then examines new regions in the neighborhood using a particle swarm optimization approach, searching for more discriminative image regions to maximize the objective function and the recognition performance. This procedure is performed in an iterative manner to progressively improve the joint part detection and object classification performance. Experimental results on the Caltech-UCSD-200-2011 dataset demonstrate that our method outperforms state-of-the-art fine-grained categorization methods both in part localization and classification, even without requiring a bounding box during testing.

Original languageEnglish
Article number7548295
Pages (from-to)2372-2383
Number of pages12
JournalIEEE Transactions on Multimedia
Volume18
Issue number12
DOIs
StatePublished - Dec 2016

Keywords

  • Deep learning
  • Spatial pyramid pooling
  • deformable part-based model
  • fine-grained recognition
  • regional convolutional neural network

Fingerprint

Dive into the research topics of 'Task-Driven Progressive Part Localization for Fine-Grained Object Recognition'. Together they form a unique fingerprint.

Cite this