TY - GEN
T1 - Lasso screening for object categories recognition using multi-directional context features
AU - Shen, Danfei
AU - Cao, Guitao
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/6/10
Y1 - 2016/6/10
N2 - Image representation using local features and sparse coding (SC) plays a very important role in image classification when the dataset is fairly large. Despite of its worldwide popularity, there are still some improving space in classification efficiency and computational investment in training and coding phrase of SC. In this paper, we put forward a novel object categories recognition method from two aspects. First, the contextual relevance between image patches are fully utilized by merging local feature of every sub-patch with its neighboring ones into strong context features to generate the multiple sparse representations, which are received by the SC and multi-scale max pooling SPM(Spatial Pyramid Matching), respectively. Second, while calculating the sparse coefficients of SC, we need to solve L1-regularized least square problem. Screening out the zero coefficients and discarding the corresponding inactive codewords before solving Lasso problem can remarkably speed up the optimization. The proposed method outperforms state-of-the-art performance in a large number of image categorization experiments on several benchmarks: the ground truth dataset (21 Land-Use database), the event dataset (UIUC-Sport dataset), and the object recognition dataset (Caltech101 dataset).
AB - Image representation using local features and sparse coding (SC) plays a very important role in image classification when the dataset is fairly large. Despite of its worldwide popularity, there are still some improving space in classification efficiency and computational investment in training and coding phrase of SC. In this paper, we put forward a novel object categories recognition method from two aspects. First, the contextual relevance between image patches are fully utilized by merging local feature of every sub-patch with its neighboring ones into strong context features to generate the multiple sparse representations, which are received by the SC and multi-scale max pooling SPM(Spatial Pyramid Matching), respectively. Second, while calculating the sparse coefficients of SC, we need to solve L1-regularized least square problem. Screening out the zero coefficients and discarding the corresponding inactive codewords before solving Lasso problem can remarkably speed up the optimization. The proposed method outperforms state-of-the-art performance in a large number of image categorization experiments on several benchmarks: the ground truth dataset (21 Land-Use database), the event dataset (UIUC-Sport dataset), and the object recognition dataset (Caltech101 dataset).
KW - Context Features
KW - Lasso Problem
KW - Object Categories Recognition
KW - Sparse Representation
UR - https://www.scopus.com/pages/publications/84979683479
U2 - 10.1109/PIC.2015.7489832
DO - 10.1109/PIC.2015.7489832
M3 - 会议稿件
AN - SCOPUS:84979683479
T3 - Proceedings of 2015 IEEE International Conference on Progress in Informatics and Computing, PIC 2015
SP - 173
EP - 180
BT - Proceedings of 2015 IEEE International Conference on Progress in Informatics and Computing, PIC 2015
A2 - Xiao, Liang
A2 - Wang, Yinglin
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE International Conference on Progress in Informatics and Computing, PIC 2015
Y2 - 18 December 2015 through 20 December 2015
ER -