TY - GEN
T1 - Lasso screening for object categories recognition using multi-directional context features
AU - Shen, Danfei
AU - Cao, Guitao
AU - Meng, Dan
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/1/13
Y1 - 2016/1/13
N2 - Image representation using local features and sparse coding (SC) plays a very important role in image classification when the dataset is fairly large. Despite its worldwide popularity, there is still room for improvement in classification efficiency and in the computational cost of the training and coding phases of SC. In this paper, we put forward a novel object category recognition method that addresses two aspects. First, the contextual relevance between image patches is fully utilized by merging the local feature of every sub-patch with those of its neighboring sub-patches into strong context features to generate multiple sparse representations, which are passed to SC and multi-scale max-pooling SPM (Spatial Pyramid Matching), respectively. Second, calculating the sparse coefficients of SC requires solving an L1-regularized least-squares problem. Screening out the zero coefficients and discarding the corresponding inactive codewords before solving the Lasso problem can remarkably speed up the optimization. The proposed method outperforms state-of-the-art methods in a large number of image categorization experiments on several benchmarks: the ground truth dataset (21 Land-Use database), the event dataset (UIUC-Sport dataset), and the object recognition dataset (Caltech101 dataset).
AB - Image representation using local features and sparse coding (SC) plays a very important role in image classification when the dataset is fairly large. Despite its worldwide popularity, there is still room for improvement in classification efficiency and in the computational cost of the training and coding phases of SC. In this paper, we put forward a novel object category recognition method that addresses two aspects. First, the contextual relevance between image patches is fully utilized by merging the local feature of every sub-patch with those of its neighboring sub-patches into strong context features to generate multiple sparse representations, which are passed to SC and multi-scale max-pooling SPM (Spatial Pyramid Matching), respectively. Second, calculating the sparse coefficients of SC requires solving an L1-regularized least-squares problem. Screening out the zero coefficients and discarding the corresponding inactive codewords before solving the Lasso problem can remarkably speed up the optimization. The proposed method outperforms state-of-the-art methods in a large number of image categorization experiments on several benchmarks: the ground truth dataset (21 Land-Use database), the event dataset (UIUC-Sport dataset), and the object recognition dataset (Caltech101 dataset).
KW - Context Features
KW - Lasso Problem
KW - Object Categories Recognition
KW - Sparse Representation
UR - https://www.scopus.com/pages/publications/84966687290
U2 - 10.1109/ISKE.2015.74
DO - 10.1109/ISKE.2015.74
M3 - Conference contribution
AN - SCOPUS:84966687290
T3 - Proceedings - The 2015 10th International Conference on Intelligent Systems and Knowledge Engineering, ISKE 2015
SP - 434
EP - 441
BT - Proceedings - The 2015 10th International Conference on Intelligent Systems and Knowledge Engineering, ISKE 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Conference on Intelligent Systems and Knowledge Engineering, ISKE 2015
Y2 - 24 November 2015 through 27 November 2015
ER -