TY - GEN
T1 - A Divide-and-Conquer Approach for Large-Scale Multi-label Learning
AU - Zhang, Wenjie
AU - Wang, Xiangfeng
AU - Yan, Junchi
AU - Zha, Hongyuan
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/6/30
Y1 - 2017/6/30
N2 - Recently, the multi-label learning has drawn considerable attention as it has many applications in text classification, image annotation and query/keyword suggestions etc. In recent years, a number of remedies have been proposed to address this challenging task. However, they are either tree based methods which has the expensive train costs or embedding based methods which has relatively lower accuracy since using simple reduction techniques. This paper addresses the issue by developing an efficient divide-and-conquer based approach. Specifically, it involves: a) utilizing the feature vector to cluster the training data into several clusters, b) reformulating the multi-label problems as recommended problems by treating each label as an item to be recommended, and c) learning an advanced factorization model to recommend the subset of labels to each point for local cluster. Extensive experiments on several real world multi-label datasets demonstrate the efficiency of our proposed algorithm.
AB - Recently, the multi-label learning has drawn considerable attention as it has many applications in text classification, image annotation and query/keyword suggestions etc. In recent years, a number of remedies have been proposed to address this challenging task. However, they are either tree based methods which has the expensive train costs or embedding based methods which has relatively lower accuracy since using simple reduction techniques. This paper addresses the issue by developing an efficient divide-and-conquer based approach. Specifically, it involves: a) utilizing the feature vector to cluster the training data into several clusters, b) reformulating the multi-label problems as recommended problems by treating each label as an item to be recommended, and c) learning an advanced factorization model to recommend the subset of labels to each point for local cluster. Extensive experiments on several real world multi-label datasets demonstrate the efficiency of our proposed algorithm.
KW - factorization machines
KW - multi-label learning
UR - https://www.scopus.com/pages/publications/85027696532
U2 - 10.1109/BigMM.2017.35
DO - 10.1109/BigMM.2017.35
M3 - 会议稿件
AN - SCOPUS:85027696532
T3 - Proceedings - 2017 IEEE 3rd International Conference on Multimedia Big Data, BigMM 2017
SP - 398
EP - 401
BT - Proceedings - 2017 IEEE 3rd International Conference on Multimedia Big Data, BigMM 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE International Conference on Multimedia Big Data, BigMM 2017
Y2 - 19 April 2017 through 21 April 2017
ER -