TY - GEN
T1 - Scaling up link prediction with ensembles
AU - Duan, Liang
AU - Aggarwal, Charu
AU - Ma, Shuai
AU - Hu, Renjun
AU - Huai, Jinpeng
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/2/8
Y1 - 2016/2/8
N2 - A network with n nodes contains O(n2) possible links. Even for networks of modest size, it is often difficult to evaluate all pair-wise possibilities for links in a meaningful way. Furthermore, even though link prediction is closely related to missing value estimation problems, such as collaborative filtering, it is often difficult to use sophisticated models such as latent factor methods because of their computational complexity over very large networks. Due to this computational complexity, most known link prediction methods are designed for evaluating the link propensity over a specified subset of links, rather than for performing a global search over the entire networks. In practice, however, it is essential to perform an exhaustive search over the entire networks. In this paper, we propose an ensemble enabled approach to scaling up link prediction, which is able to decompose traditional link prediction problems into sub-problems of smaller size. These subproblems are each solved with the use of latent factor models, which can be effectively implemented over networks of modest size. Furthermore, the ensemble enabled approach has several advantages in terms of performance. We show the advantage of using ensemble-based latent factor models with experiments on very large networks. Experimental results demonstrate the effectiveness and scalability of our approach.
AB - A network with n nodes contains O(n2) possible links. Even for networks of modest size, it is often difficult to evaluate all pair-wise possibilities for links in a meaningful way. Furthermore, even though link prediction is closely related to missing value estimation problems, such as collaborative filtering, it is often difficult to use sophisticated models such as latent factor methods because of their computational complexity over very large networks. Due to this computational complexity, most known link prediction methods are designed for evaluating the link propensity over a specified subset of links, rather than for performing a global search over the entire networks. In practice, however, it is essential to perform an exhaustive search over the entire networks. In this paper, we propose an ensemble enabled approach to scaling up link prediction, which is able to decompose traditional link prediction problems into sub-problems of smaller size. These subproblems are each solved with the use of latent factor models, which can be effectively implemented over networks of modest size. Furthermore, the ensemble enabled approach has several advantages in terms of performance. We show the advantage of using ensemble-based latent factor models with experiments on very large networks. Experimental results demonstrate the effectiveness and scalability of our approach.
KW - Big data
KW - Ensembles
KW - Link prediction
KW - Networks
UR - https://www.scopus.com/pages/publications/84964328522
U2 - 10.1145/2835776.2835815
DO - 10.1145/2835776.2835815
M3 - 会议稿件
AN - SCOPUS:84964328522
T3 - WSDM 2016 - Proceedings of the 9th ACM International Conference on Web Search and Data Mining
SP - 367
EP - 376
BT - WSDM 2016 - Proceedings of the 9th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
T2 - 9th ACM International Conference on Web Search and Data Mining, WSDM 2016
Y2 - 22 February 2016 through 25 February 2016
ER -