跳到主要导航 跳到搜索 跳到主要内容

Prototype vector machine for large scale semi-supervised learning

  • Kai Zhang*
  • , James T. Kwok
  • , Bahram Parvin
  • *此作品的通讯作者
  • Lawrence Berkeley National Laboratory
  • Hong Kong University of Science and Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Practical data mining rarely falls exactly into the supervised learning scenario. Rather, the growing amount of unlabeled data poses a big challenge to large-scale semi-supervised learning (SSL). We note that the computational intensiveness of graph-based SSL arises largely from the manifold or graph regularization, which in turn lead to large models that are difficult to handle. To alleviate this, we proposed the prototype vector machine (PVM), a highly scalable, graph-based algorithm for large-scale SSL. Our key innovation is the use of "prototypes vectors" for efficient approximation on both the graph-based regularizer and model representation. The choice of prototypes are grounded upon two important criteria: they not only perform effective low-rank approximation of the kernel matrix, but also span a model suffering the minimum information loss compared with the complete model. We demonstrate encouraging performance and appealing scaling properties of the PVM on a number of machine learning benchmark data sets.

源语言英语
主期刊名Proceedings of the 26th Annual International Conference on Machine Learning, ICML'09
DOI
出版状态已出版 - 2009
已对外发布
活动26th Annual International Conference on Machine Learning, ICML'09 - Montreal, QC, 加拿大
期限: 14 6月 200918 6月 2009

出版系列

姓名ACM International Conference Proceeding Series
382

会议

会议26th Annual International Conference on Machine Learning, ICML'09
国家/地区加拿大
Montreal, QC
时期14/06/0918/06/09

指纹

探究 'Prototype vector machine for large scale semi-supervised learning' 的科研主题。它们共同构成独一无二的指纹。

引用此