跳到主要导航 跳到搜索 跳到主要内容

Distributed Estimation of Principal Support Vector Machines for Sufficient Dimension Reduction

  • Jun Jin
  • , Chao Ying
  • , Zhou Yu*
  • *此作品的通讯作者
  • Henry Ford Health System
  • University of Wisconsin-Madison

科研成果: 期刊稿件文章同行评审

摘要

The principal support vector machines method is a powerful tool for sufficient dimension reduction that replaces original predictors with their low-dimensional linear combinations while preserving the information for regression and classification. However, the computational burden of the principal support vector machines method constrains its use for massive data. To address this issue, we propose a naive and a refined distributed estimation algorithms for fast implementation when the sample size is large. Both distributed sufficient dimension reduction estimators exhibit the same statistical efficiency as when all the data is merged together, which provides rigorous statistical guarantees for their application to large-scale datasets, while the refined method requires smaller batch sample sizes and hence is more advantageous when memory limitations exist on distributed machines. The two distributed algorithms are further adapted to principal weighted support vector machines for sufficient dimension reduction in binary classification. The statistical accuracy and computational complexity of our proposed methods are examined through comprehensive simulation studies and in a real data application with more than 600,000 samples.

源语言英语
页(从-至)254-266
页数13
期刊Technometrics
67
2
DOI
出版状态已出版 - 2025

指纹

探究 'Distributed Estimation of Principal Support Vector Machines for Sufficient Dimension Reduction' 的科研主题。它们共同构成独一无二的指纹。

引用此