TY - JOUR
T1 - A General M-estimation Theory in Semi-Supervised Framework
AU - Song, Shanshan
AU - Lin, Yuanyuan
AU - Zhou, Yong
N1 - Publisher Copyright:
© 2023 American Statistical Association.
PY - 2024
Y1 - 2024
N2 - We study a class of general M-estimators in the semi-supervised setting, wherein the data are typically a combination of a relatively small labeled dataset and large amounts of unlabeled data. A new estimator, which efficiently uses the useful information contained in the unlabeled data, is proposed via a projection technique. We prove consistency and asymptotic normality, and provide an inference procedure based on (Formula presented.) -fold cross-validation. The optimal weights are derived to balance the contributions of the labeled and unlabeled data. It is shown that the proposed method, by taking advantage of the unlabeled data, produces asymptotically more efficient estimation of the target parameters than the supervised counterpart. Supportive numerical evidence is shown in simulation studies. Applications are illustrated in analysis of the homeless data in Los Angeles. Supplementary materials for this article are available online.
AB - We study a class of general M-estimators in the semi-supervised setting, wherein the data are typically a combination of a relatively small labeled dataset and large amounts of unlabeled data. A new estimator, which efficiently uses the useful information contained in the unlabeled data, is proposed via a projection technique. We prove consistency and asymptotic normality, and provide an inference procedure based on (Formula presented.) -fold cross-validation. The optimal weights are derived to balance the contributions of the labeled and unlabeled data. It is shown that the proposed method, by taking advantage of the unlabeled data, produces asymptotically more efficient estimation of the target parameters than the supervised counterpart. Supportive numerical evidence is shown in simulation studies. Applications are illustrated in analysis of the homeless data in Los Angeles. Supplementary materials for this article are available online.
KW - Projection method
KW - Semi-supervised inference
KW - Weighted loss function
UR - https://www.scopus.com/pages/publications/85149380793
U2 - 10.1080/01621459.2023.2169699
DO - 10.1080/01621459.2023.2169699
M3 - 文章
AN - SCOPUS:85149380793
SN - 0162-1459
VL - 119
SP - 1065
EP - 1075
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 546
ER -