Semi-supervised Inference for Block-wise Missing Data without Imputation

  • Shanshan Song
  • , Yuanyuan Lin*
  • , Yong Zhou*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

We consider statistical inference for single or low-dimensional parameters in a high-dimensional linear model under a semi-supervised setting, wherein the data are a combination of a labelled block-wise missing data set of a relatively small size and a large unlabelled data set. The proposed method utilises both labelled and unlabelled data without any imputation or removal of the missing observations. The asymptotic properties of the estimator are established under regularity conditions. Hypothesis testing for low-dimensional coefficients are also studied. Extensive simulations are conducted to examine the theoretical results. The method is evaluated on the Alzheimer’s Disease Neuroimaging Initiative data.

Original languageEnglish
JournalJournal of Machine Learning Research
Volume25
StatePublished - 2024

Keywords

  • block-missing data
  • confidence intervals
  • hypothesis testing
  • semi-supervised inference

Fingerprint

Dive into the research topics of 'Semi-supervised Inference for Block-wise Missing Data without Imputation'. Together they form a unique fingerprint.

Cite this