Skip to main navigation Skip to search Skip to main content

Optimal subsampling for linear quantile regression models

  • Yan Fan
  • , Yukun Liu*
  • , Lixing Zhu
  • *Corresponding author for this work
  • Shanghai University of International Business and Economics
  • Beijing Normal University

Research output: Contribution to journalArticlepeer-review

Abstract

Subsampling techniques are efficient methods for handling big data. Quite a few optimal sampling methods have been developed for parametric models in which the loss functions are differentiable with respect to parameters. However, they do not apply to quantile regression (QR) models as the involved check function is not differentiable. To circumvent the non-differentiability problem, we consider directly estimating the linear QR coefficient by minimizing the Hansen–Hurwitz estimator of the usual loss function for QR. We establish the asymptotic normality of the resulting estimator under a generic sampling method, and then develop optimal subsampling methods for linear QR. In particular, we propose a one-stage subsampling method, which depends only on the lengths of covariates, and a two-stage subsampling method, which is a combination of the one-stage sampling and the ideal optimal subsampling methods. Our simulation and real data based simulation studies show that the two recommended sampling methods always outperform simple random sampling in terms of mean square error, whether the linear QR model is valid or not.

Original languageEnglish
Pages (from-to)1039-1057
Number of pages19
JournalCanadian Journal of Statistics
Volume49
Issue number4
DOIs
StatePublished - Dec 2021

Keywords

  • Big data
  • Hansen–Hurwitz estimator
  • linear quantile regression
  • optimal subsampling
  • uniform sampling

Fingerprint

Dive into the research topics of 'Optimal subsampling for linear quantile regression models'. Together they form a unique fingerprint.

Cite this