Sparse spatially clustered coefficient model via adaptive regularization

Yan Zhong, Huiyan Sang, Scott J. Cook, Paul M. Kellstedt

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Large spatial datasets with many spatial covariates have become ubiquitous in many fields in recent years. A question of interest is to identify which covariates are likely to influence a spatial response, and whether and how the effects of these covariates vary across space, including potential abrupt changes from region to region. To solve this question, a new efficient regularized spatially clustered coefficient (RSCC) regression approach is proposed, which could achieve variable selection and identify latent spatially heterogeneous covariate effects with clustered patterns simultaneously. By carefully designing the regularization term of RSCC as a chain graph guided fusion penalty plus a group lasso penalty, the RSCC model is computationally efficient for large spatial datasets while still achieving the theoretical guarantees for estimation. RSCC also adopts the idea of adaptive learning to allow for adaptive weights and adaptive graphs in its regularization terms and further improves the estimation performance. RSCC is applied to study the acceptance of COVID-19 vaccines using county-level data in the United States and discover the determinants of vaccination acceptance with varying effects across counties, revealing important within-state and across-state spatially clustered patterns of covariates effects.

Original languageEnglish
Article number107581
JournalComputational Statistics and Data Analysis
Volume177
DOIs
StatePublished - Jan 2022

Keywords

  • COVID-19 vaccination acceptance
  • Spatial variable selection
  • Variable-dependent graph
  • Varying coefficient regression

Fingerprint

Dive into the research topics of 'Sparse spatially clustered coefficient model via adaptive regularization'. Together they form a unique fingerprint.

Cite this