TY - JOUR
T1 - Sparse spatially clustered coefficient model via adaptive regularization
AU - Zhong, Yan
AU - Sang, Huiyan
AU - Cook, Scott J.
AU - Kellstedt, Paul M.
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/1
Y1 - 2022/1
N2 - Large spatial datasets with many spatial covariates have become ubiquitous in many fields in recent years. A question of interest is to identify which covariates are likely to influence a spatial response, and whether and how the effects of these covariates vary across space, including potential abrupt changes from region to region. To solve this question, a new efficient regularized spatially clustered coefficient (RSCC) regression approach is proposed, which could achieve variable selection and identify latent spatially heterogeneous covariate effects with clustered patterns simultaneously. By carefully designing the regularization term of RSCC as a chain graph guided fusion penalty plus a group lasso penalty, the RSCC model is computationally efficient for large spatial datasets while still achieving the theoretical guarantees for estimation. RSCC also adopts the idea of adaptive learning to allow for adaptive weights and adaptive graphs in its regularization terms and further improves the estimation performance. RSCC is applied to study the acceptance of COVID-19 vaccines using county-level data in the United States and discover the determinants of vaccination acceptance with varying effects across counties, revealing important within-state and across-state spatially clustered patterns of covariates effects.
AB - Large spatial datasets with many spatial covariates have become ubiquitous in many fields in recent years. A question of interest is to identify which covariates are likely to influence a spatial response, and whether and how the effects of these covariates vary across space, including potential abrupt changes from region to region. To solve this question, a new efficient regularized spatially clustered coefficient (RSCC) regression approach is proposed, which could achieve variable selection and identify latent spatially heterogeneous covariate effects with clustered patterns simultaneously. By carefully designing the regularization term of RSCC as a chain graph guided fusion penalty plus a group lasso penalty, the RSCC model is computationally efficient for large spatial datasets while still achieving the theoretical guarantees for estimation. RSCC also adopts the idea of adaptive learning to allow for adaptive weights and adaptive graphs in its regularization terms and further improves the estimation performance. RSCC is applied to study the acceptance of COVID-19 vaccines using county-level data in the United States and discover the determinants of vaccination acceptance with varying effects across counties, revealing important within-state and across-state spatially clustered patterns of covariates effects.
KW - COVID-19 vaccination acceptance
KW - Spatial variable selection
KW - Variable-dependent graph
KW - Varying coefficient regression
UR - https://www.scopus.com/pages/publications/85135792864
U2 - 10.1016/j.csda.2022.107581
DO - 10.1016/j.csda.2022.107581
M3 - 文章
AN - SCOPUS:85135792864
SN - 0167-9473
VL - 177
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
M1 - 107581
ER -