Estimating equations inference with missing data

  • Yong Zhou*
  • , Alan T.K. Wan
  • , Xiaojing Wang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

81 Scopus citations

Abstract

There is a large and growing body of literature on estimating equation (EE) as an estimation approach. One basic property of EE that has been universally adopted in practice is that of unbiasedness, and there are deep conceptual reasons why unbiasedness is a desirable EE characteristic. This article deals with inference from EEs when data are missing at random. The investigation is motivated by the observation that direct imputation of missing data in EEs generally leads to EEs that are biased and, thus, violates a basic assumption of the EE approach. The main contribution of this article is that it goes beyond existing imputation methods and proposes a procedure whereby one mitigates the effects of missing data through a reformulation of EEs imputed through a kernel regression method. These (modified) EEs then constitute a basis for inference by the generalized method of moments (GMM) and empirical likelihood (EL). Asymptotic properties of the GMM and EL estimators of the unknown parameters are derived and analyzed. Unlike most of the literature, which deals with missingness in either covariate values or response data, our method allows for missingness in both sets of variables. Another important strength of our approach is that it allows auxiliary information to be handled successfully. We illustrate the method using a well-known wormy-fruits dataset and data from a study on Duchenne muscular dystrophy detection and compare our results with several existing methods via a simulation study.

Original languageEnglish
Pages (from-to)1187-1199
Number of pages13
JournalJournal of the American Statistical Association
Volume103
Issue number483
DOIs
StatePublished - Sep 2008
Externally publishedYes

Keywords

  • Empirical likelihood
  • Estimating equations
  • Generalized method of moments
  • Kernel regression
  • Missing at random
  • Reduced dimension

Fingerprint

Dive into the research topics of 'Estimating equations inference with missing data'. Together they form a unique fingerprint.

Cite this