Statistical tests for non-independent partitions of large autocorrelated datasets

  • Anthony R. Ives*
  • , Likai Zhu
  • , Fangfang Wang
  • , Jun Zhu
  • , Clay J. Morrow
  • , Volker C. Radeloff
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Large sets of autocorrelated data are common in fields such as remote sensing and genomics. For example, remote sensing can produce maps of information for millions of pixels, and the information from nearby pixels will likely be spatially autocorrelated. Although there are well-established statistical methods for testing hypotheses using autocorrelated data, these methods become computationally impractical for large datasets. • The method developed here makes it feasible to perform F-tests, likelihood ratio tests, and t-tests for large autocorrelated datasets. The method involves subsetting the dataset into partitions, analyzing each partition separately, and then combining the separate tests to give an overall test. • The separate statistical tests on partitions are non-independent, because the points in different partitions are not independent. Therefore, combining separate analyses of partitions requires accounting for the non-independence of the test statistics among partitions. • The methods can be applied to a wide range of data, including not only purely spatial data but also spatiotemporal data. For spatiotemporal data, it is possible to estimate coefficients from time-series models at different spatial locations and then analyze the spatial distribution of the estimates. The spatial analysis can be simplified by estimating spatial autocorrelation directly from the spatial autocorrelation among time series.

Original languageEnglish
Article number101660
JournalMethodsX
Volume9
DOIs
StatePublished - Jan 2022
Externally publishedYes

Keywords

  • F-test
  • Hypothesis tests on large datasets
  • Likelihood ratio test
  • Method for performing statistical tests using non-independent data partitions
  • Remote sensing
  • Statistics for big data
  • t-test

Fingerprint

Dive into the research topics of 'Statistical tests for non-independent partitions of large autocorrelated datasets'. Together they form a unique fingerprint.

Cite this