Structure-adaptive canonical correlation analysis for microbiome multi-omics data

Linsui Deng, Yanlin Tang, Xianyang Zhang, Jun Chen

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Sparse canonical correlation analysis (sCCA) has been a useful approach for integrating different high-dimensional datasets by finding a subset of correlated features that explain the most correlation in the data. In the context of microbiome studies, investigators are always interested in knowing how the microbiome interacts with the host at different molecular levels such as genome, methylol, transcriptome, metabolome and proteome. sCCA provides a simple approach for exploiting the correlation structure among multiple omics data and finding a set of correlated omics features, which could contribute to understanding the host-microbiome interaction. However, existing sCCA methods do not address compositionality, and its application to microbiome data is thus not optimal. This paper proposes a new sCCA framework for integrating microbiome data with other high-dimensional omics data, accounting for the compositional nature of microbiome sequencing data. It also allows integrating prior structure information such as the grouping structure among bacterial taxa by imposing a “soft” constraint on the coefficients through varying penalization strength. As a result, the method provides significant improvement when the structure is informative while maintaining robustness against a misspecified structure. Through extensive simulation studies and real data analysis, we demonstrate the superiority of the proposed framework over the state-of-the-art approaches.

Original languageEnglish
Article number1489694
JournalFrontiers in Genetics
Volume15
DOIs
StatePublished - 2024

Keywords

  • canonical correlation analysis
  • compositional effect
  • dimension reduction
  • phylogenetic tree
  • structural information
  • variable selection

Fingerprint

Dive into the research topics of 'Structure-adaptive canonical correlation analysis for microbiome multi-omics data'. Together they form a unique fingerprint.

Cite this