DEEPMAP: DEEP LEARNING-BASED SINGLE-CELL DATA INTEGRATION USING ITERATIVE CELL MATCHING AND STRUCTURE PRESERVATION CONSTRAINTS

Research output: Contribution to journal, Article, peer-review


Abstract

Effective integration of single-cell data can facilitate the discovery of cell-type-specific gene expression patterns and cellular interactions, ultimately leading to a better understanding of various biological processes and diseases. However, datasets from different platforms, species, and modalities exhibit varying levels of heterogeneity, which makes it difficult to align them with a single unified approach. Here we propose DeepMap, a flexible and efficient deep learning method for single-cell data integration. Our method uses iterative cell matching based on mutual nearest neighbors, leverages an autoencoder framework to learn harmonized representations of cells from various datasets, and incorporates a covariance penalty term into the framework for structure preservation. In addition to harmonizing data across datasets, we specifically account for the preservation of important biological variation within each dataset, which is crucial for reliable downstream analysis. Comprehensive real data analysis demonstrates the flexibility of DeepMap for diverse datasets from different platforms, species, and modalities, and highlights its marked ability to preserve structure compared with existing integration methods, with enhanced computational efficiency and optimized memory usage. The robust DeepMap-integrated data offer promising prospects for advancing our understanding of cell biology, making DeepMap a highly attractive option for integrative single-cell data analysis.
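
The mutual-nearest-neighbor matching mentioned in the abstract can be illustrated with a minimal sketch. This is not the DeepMap implementation: the function name `mutual_nearest_neighbors`, the parameter `k`, the use of Euclidean distance, and the toy data are all assumptions made for illustration, and the iterative refinement, autoencoder, and covariance penalty are not shown. A cell in one dataset and a cell in another are paired only when each is among the other's k nearest neighbors.

```python
import numpy as np

def mutual_nearest_neighbors(X, Y, k=1):
    """Return (i, j) pairs where X[i] and Y[j] are in each other's k nearest neighbors.

    Hypothetical helper for illustration only; Euclidean distance is an assumption.
    """
    # Squared Euclidean distance between every cell in X and every cell in Y
    d = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2)
    # For each X-cell, the indices of its k nearest Y-cells: shape (n_x, k)
    nn_xy = np.argsort(d, axis=1)[:, :k]
    # For each Y-cell, the indices of its k nearest X-cells: shape (n_y, k)
    nn_yx = np.argsort(d, axis=0)[:k, :].T
    pairs = []
    for i in range(X.shape[0]):
        for j in nn_xy[i]:
            # Keep the pair only if the neighbor relation holds in both directions
            if i in nn_yx[j]:
                pairs.append((i, int(j)))
    return pairs

# Toy data: two "cells" in dataset X, three in dataset Y (two features each)
X = np.array([[0.0, 0.0], [10.0, 10.0]])
Y = np.array([[0.1, 0.0], [10.0, 10.2], [4.0, 4.0]])
pairs = mutual_nearest_neighbors(X, Y, k=1)
print(pairs)
```

In this toy example the third cell of Y has a nearest neighbor in X, but no cell of X lists it in return, so it is left unmatched. That asymmetry is what distinguishes mutual matching from plain nearest-neighbor assignment.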

Original language: English
Pages (from-to): 3596-3613
Number of pages: 18
Journal: Annals of Applied Statistics
Volume: 18
Issue number: 4
State: Published - Dec 2024

Keywords

  • Single-cell data integration
  • deep learning
  • iterative cell matching
  • structure preservation constraints
