Skip to main navigation Skip to search Skip to main content

Transformer-based multidimensional feature fusion for accurate prediction of lipid nanoparticles transfection efficiency

  • ShanghaiTech University
  • Lingang Laboratory
  • East China Normal University

Research output: Contribution to journalArticlepeer-review

Abstract

RNA-based technologies have demonstrated significant potential for diverse applications, ranging from vaccination to gene editing. However, their widespread adoption is limited by the critical challenge of efficient delivery. Lipid nanoparticles (LNPs) have emerged as a widely utilized RNA delivery system, yet their formulation design and optimization primarily rely on empirical trial-and-error, which is labor-intensive, time-consuming, and cost-prohibitive, thus hindering the rapid development of RNA therapeutics. To facilitate the early-stage design and optimization of LNPs for enhanced delivery efficiency, in this study, we construct LNPs-TE, a benchmark dataset comprising over 10 000 experimentally measured transfection efficiency (TE) values, and introduce LNPs integrated feature fusion Transformer (LIFT), a deep learning framework for LNPs TE prediction. Comprehensive experiments demonstrate that LIFT effectively integrates multidimensional molecular representations of ionizable lipids, the key component in LNPs formulation, achieving superior predictive performance, with an average Pearson correlation coefficient of 0.845 for regression and an area under the receiver operating characteristic curve (AUC-ROC) of 0.818 for multi-class classification across multiple datasets. Through scaffold-based splitting and activity cliff tasks, we further validated the exceptional generalization ability and robustness of LIFT, which achieved over a 10% improvement in the coefficient of determination (R2) compared with state-of-the-art baseline models, highlighting its potential as a practical and stable approach for the virtual screening of efficient LNPs formulation.

Original languageEnglish
Article numberbbag092
JournalBriefings in Bioinformatics
Volume27
Issue number2
DOIs
StatePublished - 1 Mar 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • deep learning
  • ionizable lipids
  • lipid nanoparticles
  • multidimensional feature fusion
  • nucleic acid delivery

Fingerprint

Dive into the research topics of 'Transformer-based multidimensional feature fusion for accurate prediction of lipid nanoparticles transfection efficiency'. Together they form a unique fingerprint.

Cite this