Prototype-based contrastive substructure identification for molecular property prediction

  • East China Normal University
  • East China University of Science and Technology

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Substructure-based representation learning has emerged as a powerful approach to featurize complex attributed graphs, with promising results in molecular property prediction (MPP). However, existing MPP methods mainly rely on manually defined rules to extract substructures. It remains an open challenge to adaptively identify meaningful substructures from numerous molecular graphs to accommodate MPP tasks. To this end, this paper proposes Prototype-based cOntrastive Substructure IdentificaTion (POSIT), a self-supervised framework to autonomously discover substructural prototypes across graphs so as to guide end-to-end molecular fragmentation. During pre-training, POSIT emphasizes two key aspects of substructure identification: firstly, it imposes a soft connectivity constraint to encourage the generation of topologically meaningful substructures; secondly, it aligns resultant substructures with derived prototypes through a prototype-substructure contrastive clustering objective, ensuring attribute-based similarity within clusters. In the fine-tuning stage, a cross-scale attention mechanism is designed to integrate substructure-level information to enhance molecular representations. The effectiveness of the POSIT framework is demonstrated by experimental results from diverse real-world datasets, covering both classification and regression tasks. Moreover, visualization analysis validates the consistency of chemical priors with identified substructures. The source code is publicly available at https://github.com/VRPharmer/POSIT.

Original language: English
Article number: bbae565
Journal: Briefings in Bioinformatics
Volume: 25
Issue: 6
DOI
Publication status: Published - 1 Nov 2024
