跳到主要导航 跳到搜索 跳到主要内容

AnnoPRO: a strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding

  • Lingyan Zheng
  • , Shuiyang Shi
  • , Mingkun Lu
  • , Pan Fang
  • , Ziqi Pan
  • , Hongning Zhang
  • , Zhimeng Zhou
  • , Hanyu Zhang
  • , Minjie Mou
  • , Shijie Huang
  • , Lin Tao
  • , Weiqi Xia
  • , Honglin Li
  • , Zhenyu Zeng
  • , Shun Zhang
  • , Yuzong Chen
  • , Zhaorong Li*
  • , Feng Zhu*
  • *此作品的通讯作者
  • The Second Affiliated Hospital of Zhejiang University School of Medicine
  • Alibaba Group Holding Ltd.
  • Zhejiang University
  • Hangzhou Normal University
  • Zhejiang Provincial People's Hospital
  • East China University of Science and Technology
  • Tsinghua University

科研成果: 期刊稿件文章同行评审

摘要

Protein function annotation has been one of the longstanding issues in biological sciences, and various computational methods have been developed. However, the existing methods suffer from a serious long-tail problem, with a large number of GO families containing few annotated proteins. Herein, an innovative strategy named AnnoPRO was therefore constructed by enabling sequence-based multi-scale protein representation, dual-path protein encoding using pre-training, and function annotation by long short-term memory-based decoding. A variety of case studies based on different benchmarks were conducted, which confirmed the superior performance of AnnoPRO among available methods. Source code and models have been made freely available at: https://github.com/idrblab/AnnoPRO and https://zenodo.org/records/10012272

源语言英语
文章编号41
期刊Genome Biology
25
1
DOI
出版状态已出版 - 12月 2024
已对外发布

指纹

探究 'AnnoPRO: a strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding' 的科研主题。它们共同构成独一无二的指纹。

引用此