跳到主要导航 跳到搜索 跳到主要内容

Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection

  • Bin Liu*
  • , Xiaolong Wang
  • , Qingcai Chen
  • , Qiwen Dong
  • , Xun Lan
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Protein remote homology detection is one of the most important problems in bioinformatics. Discriminative methods such as support vector machines (SVM) have shown superior performance. However, the performance of SVM-based methods depends on the vector representations of the protein sequences. Prior works have demonstrated that sequence-order effects are relevant for discrimination, but little work has explored how to incorporate the sequence-order information along with the amino acid physicochemical properties into the prediction. In order to incorporate the sequence-order effects into the protein remote homology detection, the physicochemical distance transformation (PDT) method is proposed. Each protein sequence is converted into a series of numbers by using the physicochemical property scores in the amino acid index (AAIndex), and then the sequence is converted into a fixed length vector by PDT. The sequence-order information can be efficiently included into the feature vector with little computational cost by this approach. Finally, the feature vectors are input into a support vector machine classifier to detect the protein remote homologies. Our experiments on a well-known benchmark show the proposed method SVM-PDT achieves superior or comparable performance with current state-of-the-art methods and its computational cost is considerably superior to those of other methods. When the evolutionary information extracted from the frequency profiles is combined with the PDT method, the profile-based PDT approach can improve the performance by 3.4% and 11.4% in terms of ROC score and ROC50 score respectively. The local sequence-order information of the protein can be efficiently captured by the proposed PDT and the physicochemical properties extracted from the amino acid index are incorporated into the prediction. The physicochemical distance transformation provides a general framework, which would be a valuable tool for protein-level study.

源语言英语
文章编号e46633
期刊PLoS ONE
7
9
DOI
出版状态已出版 - 28 9月 2012
已对外发布

指纹

探究 'Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此