TY - GEN
T1 - Contact-based simulated annealing protein sequence alignment method
AU - Dong, Qi Wen
AU - Lin, Lei
AU - Wang, Xiao Long
AU - Li, Ming Hui
PY - 2005
Y1 - 2005
N2 - Protein sequence alignments reveal the evolutionary information between homologous sequences. Traditional sequence alignment methods only use sequence information and the structure information from template Is ignored. Recently, Kleinjung et al. developed a contact-based sequence alignment method that used the structural information from side-chain contacts. Alignment scores are provided by the CAO (Contact Accepted mutatiOn) substitution matrices. Kleinjung et al. devised an approximate dynamic programming algorithm for protein sequence alignment, on the assumption that the distance between the contacting residues during evolution has been conserved. However, such assumption is not suitable for insertion/deletion events during evolution. In this paper, the contact-based simulated annealing alignment method has been proposed, which can find the optimal alignment solution between two protein sequences without any restriction. The alignment score is calculated by the sequence-based scores and the weighted contact-based scores. A new parameter, the contact-penalty r, has been introduced. When the contacting residue in the template aligns with gap in the query sequence, the total alignment score is decreased by a contact-penalty. All the parameters including relative weight w of CAO scores versus Blosum62 scores, matrix constant c for CAO scores, gap-open penalty p, gap-extension penalty q and contact-penalty r are re-optimized by genetic algorithm. Testing on the Homstrad database shows that the accuracy of this method is 85.4%, which is higher than that of Kleinjung's method by about 3.6 percent. Such method can be useful in many biological problems such as protein remote homology detection, comparative modeling and fold recognition.
AB - Protein sequence alignments reveal the evolutionary information between homologous sequences. Traditional sequence alignment methods only use sequence information and the structure information from template Is ignored. Recently, Kleinjung et al. developed a contact-based sequence alignment method that used the structural information from side-chain contacts. Alignment scores are provided by the CAO (Contact Accepted mutatiOn) substitution matrices. Kleinjung et al. devised an approximate dynamic programming algorithm for protein sequence alignment, on the assumption that the distance between the contacting residues during evolution has been conserved. However, such assumption is not suitable for insertion/deletion events during evolution. In this paper, the contact-based simulated annealing alignment method has been proposed, which can find the optimal alignment solution between two protein sequences without any restriction. The alignment score is calculated by the sequence-based scores and the weighted contact-based scores. A new parameter, the contact-penalty r, has been introduced. When the contacting residue in the template aligns with gap in the query sequence, the total alignment score is decreased by a contact-penalty. All the parameters including relative weight w of CAO scores versus Blosum62 scores, matrix constant c for CAO scores, gap-open penalty p, gap-extension penalty q and contact-penalty r are re-optimized by genetic algorithm. Testing on the Homstrad database shows that the accuracy of this method is 85.4%, which is higher than that of Kleinjung's method by about 3.6 percent. Such method can be useful in many biological problems such as protein remote homology detection, comparative modeling and fold recognition.
UR - https://www.scopus.com/pages/publications/33846916958
M3 - 会议稿件
AN - SCOPUS:33846916958
SN - 0780387406
SN - 9780780387409
T3 - Annual International Conference of the IEEE Engineering in Medicine and Biology - Proceedings
SP - 2798
EP - 2801
BT - Proceedings of the 2005 27th Annual International Conference of the Engineering in Medicine and Biology Society, IEEE-EMBS 2005
T2 - 2005 27th Annual International Conference of the Engineering in Medicine and Biology Society, IEEE-EMBS 2005
Y2 - 1 September 2005 through 4 September 2005
ER -