CpG-Discover: A machine learning approach for CpG islands identification from human DNA sequence

Man Lan*, Yu Xu, Lin Li, Fei Wang, Ying Zuo, Yuan Chen, Chew Lim Tan, Jian Su

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

CpG islands (CGIs) play a fundamental role in genome analysis as genomic markers and tumor markers. Identification of potential CGIs has contributed not only to the prediction of promoters of most house-keeping genes and many tissue-specific genes but also to the understanding of the epigenetic causes of cancer. The most current methods for identifying CGIs suffered from various limitations and involved a lot of human intervention for search purpose. In this paper, we implement a HMM-based CGIs identification system, namely CpG-Discover. Experiments have been conducted on the EMBL human DNA database and in comparison with other widely-used tools. The controlled experimental results indicate that our system is a promising tool and has the capability of locating CGIs accurately. In addition, our system has significant differences from other tools in that it avoids the disadvantages of using sliding windows and it reduces the large amount of human intervention needed to search for or to combine potential CGIs (such as, the thresholds of initial density or distance seed). Therefore, given annotated training data set, our system has the adaptability to find other specific nucleotides sequences in DNA.

Original languageEnglish
Title of host publication2009 International Joint Conference on Neural Networks, IJCNN 2009
Pages1702-1707
Number of pages6
DOIs
StatePublished - 2009
Event2009 International Joint Conference on Neural Networks, IJCNN 2009 - Atlanta, GA, United States
Duration: 14 Jun 200919 Jun 2009

Publication series

NameProceedings of the International Joint Conference on Neural Networks

Conference

Conference2009 International Joint Conference on Neural Networks, IJCNN 2009
Country/TerritoryUnited States
CityAtlanta, GA
Period14/06/0919/06/09

Fingerprint

Dive into the research topics of 'CpG-Discover: A machine learning approach for CpG islands identification from human DNA sequence'. Together they form a unique fingerprint.

Cite this