跳到主要导航 跳到搜索 跳到主要内容

Numerical sequence representation of DNA sequences and methods to distinguish coding and non-coding sequences in a complete genome

  • Zu Guo Yu*
  • , Vo Anh
  • , Yu Zhou
  • , Li Qian Zhou
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this presentation we introduce two methods to distinguish coding and non-coding sequences in a complete genome. A numerical sequence representation of DNA sequences is introduced first. There exists a one-to-one correspondence between a DNA sequence and its numerical sequence representation. In the first method, three exponents from a multifractal analysis are selected to construct the parameter space. In the second method, which is based on a Fourier transform approach, three parameters from the power spectrum of the numerical sequence representation are selected to construct the parameter space. Each DNA may be represented by a point in these three-dimensional spaces. We found that the points corresponding to coding and non-coding sequences in the complete genomes of prokaryotes are divided into different regions in both parameter spaces. If the point for a DNA sequence is situated in the region corresponding to coding sequences, the sequence is recognized as a coding sequence; otherwise, the sequence is classified as a non-coding one. The average accuracies using Fisher's discriminant algorithm for coding and non-coding sequences are satisfactory.

源语言英语
主期刊名WMSCI 2007 - The 11th World Multi-Conference on Systemics, Cybernetics and Informatics, Jointly with the 13th International Conference on Information Systems Analysis and Synthesis, ISAS 2007 - Proc.
171-176
页数6
出版状态已出版 - 2007
已对外发布
活动11th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2007, Jointly with the 13th International Conference on Information Systems Analysis and Synthesis, ISAS 2007 - Orlando, FL, 美国
期限: 8 7月 200711 7月 2007

出版系列

姓名WMSCI 2007 - The 11th World Multi-Conference on Systemics, Cybernetics and Informatics, Jointly with the 13th International Conference on Information Systems Analysis and Synthesis, ISAS 2007 - Proc.
1

会议

会议11th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2007, Jointly with the 13th International Conference on Information Systems Analysis and Synthesis, ISAS 2007
国家/地区美国
Orlando, FL
时期8/07/0711/07/07

指纹

探究 'Numerical sequence representation of DNA sequences and methods to distinguish coding and non-coding sequences in a complete genome' 的科研主题。它们共同构成独一无二的指纹。

引用此