An SVM-based approach to discover microRNA precursors in plant genomes

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

MicroRNAs (miRNAs) are noncoding RNAs of ∼22 nucleotides that play versatile regulatory roles in multicelluler organisms. Since the cloning methods for miRNAs identification are biased towards abundant miRNAs, the computational approaches provide useful complements to identify miRNAs which are highly constrained by tissue- and time-specifically expression manners. In this paper, we propose a novel Support Vector Machine (SVM) based detector, named MiR-PD, to identify pre-miRNAs in plants. The classifier is constructed based on twelve features of pre-miRNAs, inclusive of five global features and seven sub-structure features. Trained on 790 plant pre-miRNAs and 7,900 pseudo pre-miRNAs, MiR-PD achieves 96.43% five-fold cross-validation accuracy. Tested on the newly identified 441 plant pre-miRNAs and 62,883 pseudo pre-miRNAs, MiR-PD reports an accuracy of 99.71% with 77.55% sensitivity and 99.87% specificity, suggesting a feasible genome-wide application of this miRNAs detector so as to identify novel miRNAs (especially for those species-specific miRNAs) in plants without relying on phylogenetical conservation.

Original languageEnglish
Title of host publicationNew Frontiers in Applied Data Mining - PAKDD 2011 International Workshops, Revised Selected Papers
Pages304-315
Number of pages12
DOIs
StatePublished - 2012
Event15th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2011 - Shenzhen, China
Duration: 24 May 201127 May 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7104 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2011
Country/TerritoryChina
CityShenzhen
Period24/05/1127/05/11

Keywords

  • MiR-PD
  • MicroRNAs
  • plant
  • support vector machine

Fingerprint

Dive into the research topics of 'An SVM-based approach to discover microRNA precursors in plant genomes'. Together they form a unique fingerprint.

Cite this