Abstract
In microarray data analysis, filter methods have low time complexity but neglect correlation among genes. In some methods that do account for it, the metrics used to calculate correlation cannot effectively reflect functional similarity among genes, and the time complexity is based on the whole gene set. This paper therefore proposes a novel gene selection model, Mutual-Information-based Minimum Spanning Trees (MIMST), which first uses filter methods to remove non-relevant genes, then computes the interdependence of the top-ranked genes, and finally eliminates the redundant genes. The empirical results show that, compared with other methods, MIMST can find the smallest subset of significant genes with higher classification accuracy.
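The three stages named in the abstract (filter, interdependence among top-ranked genes, redundancy elimination) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not specify the details, so several choices here are assumptions: expression values are taken to be already discretized, the filter score is mutual information with the class label, the tree is built Kruskal-style over edges weighted by negative mutual information, and redundancy is removed by keeping only the most relevant gene in each subtree that remains after weak edges are cut. The names `select_genes`, `top_k`, and `redundancy_threshold` are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def select_genes(genes, labels, top_k, redundancy_threshold):
    """Return a reduced gene list (illustrative sketch, not the published method).

    genes: dict mapping gene name -> list of discretized expression levels,
           one value per sample (assumed input format).
    labels: list of class labels, one per sample.
    """
    # Stage 1 (filter): score each gene against the class label and keep
    # only the top_k most relevant ones.
    relevance = {g: mutual_information(v, labels) for g, v in genes.items()}
    top = sorted(genes, key=lambda g: relevance[g], reverse=True)[:top_k]

    # Stage 2: pairwise interdependence, computed on the top-ranked genes
    # only rather than on the whole gene set.
    edges = sorted(
        (
            (mutual_information(genes[a], genes[b]), a, b)
            for i, a in enumerate(top)
            for b in top[i + 1:]
        ),
        key=lambda e: e[0],
        reverse=True,
    )

    # Stage 3: Kruskal's algorithm on weight = -MI, stopping once edges fall
    # below the threshold. This yields the subtrees of the minimum spanning
    # tree that remain after its weak edges are cut.
    parent = {g: g for g in top}

    def find(g):
        while parent[g] != g:
            parent[g] = parent[parent[g]]  # path halving
            g = parent[g]
        return g

    for mi, a, b in edges:
        if mi < redundancy_threshold:
            break
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Keep one representative per subtree: `top` is ordered by descending
    # relevance, so the first gene seen for each root is the most relevant.
    representative = {}
    for g in top:
        representative.setdefault(find(g), g)
    return list(representative.values())
```

In this sketch, two genes that carry the same information about the samples end up in the same subtree and only one survives, while a gene that is relevant but only weakly dependent on the others is kept as its own representative.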
| Original language | English |
|---|---|
| Pages (from-to) | 187-203 |
| Number of pages | 17 |
| Journal | International Journal of Computational Biology and Drug Design |
| Volume | 2 |
| Issue number | 2 |
| State | Published - Oct 2009 |
| Externally published | Yes |
Keywords
- Gene selection
- MST
- Microarray gene expression data analysis
- Minimum Spanning Trees
- Mutual Information