Abstract
To identify informative genes among the thousands of candidate genes that contribute to the symptoms of cancer, we propose two novel gene selection approaches for the classification of multiclass microarray datasets. In the first method, we use k-means clustering to remove redundancy and then apply Random Forest (RF) to rank the genes within each cluster, which removes irrelevant genes. The top-scoring genes from each cluster are gathered into a new feature subset (the filtered genes). In the final stage, the filtered genes are used as input to eight benchmark classification methods. In the second approach, we develop a novel method that combines Particle Swarm Optimization (PSO) with a BoostedC5.0 decision tree as the classifier. We apply the filtered genes obtained by the first method as input to the PSO+BoostedC5.0 classifier and compare its performance with that of the eight benchmark classifiers. Experimental results show that clustering followed by RF ranking selects a smaller feature subset and obtains better classification accuracy. Furthermore, by applying this method to ten microarray datasets and using the filtered genes as input to all nine classifiers, we show that the proposed PSO+BoostedC5.0 reduces the feature set effectively and obtains higher classification accuracy than the other classification methods.
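The first approach (cluster the genes, rank them within each cluster, keep the top-ranked ones) can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy, which the abstract does not name. The function name `filter_genes`, the parameters `n_clusters` and `top_per_cluster`, their default values, and the synthetic data are all hypothetical choices for illustration, not the authors' settings.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier


def filter_genes(X, y, n_clusters=5, top_per_cluster=2, seed=0):
    """Return sorted column indices of genes kept by cluster-then-rank filtering.

    X: (n_samples, n_genes) expression matrix; y: (n_samples,) class labels.
    All defaults are illustrative assumptions, not values from the paper.
    """
    X = np.asarray(X, dtype=float)
    # Cluster the genes (columns): each gene is one point described by its
    # expression profile across samples, so similar genes share a cluster.
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    gene_cluster = km.fit_predict(X.T)

    selected = []
    for c in range(n_clusters):
        members = np.flatnonzero(gene_cluster == c)
        if members.size == 0:
            continue  # defensive guard; nothing to rank in an empty cluster
        # Rank the genes of this cluster by Random Forest feature importance.
        rf = RandomForestClassifier(n_estimators=200, random_state=seed)
        rf.fit(X[:, members], y)
        order = np.argsort(rf.feature_importances_)[::-1]
        selected.extend(members[order[:top_per_cluster]].tolist())
    return sorted(selected)


# Synthetic stand-in for a microarray dataset: 60 samples, 40 genes, 3 classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 40))
y = rng.integers(0, 3, size=60)
X[:, 0] += 2.0 * y  # make one gene informative about the class label

kept = filter_genes(X, y, n_clusters=4, top_per_cluster=2)
X_filtered = X[:, kept]  # reduced matrix that would be passed to a classifier
print(kept, X_filtered.shape)
```

In this sketch, `X_filtered` plays the role of the filtered genes that the abstract feeds to the downstream classifiers. The PSO+BoostedC5.0 stage is not shown, because the abstract does not specify how the two components are coupled.
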
Original language | English |
---|---|
Title of host publication | 2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781479969265 |
Publication status | Published - 16 Oct 2015 |
Externally published | Yes |
Event | IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015 - Niagara Falls, Canada. Duration: 12 Aug 2015 → 15 Aug 2015 |
Publication series
Name | 2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015 |
---|
Conference
Conference | IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015 |
---|---|
Country/Territory | Canada |
City | Niagara Falls |
Period | 12/08/15 → 15/08/15 |
Bibliographical note
Publisher Copyright: © 2015 IEEE.
Keywords
- Decision tree classifier
- Gene expression
- Particle swarm optimization
- Random Forest