A novel gene selection algorithm for cancer identification based on random forest and particle swarm optimization

Elnaz Pashaei, Mustafa Ozen, Nizamettin Aydin

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

15 Atıf (Scopus)

Özet

In order to achieve informative gene from thousands of candidate genes contributing to the symptom of cancer, two novel gene selection approaches for classification of multiclass microarray datasets are proposed. In the first, method we use k-means clustering to remove redundancy, and then apply Random Forest (RF) to rank each gene in every cluster to remove irrelevance. The top scored genes from each cluster is gathered and a new feature subset (filtered genes) is generated. At the last stage filtered genes is used as input to eight benchmark classification methods. In the second approach we develop a novel method utilizing Particle Swarm Optimization combined with BoostedC5.0 decision tree as the classifier. We apply filtered genes that achieved by first proposed method as input to PSO+BoostedC5.0 classifier and compare the performance of it with 8 classifiers. Experimental results show that by using clustering technique and RF ranking we can give a true pattern which select a smaller number of feature subset and obtain better classification accuracy. Also by applying this method on ten microarray datasets and using filtered genes as input for 9 classifiers we showed that proposed PSO+BoostedC5.0 simplifies features effectively and obtains higher classification accuracy compared to the other classification methods.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığı2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015
YayınlayanInstitute of Electrical and Electronics Engineers Inc.
ISBN (Elektronik)9781479969265
DOI'lar
Yayın durumuYayınlandı - 16 Eki 2015
Harici olarak yayınlandıEvet
EtkinlikIEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015 - Niagara Falls, Canada
Süre: 12 Ağu 201515 Ağu 2015

Yayın serisi

Adı2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2015
Ülke/BölgeCanada
ŞehirNiagara Falls
Periyot12/08/1515/08/15

Bibliyografik not

Publisher Copyright:
© 2015 IEEE.

Parmak izi

A novel gene selection algorithm for cancer identification based on random forest and particle swarm optimization' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap