Abstract
The Biological sequence data are increasing rapidly, so there is a vital need of effective method for gene detection. Predicting of splice site is an important part of gene finding. Therefore, attempts to improve the prediction accuracy of the computational methods for splice sites detection continue. In this paper we propose a hybrid algorithm for splice sites prediction by combining AdaBoost classifier with a novel nucleotide encoding method, namely FDDM. Our encoding method provides frequency difference between the true sites and false sites (FD) along with distance measure (DM). The proposed method produces an improvement in comparison with the result of current methods such as MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, DM-SVM, DM2-AdaBoost and MSC+Pos(+APR)-SVM, when applied to the HS3D dataset with repeated 10-fold cross validation. In addition, for demonstrating the stability of the method, we also applied it to NN269 dataset. The obtained results indicate that the new method is practicable and efficient.
Original language | English |
---|---|
Title of host publication | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 3853-3858 |
Number of pages | 6 |
ISBN (Electronic) | 9781509018970 |
DOIs | |
Publication status | Published - 6 Feb 2017 |
Externally published | Yes |
Event | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Budapest, Hungary Duration: 9 Oct 2016 → 12 Oct 2016 |
Publication series
Name | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings |
---|
Conference
Conference | 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 |
---|---|
Country/Territory | Hungary |
City | Budapest |
Period | 9/10/16 → 12/10/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- AdaBoost classifier
- Nucleotide encoding method
- Splice site prediction