TY - JOUR
T1 - Protein fold classification with Grow-and-Learn network
AU - Polat, Özlem
AU - Dokur, Zümray
N1 - Publisher Copyright:
© TÜBİTAK.
PY - 2017
Y1 - 2017
N2 - Protein fold classification is an important subject in computational biology and a compelling work from the point of machine learning. To deal with such a challenging problem, in this study, we propose a solution method for the classification of protein folds using Grow-and-Learn (GAL) neural network together with one-versus-others (OvO) method. To classify the most common 27 protein folds, 125 dimensional data, constituted by the physicochemical properties of amino acids, are used. The study is conducted on a database including 694 proteins: 311 of these proteins are used for training and 383 of them for testing. Overall, the classification system achieves 81.2% fold recognition accuracy on the test set, where most of the proteins have less than 25% sequence identity with the ones used during the training. To portray the capabilities of the GAL network among the other methods, comparisons between a few approaches have also been made, and GAL's accuracy is found to be higher than those of the existing methods for protein fold classification.
AB - Protein fold classification is an important subject in computational biology and a compelling work from the point of machine learning. To deal with such a challenging problem, in this study, we propose a solution method for the classification of protein folds using Grow-and-Learn (GAL) neural network together with one-versus-others (OvO) method. To classify the most common 27 protein folds, 125 dimensional data, constituted by the physicochemical properties of amino acids, are used. The study is conducted on a database including 694 proteins: 311 of these proteins are used for training and 383 of them for testing. Overall, the classification system achieves 81.2% fold recognition accuracy on the test set, where most of the proteins have less than 25% sequence identity with the ones used during the training. To portray the capabilities of the GAL network among the other methods, comparisons between a few approaches have also been made, and GAL's accuracy is found to be higher than those of the existing methods for protein fold classification.
KW - Attributes for protein fold recognition
KW - Bioinformatics
KW - Grow and learn neural network
KW - Protein fold classification
UR - http://www.scopus.com/inward/record.url?scp=85017401769&partnerID=8YFLogxK
U2 - 10.3906/elk-1506-126
DO - 10.3906/elk-1506-126
M3 - Article
AN - SCOPUS:85017401769
SN - 1300-0632
VL - 25
SP - 1184
EP - 1196
JO - Turkish Journal of Electrical Engineering and Computer Sciences
JF - Turkish Journal of Electrical Engineering and Computer Sciences
IS - 2
ER -