Protein fold classification with Grow-and-Learn network


Polat Ö., Dokur Z.

TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, vol.25, no.2, pp.1184-1196, 2017 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 25 Issue: 2
  • Publication Date: 2017
  • Doi Number: 10.3906/elk-1506-126
  • Journal Name: TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, TR DİZİN (ULAKBİM)
  • Page Numbers: pp.1184-1196
  • Keywords: Protein fold classification, grow and learn neural network, attributes for protein fold recognition, bioinformatics, SUPPORT VECTOR MACHINES, STRUCTURAL CLASS PREDICTION, ENSEMBLE CLASSIFIER, NEURAL-NETWORK, RECOGNITION, SCOP
  • Sivas Cumhuriyet University Affiliated: Yes

Abstract

Protein fold classification is an important subject in computational biology and a compelling work from the point of machine learning. To deal with such a challenging problem, in this study, we propose a solution method for the classification of protein folds using Grow-and-Learn (GAL) neural network together with one-versus-others (OvO) method. To classify the most common 27 protein folds, 125 dimensional data, constituted by the physicochemical properties of amino acids, are used. The study is conducted on a database including 694 proteins: 311 of these proteins are used for training and 383 of them for testing. Overall, the classification system achieves 81.2% fold recognition accuracy on the test set, where most of the proteins have less than 25% sequence identity with the ones used during the training. To portray the capabilities of the GAL network among the other methods, comparisons between a few approaches have also been made, and GAL's accuracy is found to be higher than those of the existing methods for protein fold classification.