Skip to main content
Article
Protein Secondary Structure Prediction using Rule Induction from Coverings
Proceedings of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (2009, Nashville, TN)
  • Leong Lee
  • Jennifer Leopold, Missouri University of Science and Technology
  • Ronald L. Frank, Missouri University of Science and Technology
  • Anne M. Maglia, Missouri University of Science and Technology
Abstract

With the increase of data from genome sequencing projects comes the need for reliable and efficient methods for the analysis and classification of protein motifs and domains. Experimental methods currently used to determine protein structure are accurate, yet expensive both in terms of time and equipment. Therefore, various computational approaches to solving the problem have been attempted, although their accuracy has rarely exceeded 75%. In this paper, a rule-based method to predict protein secondary structure is presented. This method uses a newly developed data-mining algorithm called RT-RICO (Relaxed Threshold Rule Induction from Coverings), which identifies dependencies between amino acids in a protein sequence, and generates rules that can be used to predict secondary structures. The average prediction accuracy on sample data sets, or Q3 score, using RT-RICO was 80.3%, an improvement over comparable computational methods.

Meeting Name
2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB '09 (2009: Mar. 30-Apr. 2, Nashville, TN)
Department(s)
Computer Science
Second Department
Biological Sciences
Keywords and Phrases
  • Amino acids,
  • Protein sequence,
  • Protein engineering,
  • Probability,
  • Genomics,
  • Bioinformatics,
  • Induction generators,
  • Accuracy,
  • Nuclear magnetic resonance,
  • Neural networks
International Standard Book Number (ISBN)
978-1-4244-2756-7
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2009 Institute of Electrical and Electronics Engineers (IEEE), All rights reserved.
Publication Date
4-2-2009
Publication Date
02 Apr 2009
Disciplines
Citation Information
Leong Lee, Jennifer Leopold, Ronald L. Frank and Anne M. Maglia. "Protein Secondary Structure Prediction using Rule Induction from Coverings" Proceedings of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (2009, Nashville, TN) (2009) p. 79 - 86
Available at: http://works.bepress.com/ronald-frank/6/