Skip to main content
Article
Using Global Sequence Similarity to Enhance Biological Sequence Labeling
Genetics, Development and Cell Biology Presentations, Posters and Proceedings
  • Cornelia Caragea, Iowa State University
  • Jivko Sinapov, Iowa State University
  • Drena Dobbs, Iowa State University
  • Vasant Honavar, Iowa State University
Document Type
Conference Proceeding
Conference
2008 IEEE International Conference on Bioinformatics and Biomedicine
Publication Version
Accepted Manuscript
Publication Date
1-1-2008
DOI
10.1109/BIBM.2008.54
Conference Title
2008 IEEE International Conference on Bioinformatics and Biomedicine
Conference Date
November 3-5, 2008
Geolocation
(39.9525839, -75.16522150000003)
Abstract
Identifying functionally important sites from biological sequences, formulated as a biological sequence labeling problem, has broad applications ranging from rational drug design to the analysis of metabolic and signal transduction networks. In this paper, we present an approach to biological sequence labeling that takes into account the global similarity between biological sequences. Our approach combines unsupervised and supervised learning techniques. Given a set of sequences and a similarity measure defined on pairs of sequences, we learn a mixture of experts model by using spectral clustering to learn the hierarchical structure of the model and by using bayesian approaches to combine the predictions of the experts. We evaluate our approach on two important biological sequence labeling problems: RNA-protein and DNA-protein interface prediction problems. The results of our experiments show that global sequence similarity can be exploited to improve the performance of classifiers trained to label biological sequence data.
Comments

This is a proceeding from IEEE International Conference on Bioinformatics and Biomedicine (2008): 104, doi: 10.1109/BIBM.2008.54. Posted with permission.

Rights
© 2008 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Copyright Owner
IEEE
Language
en
File Format
application/pdf
Citation Information
Cornelia Caragea, Jivko Sinapov, Drena Dobbs and Vasant Honavar. "Using Global Sequence Similarity to Enhance Biological Sequence Labeling" Philadelphia, Pennsylvania(2008) p. 104 - 111
Available at: http://works.bepress.com/drena-dobbs/51/