Skip to main content
Article
Accuracy and efficiency of algorithms for the demarcation of bacterial ecotypes from DNA sequence data
International Journal of Bioinformatics Research and Applications (2014)
  • Juan Carlos Francisco, Wesleyan University
  • Frederick M Cohan, Wesleyan University
  • Danny Krizanc, Wesleyan University
Abstract
Identification of closely related, ecologically distinct populations of bacteria would benefit microbiologists working in many fields including systematics, epidemiology and biotechnology. Several laboratories have recently developed algorithms aimed at demarcating such ‘ecotypes’. We examine the ability of four of these algorithms to correctly identify ecotypes from sequence data. We tested the algorithms on synthetic sequences, with known history and habitat associations, generated under the stable ecotype model and on data from Bacillus strains isolated from Death Valley where previous work has confirmed the existence of multiple ecotypes. We found that one of the algorithms (ecotype simulation) performs significantly better than the others (AdaptML, GMYC, BAPS) in both instances. Unfortunately, it was also shown to be the least efficient of the four. While ecotype simulation is the most accurate, it is by a large margin the slowest of the algorithms tested. Attempts at improving its efficiency are underway.
Keywords
  • bacterial ecotypes,
  • demarcation algorithms,
  • stable ecotype model,
  • DNA sequence data,
  • ecotype simulation,
  • AdaptML,
  • GYMC,
  • BAPS
Publication Date
2014
Citation Information
Juan Carlos Francisco, Frederick M Cohan and Danny Krizanc. "Accuracy and efficiency of algorithms for the demarcation of bacterial ecotypes from DNA sequence data" International Journal of Bioinformatics Research and Applications Vol. 10 Iss. 4/5 (2014)
Available at: http://works.bepress.com/frederick_cohan/64/