Phylogenetic classification of short environmental DNA fragments

Lutz Krause, Naryttza N. Diaz, Alexander Goesmann, Scott Kelley, Tim W. Nattkemper, Forest Rohwer, Robert A. Edwards, Jens Stoye

Research output: Contribution to journalArticlepeer-review

211 Citations (Scopus)
3 Downloads (Pure)


Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domain and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs), are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed.

Original languageEnglish
Pages (from-to)2230-2239
Number of pages10
JournalNucleic Acids Research
Issue number7
Publication statusPublished - Apr 2008
Externally publishedYes

Bibliographical note

Oxford University Press (OUP) has partnered with Copyright Clearance Center's RightsLink service to offer a variety of options for reusing this content. Note: This article is available under the Creative Commons CC-BY-NC license and permits non-commercial use, distribution and reproduction in any medium, provided the original work is properly cited. For commercial reuse, permission must be requested below.


Dive into the research topics of 'Phylogenetic classification of short environmental DNA fragments'. Together they form a unique fingerprint.

Cite this