TY - JOUR
T1 - Connecting genotype to phenotype in the era of high-throughput sequencing
AU - Henry, Christopher S.
AU - Overbeek, Ross
AU - Xia, Fangfang
AU - Best, Aaron A.
AU - Glass, Elizabeth
AU - Gilbert, Jack
AU - Larsen, Peter
AU - Edwards, Rob
AU - Disz, Terry
AU - Meyer, Folker
AU - Vonstein, Veronika
AU - Dejongh, Matthew
AU - Bartels, Daniela
AU - Desai, Narayan
AU - D'Souza, Mark
AU - Devoid, Scott
AU - Keegan, Kevin P.
AU - Olson, Robert
AU - Wilke, Andreas
AU - Wilkening, Jared
AU - Stevens, Rick L.
PY - 2011/10
Y1 - 2011/10
N2 - Background: The development of next generation sequencing technology is rapidly changing the face of the genome annotation and analysis field. One of the primary uses for genome sequence data is to improve our understanding and prediction of phenotypes for microbes and microbial communities, but the technologies for predicting phenotypes must keep pace with the new sequences emerging. Scope of review: This review presents an integrated view of the methods and technologies used in the inference of phenotypes for microbes and microbial communities based on genomic and metagenomic data. Given the breadth of this topic, we place special focus on the resources available within the SEED Project. We discuss the two steps involved in connecting genotype to phenotype: sequence annotation, and phenotype inference, and we highlight the challenges in each of these steps when dealing with both single genome and metagenome data. Major conclusions: This integrated view of the genotype-to-phenotype problem highlights the importance of a controlled ontology in the annotation of genomic data, as this benefits subsequent phenotype inference and metagenome annotation. We also note the importance of expanding the set of reference genomes to improve the annotation of all sequence data, and we highlight metagenome assembly as a potential new source for complete genomes. Finally, we find that phenotype inference, particularly from metabolic models, generates predictions that can be validated and reconciled to improve annotations. General significance: This review presents the first look at the challenges and opportunities associated with the inference of phenotype from genotype during the next generation sequencing revolution. This article is part of a Special Issue entitled: Systems Biology of Microorganisms.
AB - Background: The development of next generation sequencing technology is rapidly changing the face of the genome annotation and analysis field. One of the primary uses for genome sequence data is to improve our understanding and prediction of phenotypes for microbes and microbial communities, but the technologies for predicting phenotypes must keep pace with the new sequences emerging. Scope of review: This review presents an integrated view of the methods and technologies used in the inference of phenotypes for microbes and microbial communities based on genomic and metagenomic data. Given the breadth of this topic, we place special focus on the resources available within the SEED Project. We discuss the two steps involved in connecting genotype to phenotype: sequence annotation, and phenotype inference, and we highlight the challenges in each of these steps when dealing with both single genome and metagenome data. Major conclusions: This integrated view of the genotype-to-phenotype problem highlights the importance of a controlled ontology in the annotation of genomic data, as this benefits subsequent phenotype inference and metagenome annotation. We also note the importance of expanding the set of reference genomes to improve the annotation of all sequence data, and we highlight metagenome assembly as a potential new source for complete genomes. Finally, we find that phenotype inference, particularly from metabolic models, generates predictions that can be validated and reconciled to improve annotations. General significance: This review presents the first look at the challenges and opportunities associated with the inference of phenotype from genotype during the next generation sequencing revolution. This article is part of a Special Issue entitled: Systems Biology of Microorganisms.
KW - Assembly
KW - Genome-scale metabolic models
KW - Metagenomics
KW - MG-RAST
KW - RAST
KW - SEED
UR - http://www.scopus.com/inward/record.url?scp=80052054573&partnerID=8YFLogxK
U2 - 10.1016/j.bbagen.2011.03.010
DO - 10.1016/j.bbagen.2011.03.010
M3 - Review article
C2 - 21421023
AN - SCOPUS:80052054573
SN - 0304-4165
VL - 1810
SP - 967
EP - 977
JO - Biochimica et Biophysica Acta - General Subjects
JF - Biochimica et Biophysica Acta - General Subjects
IS - 10
ER -