PHANOTATE: A novel approach to gene identification in phage genomes

Katelyn McNair, Carol Zhou, Elizabeth A. Dinsdale, Brian Souza, Robert A. Edwards

Research output: Contribution to journal › Article

15 Citations (Scopus)

Abstract

Currently there are no tools specifically designed for annotating genes in phages. Several tools have been adapted to run on phage genomes, but due to their underlying design they are unable to capture the full complexity of phage genomes. Phages have adapted their genomes to be extremely compact, with adjacent genes that overlap and genes located completely inside other, longer genes. This non-delineated genome structure makes gene prediction difficult for the currently available gene annotators. Here we present PHANOTATE, a novel method for gene calling specifically designed for phage genomes. Although the compact nature of genes in phages is a problem for current gene annotators, we exploit this property by treating a phage genome as a network of paths in which open reading frames are favorable, and overlaps and gaps are less favorable but still possible. We represent this network of connections as a weighted graph and use dynamic programming to find the optimal path. Results: We compare PHANOTATE to other gene callers by annotating a set of 2133 complete phage genomes from GenBank using PHANOTATE and the three most popular gene callers. We found that the four programs agree on 82% of the total predicted genes, with PHANOTATE predicting more genes than the other three. We searched for these extra genes in both GenBank's non-redundant protein database and all of the metagenomes in the Sequence Read Archive, and found that they are present at levels that suggest they are functional protein-coding genes.
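The weighted-graph idea in the abstract can be illustrated with a small sketch. This is not PHANOTATE's implementation: the function name `best_gene_path`, the per-base weights (`orf_reward`, `gap_penalty`, `overlap_penalty`), and the restriction to a single strand with ORFs given as plain coordinate pairs are all assumptions made for illustration. Each ORF lowers the path cost, each gap or overlap between consecutive ORFs raises it, and a simple dynamic program over ORFs sorted by start position finds the cheapest path from the start of the genome to its end.

```python
from typing import List, Tuple

ORF = Tuple[int, int]  # (start, end), 1-based inclusive, single strand (assumption)


def best_gene_path(
    orfs: List[ORF],
    genome_length: int,
    orf_reward: float = 1.0,       # hypothetical reward per ORF base
    gap_penalty: float = 0.5,      # hypothetical cost per uncovered base
    overlap_penalty: float = 2.0,  # hypothetical cost per overlapping base
) -> Tuple[float, List[ORF]]:
    """Return (total_cost, chosen_orfs) for the lowest-cost path of ORFs."""

    def orf_cost(orf: ORF) -> float:
        start, end = orf
        return -orf_reward * (end - start + 1)  # ORFs are favorable (negative cost)

    def link_cost(prev_end: int, next_start: int) -> float:
        distance = next_start - prev_end - 1
        if distance >= 0:
            return gap_penalty * distance        # gap between consecutive ORFs
        return overlap_penalty * (-distance)     # overlap between consecutive ORFs

    orfs = sorted(orfs)
    n = len(orfs)
    if n == 0:
        # With no ORFs, the whole genome is one gap.
        return link_cost(0, genome_length + 1), []

    best = [0.0] * n   # best[j]: cheapest path cost that ends at ORF j
    back = [-1] * n    # back[j]: previous ORF on that path, -1 if none
    for j, (start_j, end_j) in enumerate(orfs):
        # Option 1: ORF j is the first gene; sequence before it is a gap.
        best[j] = link_cost(0, start_j) + orf_cost(orfs[j])
        # Option 2: extend a path ending at an earlier ORF i.
        for i in range(j):
            start_i, end_i = orfs[i]
            if start_i < start_j and end_i < end_j:
                candidate = best[i] + link_cost(end_i, start_j) + orf_cost(orfs[j])
                if candidate < best[j]:
                    best[j] = candidate
                    back[j] = i

    # Close each path with the gap from its last ORF to the genome end.
    final = [best[j] + link_cost(orfs[j][1], genome_length + 1) for j in range(n)]
    j = min(range(n), key=final.__getitem__)
    total = final[j]

    path: List[ORF] = []
    while j != -1:
        path.append(orfs[j])
        j = back[j]
    path.reverse()
    return total, path
```

For example, `best_gene_path([(1, 30), (28, 60), (40, 90), (65, 100)], 100)` selects `(1, 30)`, `(28, 60)` and `(65, 100)`: the three-base overlap between the first two ORFs is tolerated, while `(40, 90)` is skipped because its long overlaps with its neighbours cost more under these toy weights than the gaps left by omitting it.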

Original language: English
Pages (from-to): 4537-4542
Number of pages: 6
Journal: Bioinformatics
Volume: 35
Issue number: 22
DOIs
Publication status: Published - 1 Nov 2019
Externally published: Yes

Bibliographical note

© The Author(s) 2019. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Keywords

  • Phanotate
  • gene identification
  • phage genomes
