GraphBin: Refined binning of metagenomic contigs using assembly graphs

Vijini Mallawaarachchi, Anuradha Wickramarachchi, Yu Lin

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Motivation: The field of metagenomics has provided valuable insights into the structure, diversity and ecology within microbial communities. One key step in metagenomics analysis is to assemble reads into longer contigs which are then binned into groups of contigs that belong to different species present in the metagenomic sample. Binning of contigs plays an important role in metagenomics and most available binning algorithms bin contigs using genomic features such as oligonucleotide/k-mer composition and contig coverage. As metagenomic contigs are derived from the assembly process, they are output from the underlying assembly graph which contains valuable connectivity information between contigs that can be used for binning. Results: We propose GraphBin, a new binning method that makes use of the assembly graph and applies a label propagation algorithm to refine the binning result of existing tools. We show that GraphBin can make use of the assembly graphs constructed from both the de Bruijn graph and the overlap-layout-consensus approach. Moreover, we demonstrate improved experimental results from GraphBin in terms of identifying mis-binned contigs and binning of contigs discarded by existing binning tools. To the best of our knowledge, this is the first time that the information from the assembly graph has been used in a tool for the binning of metagenomic contigs. 

Original languageEnglish
Pages (from-to)3307-3313
Number of pages7
JournalBioinformatics
Volume36
Issue number11
DOIs
Publication statusPublished - Jun 2020
Externally publishedYes

Keywords

  • metagenomic analysis
  • Contigs
  • GraphBin
  • Binning Tools
  • Assembly graphs
  • Metagenomics binning

Fingerprint

Dive into the research topics of 'GraphBin: Refined binning of metagenomic contigs using assembly graphs'. Together they form a unique fingerprint.

Cite this