MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

Chi-Man Liu; Dinghua Li; Kunihiko Sadakane; Ruibang Luo; Tak-Wah Lam

arxiv: 1409.7208 · v2 · pith:P6XEMBE7new · submitted 2014-09-25 · 🧬 q-bio.GN

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

Dinghua Li , Chi-Man Liu , Ruibang Luo , Kunihiko Sadakane , Tak-Wah Lam This is my paper

classification 🧬 q-bio.GN

keywords megahitassemblymetagenomicsassemblingcomplexcontigdatahours

0 comments

read the original abstract

MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252Gbps in 44.1 hours and 99.6 hours on a single computing node with and without a GPU, respectively. MEGAHIT assembles the data as a whole, i.e., it avoids pre-processing like partitioning and normalization, which might compromise on result integrity. MEGAHIT generates 3 times larger assembly, with longer contig N50 and average contig length than the previous assembly. 55.8% of the reads were aligned to the assembly, which is 4 times higher than the previous. The source code of MEGAHIT is freely available at https://github.com/voutcn/megahit under GPLv3 license.

This paper has not been read by Pith yet.

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

discussion (0)