Journal article

Scaffolding and completing genome assemblies in real-time with nanopore sequencing

MD Cao, SH Nguyen, D Ganesamoorthy, AG Elliott, MA Cooper, LJM Coin

Nature Communications | NATURE PORTFOLIO | Published : 2017

Abstract

Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing. Here we present npScarf, which can scaffold and complete short read assemblies while the long read sequencing run is in progress. It reports assembly metrics in real-time so the sequencing run can be terminated once an assembly of sufficient..

View full abstract

University of Melbourne Researchers

Grants

Awarded by National Health and Medical Research Council


Funding Acknowledgements

MAC is an National Health and Medical Research Council Principal Research Fellow (APP1059354). LC is an Australian Research Council Future Fellow (FT110100972). The research is supported by funding from the National Health and Medical Research Council (APP1052303) as well as funding from the Institute for Molecular Bioscience Centre for Superbugs Solutions (610246).