Journal article

Bazam: A rapid method for read extraction and realignment of high throughput sequencing data

Simon Sadedin, Alicia Oshlack

Published: 2018

Abstract

Background: As costs of high throughput sequencing have fallen, we are seeing vast quantities of short read genomic data being generated. Often, the data is exchanged and stored as aligned reads, which provides high compression and convenient access for many analyses. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Moreover, some applications cannot utilise pre-aligned reads at all, necessitating conversion back to raw format (FASTQ) before they can be used. In both cases, the process of extraction and realignment is expensive and time-consuming.

Findings: We describe Bazam, a tool that efficiently extracts the original paired FA..
