Journal article

Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment

LG Fearnley, MF Bennett, M Bahlo

Scientific Reports | Published : 2022

Open access

Abstract

Bioinformatic methods for detecting short tandem repeat expansions in short-read sequencing have identified new repeat expansions in humans, but require alignment information to identify repetitive motif enrichment at genomic locations. We present superSTR, an ultrafast method that does not require alignment. superSTR is used to process whole-genome and whole-exome sequencing data, and perform the first STR analysis of the UK Biobank, efficiently screening and identifying known and potential disease-associated STRs in the exomes of 49,953 biobank participants. We demonstrate the first bioinformatic screening of RNA sequencing data to detect repeat expansions in humans and mouse models of ata..

View full abstract

Grants

Awarded by State Government of Victoria


Funding Acknowledgements

LGF is supported by the DHB Foundation Centenary Postdoctoral Fellowship in Neurogenetic Systems Biology. MB is funded by an NHMRC Senior Research Fellowship (ID: 1102971). MFB is supported by a Taking Flight Award from CURE Epilepsy. This work is supported by the Victorian State Government Operational Infrastructure Support and the Australian Government National Health and Medical Research Council Independent Research Institute Infrastructure Support Scheme.