Journal article

An accurate method for identifying recent recombinants from unaligned sequences

Q Feng, KE Tiedje, S Ruybal-Pesantez, G Tonkin-Hill, MF Duffy, KP Day, H Shim, YB Chan

Bioinformatics | Published : 2022

Abstract

Motivation: Recombination is a fundamental process in molecular evolution, and the identification of recombinant sequences is thus of major interest. However, current methods for detecting recombinants are primarily designed for aligned sequences. Thus, they struggle with analyses of highly diverse genes, such as the var genes of the malaria parasite Plasmodium falciparum, which are known to diversify primarily through recombination. Results: We introduce an algorithm to detect recent recombinant sequences from a dataset without a full multiple alignment. Our algorithm can handle thousands of gene-length sequences without the need for a reference panel. We demonstrate the accuracy of our alg..

View full abstract

Grants

Awarded by National Institute of Allergy and Infectious Diseases


Funding Acknowledgements

This study was supported by the Fogarty International Center at the National Institutes of Health (Program on the Ecology and Evolution of Infectious Diseases) [R01-TW009670 to K.P.D.]. Salary support for KET was provided by R01-TW009670 and The University of Melbourne. SRP was supported by a Melbourne International Engagement Award from The University of Melbourne. Salary support for MFD was provided by The University of Melbourne. QF has been supported by the China Scholarship Council.