Journal article

SumHer better estimates the SNP heritability of complex traits from summary statistics

Doug Speed, David J Balding



We present SumHer, software for estimating confounding bias, SNP heritability, enrichments of heritability and genetic correlations using summary statistics from genome-wide association studies. The key difference between SumHer and the existing software LD Score Regression (LDSC) is that SumHer allows the user to specify the heritability model. We apply SumHer to results from 24 large-scale association studies (average sample size 121,000) using our recommended heritability model. We show that these studies tended to substantially over-correct for confounding, and as a result the number of genome-wide significant loci was under-reported by about a quarter. We also estimate enrichments for 2..

View full abstract

University of Melbourne Researchers


Awarded by UK Medical Research Council

Awarded by European Union's Horizon 2020 Research and Innovation Programme under the Marie Sklodowska-Curie grant

Awarded by Independent Research Fund Denmark

Awarded by NHGRI

Awarded by National Institute on Aging

Awarded by dbGaP Project



Funding Acknowledgements

We thank A. Price, H. Finucane, P. O'Reilly and M. Speed for helpful discussions. Access to the Wellcome Trust Case Control Consortium data was authorized as work related to the project 'Genome-wide association study of susceptibility and clinical phenotypes in epilepsy', access to eMERGE Network data was granted under dbGaP Project 14422, 'Comprehensive testing of SNP-based prediction models', while access to the Health and Retirement Study was granted under dbGaP Project 15139, 'Developing summary-statistic tools for analysing genetic association study data'. D.S. is funded by the UK Medical Research Council under grant no. MR/L012561/1, by the European Union's Horizon 2020 Research and Innovation Programme under the Marie Sklodowska-Curie grant agreement no. 754513, by Aarhus University Research Foundation (AUFF) and the Independent Research Fund Denmark under Project no. 7025-00094B. The eMERGE Network was initiated and funded by NHGRI through the following grants: U01HG006828 (Cincinnati Children's Hospital Medical Center/Boston Children's Hospital); U01HG006830 (Children's Hospital of Philadelphia); U01HG006389 (Essentia Institute of Rural Health, Marshfield Clinic Research Foundation and Pennsylvania State University); U01HG006382 (Geisinger Clinic); U01HG006375 (Group Health Cooperative); U01HG006379 (Mayo Clinic); U01HG006380 (Icahn School of Medicine at Mount Sinai); U01HG006388 (Northwestern University); U01HG006378 (Vanderbilt University Medical Center); and U01HG006385 (Vanderbilt University Medical Center serving as the Coordinating Center). The Health and Retirement Study genetic data is sponsored by the National Institute on Aging (grant nos. U01AG009740, RC2AG036495, and RC4AG039029) and was conducted by the University of Michigan. Analyses were performed with the use of the UCL Computer Science Cluster and the help of the CS Technical Support Group, as well as the use of the UCL Legion High-Performance Computing Facility (Legion@UCL) and associated support services.