Journal article
Model selection procedures for high dimensional genomic data
AJ Motyer, S Galbraith, SR Wilson
Anziam Journal | Australian Mathematical Publishing Association, Inc. | Published : 2010
Abstract
Many complex diseases are thought to be caused by multiple genetic variants. Recent advances in genotyping technology allowed investi- gators of a complex disease to obtain data for a massive number of candidate genetic variants. Typically each candidate variant is tested individually for an association with the disease. We approach the problem as one of model selection for high dimensional data. We propose a method whereby penalised maximum likelihood estimation provides a reasonably sized set of variants for inclusion in our model. We then perform stepwise regression on this set of variants to arrive at our model. Penalised maximum likelihood estimation is performed with both the lasso and..
View full abstract