Mixture SNPs effect on phenotype in genome-wide association studies.

TitleMixture SNPs effect on phenotype in genome-wide association studies.
Publication TypeJournal Article
Year of Publication2015
AuthorsWang, L, Shen, H, Liu, H, Guo, G
JournalBMC Genomics
Date Published2015 Feb 03
ISSN Number1471-2164
KeywordsAlgorithms, Alleles, Alpha-Ketoglutarate-Dependent Dioxygenase FTO, Bayes Theorem, Body Mass Index, Chromosomes, Human, Pair 16, Genome-Wide Association Study, Humans, Linkage Disequilibrium, Phenotype, Polymorphism, Single Nucleotide, Principal Component Analysis, Proteins

BACKGROUND: Recently mixed linear models are used to address the issue of "missing" heritability in traditional Genome-wide association studies (GWAS). The models assume that all single-nucleotide polymorphisms (SNPs) are associated with the phenotypes of interest. However, it is more common that only a small proportion of SNPs have significant effects on the phenotypes, while most SNPs have no or very small effects. To incorporate this feature, we propose an efficient Hierarchical Bayesian Model (HBM) that extends the existing mixed models to enforce automatic selection of significant SNPs. The HBM models the SNP effects using a mixture distribution of a point mass at zero and a normal distribution, where the point mass corresponds to those non-associative SNPs.

RESULTS: We estimate the HBM using Gibbs sampling. The estimation performance of our method is first demonstrated through two simulation studies. We make the simulation setups realistic by using parameters fitted on the Framingham Heart Study (FHS) data. The simulation studies show that our method can accurately estimate the proportion of SNPs associated with the simulated phenotype and identify these SNPs, as well as adapt to certain model mis-specification than the standard mixed models. In addition, we analyze data from the FHS and the Health and Retirement Study (HRS) to study the association between Body Mass Index (BMI) and SNPs on Chromosome 16, and replicate the identified genetic associations. The analysis of the FHS data identifies 0.3% SNPs on Chromosome 16 that affect BMI, including rs9939609 and rs9939973 on the FTO gene. These two SNPs are in strong linkage disequilibrium with rs1558902 (Rsq =0.901 for rs9939609 and Rsq =0.905 for rs9939973), which has been reported to be linked with obesity in previous GWAS. We then replicate the findings using the HRS data: the analysis finds 0.4% of SNPs associated with BMI on Chromosome 16. Furthermore, around 25% of the genes that are identified to be associated with BMI are common between the two studies.

CONCLUSIONS: The results demonstrate that the HBM and the associated estimation algorithm offer a powerful tool for identifying significant genetic associations with phenotypes of interest, among a large number of SNPs that are common in modern genetics studies.


Times Cited: 0 0

User Guide Notes


Endnote Keywords

genetics/genetics/Phenotypes/Phenotypes/Hierarchical Bayesian Model/Statistical analysis/Framingham Heart Study/Body Mass Index

Endnote ID


Alternate JournalBMC Genomics
Citation Key8299
PubMed ID25649116
PubMed Central IDPMC4417323
Grant ListP2C HD050924 / HD / NICHD NIH HHS / United States
RC1 DA029425 / DA / NIDA NIH HHS / United States
RC1 DA029425-01 / DA / NIDA NIH HHS / United States