Tools Mendelian randomization

Curation of Mendelian randomization — listings under the GWAS Tools tab.

Summary Table

Click a column header to sort the table.

NAME	CATEGORY	Main citation	YEAR
Concepts&Principals	Concepts&Principals	Pierce BL et al., Am J Epidemiol, 2013	2013
Concepts&Principals	Concepts&Principals	Bowden J et al., Int J Epidemiol, 2015	2015
Concepts&Principals	Concepts&Principals	Smith GD et al., Int J Epidemiol, 2003	2003
Concepts&Principals	Concepts&Principals	Hemani G et al., PLoS Genet, 2017	2017
Concepts&Principals	Concepts&Principals	Howe LJ et al., Int J Epidemiol, 2022	2022
Diet-Inflammation-Insomnia MR	MISC	Zhou Y et al., medRxiv, 2026	2026
Reviews&Tutorials	Reviews&Tutorials	Benn M et al., Cardiovasc Res, 2018	2018
Reviews&Tutorials	Reviews&Tutorials	Minelli C et al., Int J Epidemiol, 2021	2021
Reviews&Tutorials	Reviews&Tutorials	Zheng J et al., Curr Epidemiol Rep, 2017	2017
Reviews&Tutorials	Reviews&Tutorials	Teumer A, Front Cardiovasc Med, 2018	2018
Reviews&Tutorials	Reviews&Tutorials	Emdin CA et al., JAMA, 2017	2017
Reviews&Tutorials	Reviews&Tutorials	Sanderson E et al., Nat Rev Methods Primers, 2022	2022
Reviews&Tutorials	Reviews&Tutorials	Davies NM et al., BMJ, 2018	2018
Reviews&Tutorials	Reviews&Tutorials	Lawlor DA, Int J Epidemiol, 2016	2016
SMR-multi	Tools	Wu Y et al., Nat Commun, 2018	2018
SMR	Tools	Zhu Z et al., Nat Genet, 2016	2016
Two-sample MR	Tools	Hemani G et al., Elife, 2018	2018

Concepts&Principals

Tool

PUBMED_LINK

23863760

TITLE

Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators.

Main citation

Pierce BL, Burgess S. (2013) Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol, 178 (7) 1177-84. doi:10.1093/aje/kwt084. PMID 23863760

ABSTRACT

Mendelian randomization (MR) is a method for estimating the causal relationship between an exposure and an outcome using a genetic factor as an instrumental variable (IV) for the exposure. In the traditional MR setting, data on the IV, exposure, and outcome are available for all participants. However, obtaining complete exposure data may be difficult in some settings, due to high measurement costs or lack of appropriate biospecimens. We used simulated data sets to assess statistical power and bias for MR when exposure data are available for a subset (or an independent set) of participants. We show that obtaining exposure data for a subset of participants is a cost-efficient strategy, often having negligible effects on power in comparison with a traditional complete-data analysis. The size of the subset needed to achieve maximum power depends on IV strength, and maximum power is approximately equal to the power of traditional IV estimators. Weak IVs are shown to lead to bias towards the null when the subsample is small and towards the confounded association when the subset is relatively large. Various approaches for confidence interval calculation are considered. These results have important implications for reducing the costs and increasing the feasibility of MR studies.

Show full abstractShow less

DOI

10.1093/aje/kwt084

Concepts&Principals

Tool

PUBMED_LINK

26050253

TITLE

Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression.

Main citation

Bowden J, Davey Smith G, Burgess S. (2015) Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol, 44 (2) 512-25. doi:10.1093/ije/dyv080. PMID 26050253

ABSTRACT

BACKGROUND: The number of Mendelian randomization analyses including large numbers of genetic variants is rapidly increasing. This is due to the proliferation of genome-wide association studies, and the desire to obtain more precise estimates of causal effects. However, some genetic variants may not be valid instrumental variables, in particular due to them having more than one proximal phenotypic correlate (pleiotropy). METHODS: We view Mendelian randomization with multiple instruments as a meta-analysis, and show that bias caused by pleiotropy can be regarded as analogous to small study bias. Causal estimates using each instrument can be displayed visually by a funnel plot to assess potential asymmetry. Egger regression, a tool to detect small study bias in meta-analysis, can be adapted to test for bias from pleiotropy, and the slope coefficient from Egger regression provides an estimate of the causal effect. Under the assumption that the association of each genetic variant with the exposure is independent of the pleiotropic effect of the variant (not via the exposure), Egger's test gives a valid test of the null causal hypothesis and a consistent causal effect estimate even when all the genetic variants are invalid instrumental variables. RESULTS: We illustrate the use of this approach by re-analysing two published Mendelian randomization studies of the causal effect of height on lung function, and the causal effect of blood pressure on coronary artery disease risk. The conservative nature of this approach is illustrated with these examples. CONCLUSIONS: An adaption of Egger regression (which we call MR-Egger) can detect some violations of the standard instrumental variable assumptions, and provide an effect estimate which is not subject to these violations. The approach provides a sensitivity analysis for the robustness of the findings from a Mendelian randomization investigation.

Show full abstractShow less

DOI

10.1093/ije/dyv080

Concepts&Principals

Tool

PUBMED_LINK

12689998

TITLE

'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease?

Main citation

Smith GD, Ebrahim S. (2003) 'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol, 32 (1) 1-22. doi:10.1093/ije/dyg070. PMID 12689998

ABSTRACT

Associations between modifiable exposures and disease seen in observational epidemiology are sometimes confounded and thus misleading, despite our best efforts to improve the design and analysis of studies. Mendelian randomization-the random assortment of genes from parents to offspring that occurs during gamete formation and conception-provides one method for assessing the causal nature of some environmental exposures. The association between a disease and a polymorphism that mimics the biological link between a proposed exposure and disease is not generally susceptible to the reverse causation or confounding that may distort interpretations of conventional observational studies. Several examples where the phenotypic effects of polymorphisms are well documented provide encouraging evidence of the explanatory power of Mendelian randomization and are described. The limitations of the approach include confounding by polymorphisms in linkage disequilibrium with the polymorphism under study, that polymorphisms may have several phenotypic effects associated with disease, the lack of suitable polymorphisms for studying modifiable exposures of interest, and canalization-the buffering of the effects of genetic variation during development. Nevertheless, Mendelian randomization provides new opportunities to test causality and demonstrates how investment in the human genome project may contribute to understanding and preventing the adverse effects on human health of modifiable exposures.

Show full abstractShow less

DOI

10.1093/ije/dyg070

Concepts&Principals

Tool

PUBMED_LINK

29149188

TITLE

Orienting the causal relationship between imprecisely measured traits using GWAS summary data.

Main citation

Hemani G, Tilling K, Davey Smith G. (2017) Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet, 13 (11) e1007081. doi:10.1371/journal.pgen.1007081. PMID 29149188

ABSTRACT

Inference about the causal structure that induces correlations between two traits can be achieved by combining genetic associations with a mediation-based approach, as is done in the causal inference test (CIT). However, we show that measurement error in the phenotypes can lead to the CIT inferring the wrong causal direction, and that increasing sample sizes has the adverse effect of increasing confidence in the wrong answer. This problem is likely to be general to other mediation-based approaches. Here we introduce an extension to Mendelian randomisation, a method that uses genetic associations in an instrumentation framework, that enables inference of the causal direction between traits, with some advantages. First, it can be performed using only summary level data from genome-wide association studies; second, it is less susceptible to bias in the presence of measurement error or unmeasured confounding. We apply the method to infer the causal direction between DNA methylation and gene expression levels. Our results demonstrate that, in general, DNA methylation is more likely to be the causal factor, but this result is highly susceptible to bias induced by systematic differences in measurement error between the platforms, and by horizontal pleiotropy. We emphasise that, where possible, implementing MR and appropriate sensitivity analyses alongside other approaches such as CIT is important to triangulate reliable conclusions about causality.

Show full abstractShow less

DOI

10.1371/journal.pgen.1007081

Concepts&Principals

Tool

PUBMED_LINK

34570226

TITLE

Interpreting Mendelian-randomization estimates of the effects of categorical exposures such as disease status and educational attainment.

Main citation

Howe LJ, Tudball M, Davey Smith G, Davies NM. (2022) Interpreting Mendelian-randomization estimates of the effects of categorical exposures such as disease status and educational attainment. Int J Epidemiol, 51 (3) 948-957. doi:10.1093/ije/dyab208. PMID 34570226

ABSTRACT

BACKGROUND: Mendelian randomization has been previously used to estimate the effects of binary and ordinal categorical exposures-e.g. Type 2 diabetes or educational attainment defined by qualification-on outcomes. Binary and categorical phenotypes can be modelled in terms of liability-an underlying latent continuous variable with liability thresholds separating individuals into categories. Genetic variants influence an individual's categorical exposure via their effects on liability, thus Mendelian-randomization analyses with categorical exposures will capture effects of liability that act independently of exposure category. METHODS AND RESULTS: We discuss how groups in which the categorical exposure is invariant can be used to detect liability effects acting independently of exposure category. For example, associations between an adult educational-attainment polygenic score (PGS) and body mass index measured before the minimum school leaving age (e.g. age 10 years), cannot indicate the effects of years in full-time education on this outcome. Using UK Biobank data, we show that a higher educational-attainment PGS is strongly associated with lower smoking initiation and higher odds of glasses use at age 15 years. These associations were replicated in sibling models. An orthogonal approach using the raising of the school leaving age (ROSLA) policy change found that individuals who chose to remain in education to age 16 years before the reform likely had higher liability to educational attainment than those who were compelled to remain in education to age 16 years after the reform, and had higher income, lower pack-years of smoking, higher odds of glasses use and lower deprivation in adulthood. These results suggest that liability to educational attainment is associated with health and social outcomes independently of years in full-time education. CONCLUSIONS: Mendelian-randomization studies with non-continuous exposures should be interpreted in terms of liability, which may affect the outcome via changes in exposure category and/or independently.

Show full abstractShow less

DOI

10.1093/ije/dyab208

MISC

Diet-Inflammation-Insomnia MR

Mendelian Randomization Diet Inflammation Insomnia Sleep FinnGen NHANES CHARLS medRxiv Preprint

FULL NAME

Dietary Traits, Systemic Inflammatory Proxies, and Insomnia-Related Outcomes: Exploratory Mendelian Randomization and Population-Based Evidence

DESCRIPTION

High-dimensional Mendelian randomization screen of 231 dietary traits and 731 immune phenotypes for insomnia, with cross-release follow-up in FinnGen R12/R13 and population-based validation in NHANES and CHARLS. Prioritized omelette-related intake (protective, OR 0.77) and mixed-fruit (risk, OR 1.29) dietary signals, plus CD33- and HLA-DR-related immune traits. CRP associated with sleep problems in both NHANES and CHARLS. Exploratory cross-design analysis — does not establish causal mechanism.

Show full descriptionShow less

URL

https://www.medrxiv.org/content/10.64898/2026.07.03.26357235v1

TITLE

Dietary Traits, Systemic Inflammatory Proxies, and Insomnia-Related Outcomes: Exploratory Mendelian Randomization and Population-Based Evidence.

Main citation

Zhou Y, Huang Y, Cao Y, Bi X. (2026) Dietary Traits, Systemic Inflammatory Proxies, and Insomnia-Related Outcomes: Exploratory Mendelian Randomization and Population-Based Evidence. medRxiv. doi:10.64898/2026.07.03.26357235

ABSTRACT

High-dimensional Mendelian randomization (MR) screens can prioritize candidate dietary and immune pathways for insomnia, but their interpretation is constrained by multiple testing, cross-dataset instability, and limited correspondence between genetic constructs and measured population variables. We conducted an exploratory cross-design analysis that combined MR screening of 231 dietary traits and 731 immune phenotypes, targeted cross-release genetic follow-up in FinnGen R12 and R13, and population-based analyses in NHANES and CHARLS. The targeted R13 follow-up prioritised an omelette-related dietary signal (OR 0.773, 95% CI 0.651-0.917), a mixed-fruit signal (OR 1.285, 95% CI 1.102-1.498), and CD33- and HLA-DR-related immune-cell traits. In NHANES, mapped omelet/scrambled-egg intake was associated with lower odds of sleep problems (OR 0.746, FDR=0.033) and doctor-reported sleep disorder (OR 0.313, FDR=0.008). Higher CRP was associated with sleep problems in NHANES (OR 1.192, FDR=0.001) and restless sleep in CHARLS (OR 1.097, FDR<0.001).

Show full abstractShow less

DOI

10.64898/2026.07.03.26357235

Reviews&Tutorials

Tool

PUBMED_LINK

29471399

TITLE

From genome-wide association studies to Mendelian randomization: novel opportunities for understanding cardiovascular disease causality, pathogenesis, prevention, and treatment.

Main citation

Benn M, Nordestgaard BG. (2018) From genome-wide association studies to Mendelian randomization: novel opportunities for understanding cardiovascular disease causality, pathogenesis, prevention, and treatment. Cardiovasc Res, 114 (9) 1192-1208. doi:10.1093/cvr/cvy045. PMID 29471399

ABSTRACT

The Mendelian randomization approach is an epidemiological study design incorporating genetic information into traditional epidemiological studies to infer causality of biomarkers, risk factors, or lifestyle factors on disease risk. Mendelian randomization studies often draw on novel information generated in genome-wide association studies on causal associations between genetic variants and a risk factor or lifestyle factor. Such information can then be used in a largely unconfounded study design free of reverse causation to understand if and how risk factors and lifestyle factors cause cardiovascular disease. If causation is demonstrated, an opportunity for prevention of disease is identified; importantly however, before prevention or treatment can be implemented, randomized intervention trials altering risk factor levels or improving deleterious lifestyle factors needs to document reductions in cardiovascular disease in a safe and side-effect sparse manner. Documentation of causality can also inform on potential drug targets, more likely to be successful than prior approaches often relying on animal or cell studies mainly. The present review summarizes the history and background of Mendelian randomization, the study design, assumptions for using the design, and the most common caveats, followed by a discussion on advantages and disadvantages of different types of Mendelian randomization studies using one or more samples and different levels of information on study participants. The review also provides an overview of results on many of the risk factors and lifestyle factors for cardiovascular disease examined to date using the Mendelian randomization study design.

Show full abstractShow less

DOI

10.1093/cvr/cvy045

Reviews&Tutorials

Tool

PUBMED_LINK

33899104

TITLE

The use of two-sample methods for Mendelian randomization analyses on single large datasets.

Main citation

Minelli C, Del Greco M F, van der Plaat DA, Bowden J, ...&, Thompson J. (2021) The use of two-sample methods for Mendelian randomization analyses on single large datasets. Int J Epidemiol, 50 (5) 1651-1659. doi:10.1093/ije/dyab084. PMID 33899104

ABSTRACT

BACKGROUND: With genome-wide association data for many exposures and outcomes now available from large biobanks, one-sample Mendelian randomization (MR) is increasingly used to investigate causal relationships. Many robust MR methods are available to address pleiotropy, but these assume independence between the gene-exposure and gene-outcome association estimates. Unlike in two-sample MR, in one-sample MR the two estimates are obtained from the same individuals, and the assumption of independence does not hold in the presence of confounding. METHODS: With simulations mimicking a typical study in UK Biobank, we assessed the performance, in terms of bias and precision of the MR estimate, of the fixed-effect and (multiplicative) random-effects meta-analysis method, weighted median estimator, weighted mode estimator and MR-Egger regression, used in both one-sample and two-sample data. We considered scenarios differing by the: presence/absence of a true causal effect; amount of confounding; and presence and type of pleiotropy (none, balanced or directional). RESULTS: Even in the presence of substantial correlation due to confounding, all two-sample methods used in one-sample MR performed similarly to when used in two-sample MR, except for MR-Egger which resulted in bias reflecting direction and magnitude of the confounding. Such bias was much reduced in the presence of very high variability in instrument strength across variants (IGX2 of 97%). CONCLUSIONS: Two-sample MR methods can be safely used for one-sample MR performed within large biobanks, expect for MR-Egger. MR-Egger is not recommended for one-sample MR unless the correlation between the gene-exposure and gene-outcome estimates due to confounding can be kept low, or the variability in instrument strength is very high.

Show full abstractShow less

DOI

10.1093/ije/dyab084

Reviews&Tutorials

Tool

PUBMED_LINK

29226067

TITLE

Recent Developments in Mendelian Randomization Studies.

Main citation

Zheng J, Baird D, Borges MC, Bowden J, ...&, Smith GD. (2017) Recent Developments in Mendelian Randomization Studies. Curr Epidemiol Rep, 4 (4) 330-345. doi:10.1007/s40471-017-0128-6. PMID 29226067

ABSTRACT

PURPOSE OF REVIEW: Mendelian randomization (MR) is a strategy for evaluating causality in observational epidemiological studies. MR exploits the fact that genotypes are not generally susceptible to reverse causation and confounding, due to their fixed nature and Mendel's First and Second Laws of Inheritance. MR has the potential to provide information on causality in many situations where randomized controlled trials are not possible, but the results of MR studies must be interpreted carefully to avoid drawing erroneous conclusions. RECENT FINDINGS: In this review, we outline the principles behind MR, as well as assumptions and limitations of the method. Extensions to the basic approach are discussed, including two-sample MR, bidirectional MR, two-step MR, multivariable MR, and factorial MR. We also consider some new applications and recent developments in the methodology, including its ability to inform drug development, automation of the method using tools such as MR-Base, and phenome-wide and hypothesis-free MR. SUMMARY: In conjunction with the growing availability of large-scale genomic databases, higher level of automation and increased robustness of the methods, MR promises to be a valuable strategy to examine causality in complex biological/omics networks, inform drug development and prioritize intervention targets for disease prevention in the future.

Show full abstractShow less

DOI

10.1007/s40471-017-0128-6

Reviews&Tutorials

Tool

PUBMED_LINK

29892602

TITLE

Common Methods for Performing Mendelian Randomization.

Main citation

Teumer A. (2018) Common Methods for Performing Mendelian Randomization. Front Cardiovasc Med, 5 () 51. doi:10.3389/fcvm.2018.00051. PMID 29892602

ABSTRACT

Mendelian randomization (MR) is a framework for assessing causal inference using cross-sectional data in combination with genetic information. This paper summarizes statistical methods commonly applied and strait forward to use for conducting MR analyses including those taking advantage of the rich dataset of SNP-trait associations that were revealed in the last decade through large-scale genome-wide association studies. Using these data, powerful MR studies are possible. However, the causal estimate may be biased in case the assumptions of MR are violated. The source and the type of this bias are described while providing a summary of the mathematical formulas that should help estimating the magnitude and direction of the potential bias depending on the specific research setting. Finally, methods for relaxing the assumptions and for conducting sensitivity analyses are discussed. Future researches in the field of MR include the assessment of non-linear causal effects, and automatic detection of invalid instruments.

Show full abstractShow less

DOI

10.3389/fcvm.2018.00051

Reviews&Tutorials

Tool

PUBMED_LINK

29164242

TITLE

Mendelian Randomization.

Main citation

Emdin CA, Khera AV, Kathiresan S. (2017) Mendelian Randomization. JAMA, 318 (19) 1925-1926. doi:10.1001/jama.2017.17219. PMID 29164242

DOI

10.1001/jama.2017.17219

Reviews&Tutorials

Tool

PUBMED_LINK

37325194

TITLE

Mendelian randomization.

Main citation

Sanderson E, Glymour MM, Holmes MV, Kang H, ...&, Smith GD. (2022) Mendelian randomization. Nat Rev Methods Primers, 2 () . doi:10.1038/s43586-021-00092-5. PMID 37325194

ABSTRACT

Mendelian randomization (MR) is a term that applies to the use of genetic variation to address causal questions about how modifiable exposures influence different outcomes. The principles of MR are based on Mendel's laws of inheritance and instrumental variable estimation methods, which enable the inference of causal effects in the presence of unobserved confounding. In this Primer, we outline the principles of MR, the instrumental variable conditions underlying MR estimation and some of the methods used for estimation. We go on to discuss how the assumptions underlying an MR study can be assessed and give methods of estimation that are robust to certain violations of these assumptions. We give examples of a range of studies in which MR has been applied, the limitations of current methods of analysis and the outlook for MR in the future. The difference between the assumptions required for MR analysis and other forms of non-interventional epidemiological studies means that MR can be used as part of a triangulation across multiple sources of evidence for causal inference.

Show full abstractShow less

DOI

10.1038/s43586-021-00092-5

Reviews&Tutorials

Tool

PUBMED_LINK

30002074

TITLE

Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians.

Main citation

Davies NM, Holmes MV, Davey Smith G. (2018) Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ, 362 () k601. doi:10.1136/bmj.k601. PMID 30002074

ABSTRACT

Mendelian randomisation uses genetic variation as a natural experiment to investigate the causal relations between potentially modifiable risk factors and health outcomes in observational data. As with all epidemiological approaches, findings from Mendelian randomisation studies depend on specific assumptions. We provide explanations of the information typically reported in Mendelian randomisation studies that can be used to assess the plausibility of these assumptions and guidance on how to interpret findings from Mendelian randomisation studies in the context of other sources of evidence

Show full abstractShow less

DOI

10.1136/bmj.k601

Reviews&Tutorials

Tool

PUBMED_LINK

27427429

TITLE

Commentary: Two-sample Mendelian randomization: opportunities and challenges.

Main citation

Lawlor DA. (2016) Commentary: Two-sample Mendelian randomization: opportunities and challenges. Int J Epidemiol, 45 (3) 908-15. doi:10.1093/ije/dyw127. PMID 27427429

DOI

10.1093/ije/dyw127

Tools

SMR

Tool

PUBMED_LINK

27019110

FULL NAME

Summary-data-based Mendelian Randomization

DESCRIPTION

The SMR software tool was originally developed to implement the SMR & HEIDI methods to test for pleiotropic association between the expression level of a gene and a complex trait of interest using summary-level data from GWAS and expression quantitative trait loci (eQTL) studies (Zhu et al. 2016 Nature Genetics). The SMR & HEIDI methodology can be interpreted as an analysis to test if the effect size of a SNP on the phenotype is mediated by gene expression. This tool can therefore be used to prioritize genes underlying GWAS hits for follow-up functional studies. The methods are applicable to all kinds of molecular QTL (xQTL) data, including DNA methylation QTL (mQTL) and protein abundance QTL (pQTL).

Show full descriptionShow less

URL

https://yanglab.westlake.edu.cn/software/smr/#Overview

KEYWORDS

pleiotropy or causality, xQTL, eQTL, MR, HEIDI, linkage

Show full keywordsShow less

TITLE

Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.

Main citation

Zhu Z, Zhang F, Hu H, Bakshi A, ...&, Yang J. (2016) Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet, 48 (5) 481-7. doi:10.1038/ng.3538. PMID 27019110

ABSTRACT

Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with human complex traits. However, the genes or functional DNA elements through which these variants exert their effects on the traits are often unknown. We propose a method (called SMR) that integrates summary-level data from GWAS with data from expression quantitative trait locus (eQTL) studies to identify genes whose expression levels are associated with a complex trait because of pleiotropy. We apply the method to five human complex traits using GWAS data on up to 339,224 individuals and eQTL data on 5,311 individuals, and we prioritize 126 genes (for example, TRAF1 and ANKRD55 for rheumatoid arthritis and SNX19 and NMRAL1 for schizophrenia), of which 25 genes are new candidates; 77 genes are not the nearest annotated gene to the top associated GWAS SNP. These genes provide important leads to design future functional studies to understand the mechanism whereby DNA variation leads to complex trait variation.

Show full abstractShow less

DOI

10.1038/ng.3538

SMR-multi

Tool

PUBMED_LINK

29500431

TITLE

Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.

Main citation

Wu Y, Zeng J, Zhang F, Zhu Z, ...&, Yang J. (2018) Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat Commun, 9 (1) 918. doi:10.1038/s41467-018-03371-0. PMID 29500431

ABSTRACT

The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.

Show full abstractShow less

DOI

10.1038/s41467-018-03371-0

Two-sample MR

Tool

PUBMED_LINK

29846171

DESCRIPTION

Two sample Mendelian randomisation (2SMR) is a method to estimate the causal effect of an exposure on an outcome using only summary statistics from genome wide association studies (GWAS). Though conceptually straightforward, there are a number of steps that are required to perform the analysis properly, and they can be cumbersome

Show full descriptionShow less

URL

https://mrcieu.github.io/TwoSampleMR/articles/introduction.html

TITLE

The MR-Base platform supports systematic causal inference across the human phenome.

Main citation

Hemani G, Zheng J, Elsworth B, Wade KH, ...&, Haycock PC. (2018) The MR-Base platform supports systematic causal inference across the human phenome. Elife, 7 () . doi:10.7554/eLife.34408. PMID 29846171

ABSTRACT

Results from genome-wide association studies (GWAS) can be used to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR) and bypassing the need for individual-level data. However, 2SMR methods are evolving rapidly and GWAS results are often insufficiently curated, undermining efficient implementation of the approach. We therefore developed MR-Base (<ext-link ext-link-type="uri" xlink:href="http://www.mrbase.org">http://www.mrbase.org</ext-link>): a platform that integrates a curated database of complete GWAS results (no restrictions according to statistical significance) with an application programming interface, web app and R packages that automate 2SMR. The software includes several sensitivity analyses for assessing the impact of horizontal pleiotropy and other violations of assumptions. The database currently comprises 11 billion single nucleotide polymorphism-trait associations from 1673 GWAS and is updated on a regular basis. Integrating data with software ensures more rigorous application of hypothesis-driven analyses and allows millions of potential causal relationships to be efficiently evaluated in phenome-wide association studies.

Show full abstractShow less

DOI

10.7554/eLife.34408