Skip to content

Tools Mendelian randomization

Curation of Mendelian randomization — listings under the GWAS Tools tab.

Summary Table

Click a column header to sort the table.

NAME CATEGORY Main citation YEAR
Concepts&Principals Concepts&Principals
Pierce BL et al., Am J Epidemiol, 2013
2013
Concepts&Principals Concepts&Principals
Bowden J et al., Int J Epidemiol, 2015
2015
Concepts&Principals Concepts&Principals
Smith GD et al., Int J Epidemiol, 2003
2003
Concepts&Principals Concepts&Principals
Hemani G et al., PLoS Genet, 2017
2017
Concepts&Principals Concepts&Principals
Howe LJ et al., Int J Epidemiol, 2022
2022
Reviews&Tutorials Reviews&Tutorials
Benn M et al., Cardiovasc Res, 2018
2018
Reviews&Tutorials Reviews&Tutorials
Minelli C et al., Int J Epidemiol, 2021
2021
Reviews&Tutorials Reviews&Tutorials
Zheng J et al., Curr Epidemiol Rep, 2017
2017
Reviews&Tutorials Reviews&Tutorials
Teumer A, Front Cardiovasc Med, 2018
2018
Reviews&Tutorials Reviews&Tutorials
Emdin CA et al., JAMA, 2017
2017
Reviews&Tutorials Reviews&Tutorials
Sanderson E et al., Nat Rev Methods Primers, 2022
2022
Reviews&Tutorials Reviews&Tutorials
Davies NM et al., BMJ, 2018
2018
Reviews&Tutorials Reviews&Tutorials
Lawlor DA, Int J Epidemiol, 2016
2016
SMR-multi Tools
Wu Y et al., Nat Commun, 2018
2018
SMR Tools
Zhu Z et al., Nat Genet, 2016
2016
Two-sample MR Tools
Hemani G et al., Elife, 2018
2018

Concepts&Principals

Concepts&Principals

Tool
PUBMED_LINK
23863760
TITLE
Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators.
Main citation
Pierce BL, Burgess S. (2013) Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol, 178 (7) 1177-84. doi:10.1093/aje/kwt084. PMID 23863760
ABSTRACT
Mendelian randomization (MR) is a method for estimating the causal relationship between an exposure and an outcome using a genetic factor as an instrumental variable (IV) for the exposure. In the traditional MR setting, data on the IV, exposure, and outcome are available for all participants. However, obtaining complete exposure data may be difficult in some settings, due to high measurement costs or lack of appropriate biospecimens. We used simulated data sets to assess statistical power and bias for MR when exposure data are available for a subset (or an independent set) of participants. We show that obtaining exposure data for a subset of participants is a cost-efficient strategy, often having negligible effects on power in comparison with a traditional complete-data analysis. The size of the subset needed to achieve maximum power depends on IV strength, and maximum power is approximately equal to the power of traditional IV estimators. Weak IVs are shown to lead to bias towards the null when the subsample is small and towards the confounded association when the subset is relatively large. Various approaches for confidence interval calculation are considered. These results have important implications for reducing the costs and increasing the feasibility of MR studies.
DOI
10.1093/aje/kwt084

Concepts&Principals

Tool
PUBMED_LINK
26050253
TITLE
Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression.
Main citation
Bowden J, Davey Smith G, Burgess S. (2015) Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol, 44 (2) 512-25. doi:10.1093/ije/dyv080. PMID 26050253
ABSTRACT
BACKGROUND: The number of Mendelian randomization analyses including large numbers of genetic variants is rapidly increasing. This is due to the proliferation of genome-wide association studies, and the desire to obtain more precise estimates of causal effects. However, some genetic variants may not be valid instrumental variables, in particular due to them having more than one proximal phenotypic correlate (pleiotropy). METHODS: We view Mendelian randomization with multiple instruments as a meta-analysis, and show that bias caused by pleiotropy can be regarded as analogous to small study bias. Causal estimates using each instrument can be displayed visually by a funnel plot to assess potential asymmetry. Egger regression, a tool to detect small study bias in meta-analysis, can be adapted to test for bias from pleiotropy, and the slope coefficient from Egger regression provides an estimate of the causal effect. Under the assumption that the association of each genetic variant with the exposure is independent of the pleiotropic effect of the variant (not via the exposure), Egger's test gives a valid test of the null causal hypothesis and a consistent causal effect estimate even when all the genetic variants are invalid instrumental variables. RESULTS: We illustrate the use of this approach by re-analysing two published Mendelian randomization studies of the causal effect of height on lung function, and the causal effect of blood pressure on coronary artery disease risk. The conservative nature of this approach is illustrated with these examples. CONCLUSIONS: An adaption of Egger regression (which we call MR-Egger) can detect some violations of the standard instrumental variable assumptions, and provide an effect estimate which is not subject to these violations. The approach provides a sensitivity analysis for the robustness of the findings from a Mendelian randomization investigation.
DOI
10.1093/ije/dyv080

Concepts&Principals

Tool
PUBMED_LINK
12689998
TITLE
'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease?
Main citation
Smith GD, Ebrahim S. (2003) 'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol, 32 (1) 1-22. doi:10.1093/ije/dyg070. PMID 12689998
ABSTRACT
Associations between modifiable exposures and disease seen in observational epidemiology are sometimes confounded and thus misleading, despite our best efforts to improve the design and analysis of studies. Mendelian randomization-the random assortment of genes from parents to offspring that occurs during gamete formation and conception-provides one method for assessing the causal nature of some environmental exposures. The association between a disease and a polymorphism that mimics the biological link between a proposed exposure and disease is not generally susceptible to the reverse causation or confounding that may distort interpretations of conventional observational studies. Several examples where the phenotypic effects of polymorphisms are well documented provide encouraging evidence of the explanatory power of Mendelian randomization and are described. The limitations of the approach include confounding by polymorphisms in linkage disequilibrium with the polymorphism under study, that polymorphisms may have several phenotypic effects associated with disease, the lack of suitable polymorphisms for studying modifiable exposures of interest, and canalization-the buffering of the effects of genetic variation during development. Nevertheless, Mendelian randomization provides new opportunities to test causality and demonstrates how investment in the human genome project may contribute to understanding and preventing the adverse effects on human health of modifiable exposures.
DOI
10.1093/ije/dyg070

Concepts&Principals

Tool
PUBMED_LINK
29149188
TITLE
Orienting the causal relationship between imprecisely measured traits using GWAS summary data.
Main citation
Hemani G, Tilling K, Davey Smith G. (2017) Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet, 13 (11) e1007081. doi:10.1371/journal.pgen.1007081. PMID 29149188
ABSTRACT
Inference about the causal structure that induces correlations between two traits can be achieved by combining genetic associations with a mediation-based approach, as is done in the causal inference test (CIT). However, we show that measurement error in the phenotypes can lead to the CIT inferring the wrong causal direction, and that increasing sample sizes has the adverse effect of increasing confidence in the wrong answer. This problem is likely to be general to other mediation-based approaches. Here we introduce an extension to Mendelian randomisation, a method that uses genetic associations in an instrumentation framework, that enables inference of the causal direction between traits, with some advantages. First, it can be performed using only summary level data from genome-wide association studies; second, it is less susceptible to bias in the presence of measurement error or unmeasured confounding. We apply the method to infer the causal direction between DNA methylation and gene expression levels. Our results demonstrate that, in general, DNA methylation is more likely to be the causal factor, but this result is highly susceptible to bias induced by systematic differences in measurement error between the platforms, and by horizontal pleiotropy. We emphasise that, where possible, implementing MR and appropriate sensitivity analyses alongside other approaches such as CIT is important to triangulate reliable conclusions about causality.
DOI
10.1371/journal.pgen.1007081

Concepts&Principals

Tool
PUBMED_LINK
34570226
TITLE
Interpreting Mendelian-randomization estimates of the effects of categorical exposures such as disease status and educational attainment.
Main citation
Howe LJ, Tudball M, Davey Smith G, Davies NM. (2022) Interpreting Mendelian-randomization estimates of the effects of categorical exposures such as disease status and educational attainment. Int J Epidemiol, 51 (3) 948-957. doi:10.1093/ije/dyab208. PMID 34570226
ABSTRACT
BACKGROUND: Mendelian randomization has been previously used to estimate the effects of binary and ordinal categorical exposures-e.g. Type 2 diabetes or educational attainment defined by qualification-on outcomes. Binary and categorical phenotypes can be modelled in terms of liability-an underlying latent continuous variable with liability thresholds separating individuals into categories. Genetic variants influence an individual's categorical exposure via their effects on liability, thus Mendelian-randomization analyses with categorical exposures will capture effects of liability that act independently of exposure category. METHODS AND RESULTS: We discuss how groups in which the categorical exposure is invariant can be used to detect liability effects acting independently of exposure category. For example, associations between an adult educational-attainment polygenic score (PGS) and body mass index measured before the minimum school leaving age (e.g. age 10 years), cannot indicate the effects of years in full-time education on this outcome. Using UK Biobank data, we show that a higher educational-attainment PGS is strongly associated with lower smoking initiation and higher odds of glasses use at age 15 years. These associations were replicated in sibling models. An orthogonal approach using the raising of the school leaving age (ROSLA) policy change found that individuals who chose to remain in education to age 16 years before the reform likely had higher liability to educational attainment than those who were compelled to remain in education to age 16 years after the reform, and had higher income, lower pack-years of smoking, higher odds of glasses use and lower deprivation in adulthood. These results suggest that liability to educational attainment is associated with health and social outcomes independently of years in full-time education. CONCLUSIONS: Mendelian-randomization studies with non-continuous exposures should be interpreted in terms of liability, which may affect the outcome via changes in exposure category and/or independently.
DOI
10.1093/ije/dyab208

Reviews&Tutorials

Reviews&Tutorials

Tool
PUBMED_LINK
29471399
TITLE
From genome-wide association studies to Mendelian randomization: novel opportunities for understanding cardiovascular disease causality, pathogenesis, prevention, and treatment.
Main citation
Benn M, Nordestgaard BG. (2018) From genome-wide association studies to Mendelian randomization: novel opportunities for understanding cardiovascular disease causality, pathogenesis, prevention, and treatment. Cardiovasc Res, 114 (9) 1192-1208. doi:10.1093/cvr/cvy045. PMID 29471399
ABSTRACT
The Mendelian randomization approach is an epidemiological study design incorporating genetic information into traditional epidemiological studies to infer causality of biomarkers, risk factors, or lifestyle factors on disease risk. Mendelian randomization studies often draw on novel information generated in genome-wide association studies on causal associations between genetic variants and a risk factor or lifestyle factor. Such information can then be used in a largely unconfounded study design free of reverse causation to understand if and how risk factors and lifestyle factors cause cardiovascular disease. If causation is demonstrated, an opportunity for prevention of disease is identified; importantly however, before prevention or treatment can be implemented, randomized intervention trials altering risk factor levels or improving deleterious lifestyle factors needs to document reductions in cardiovascular disease in a safe and side-effect sparse manner. Documentation of causality can also inform on potential drug targets, more likely to be successful than prior approaches often relying on animal or cell studies mainly. The present review summarizes the history and background of Mendelian randomization, the study design, assumptions for using the design, and the most common caveats, followed by a discussion on advantages and disadvantages of different types of Mendelian randomization studies using one or more samples and different levels of information on study participants. The review also provides an overview of results on many of the risk factors and lifestyle factors for cardiovascular disease examined to date using the Mendelian randomization study design.
DOI
10.1093/cvr/cvy045

Reviews&Tutorials

Tool
PUBMED_LINK
33899104
TITLE
The use of two-sample methods for Mendelian randomization analyses on single large datasets.
Main citation
Minelli C, Del Greco M F, van der Plaat DA, Bowden J, ...&, Thompson J. (2021) The use of two-sample methods for Mendelian randomization analyses on single large datasets. Int J Epidemiol, 50 (5) 1651-1659. doi:10.1093/ije/dyab084. PMID 33899104
ABSTRACT
BACKGROUND: With genome-wide association data for many exposures and outcomes now available from large biobanks, one-sample Mendelian randomization (MR) is increasingly used to investigate causal relationships. Many robust MR methods are available to address pleiotropy, but these assume independence between the gene-exposure and gene-outcome association estimates. Unlike in two-sample MR, in one-sample MR the two estimates are obtained from the same individuals, and the assumption of independence does not hold in the presence of confounding. METHODS: With simulations mimicking a typical study in UK Biobank, we assessed the performance, in terms of bias and precision of the MR estimate, of the fixed-effect and (multiplicative) random-effects meta-analysis method, weighted median estimator, weighted mode estimator and MR-Egger regression, used in both one-sample and two-sample data. We considered scenarios differing by the: presence/absence of a true causal effect; amount of confounding; and presence and type of pleiotropy (none, balanced or directional). RESULTS: Even in the presence of substantial correlation due to confounding, all two-sample methods used in one-sample MR performed similarly to when used in two-sample MR, except for MR-Egger which resulted in bias reflecting direction and magnitude of the confounding. Such bias was much reduced in the presence of very high variability in instrument strength across variants (IGX2 of 97%). CONCLUSIONS: Two-sample MR methods can be safely used for one-sample MR performed within large biobanks, expect for MR-Egger. MR-Egger is not recommended for one-sample MR unless the correlation between the gene-exposure and gene-outcome estimates due to confounding can be kept low, or the variability in instrument strength is very high.
DOI
10.1093/ije/dyab084

Reviews&Tutorials

Tool
PUBMED_LINK
29226067
TITLE
Recent Developments in Mendelian Randomization Studies.
Main citation
Zheng J, Baird D, Borges MC, Bowden J, ...&, Smith GD. (2017) Recent Developments in Mendelian Randomization Studies. Curr Epidemiol Rep, 4 (4) 330-345. doi:10.1007/s40471-017-0128-6. PMID 29226067
ABSTRACT
PURPOSE OF REVIEW: Mendelian randomization (MR) is a strategy for evaluating causality in observational epidemiological studies. MR exploits the fact that genotypes are not generally susceptible to reverse causation and confounding, due to their fixed nature and Mendel's First and Second Laws of Inheritance. MR has the potential to provide information on causality in many situations where randomized controlled trials are not possible, but the results of MR studies must be interpreted carefully to avoid drawing erroneous conclusions. RECENT FINDINGS: In this review, we outline the principles behind MR, as well as assumptions and limitations of the method. Extensions to the basic approach are discussed, including two-sample MR, bidirectional MR, two-step MR, multivariable MR, and factorial MR. We also consider some new applications and recent developments in the methodology, including its ability to inform drug development, automation of the method using tools such as MR-Base, and phenome-wide and hypothesis-free MR. SUMMARY: In conjunction with the growing availability of large-scale genomic databases, higher level of automation and increased robustness of the methods, MR promises to be a valuable strategy to examine causality in complex biological/omics networks, inform drug development and prioritize intervention targets for disease prevention in the future.
DOI
10.1007/s40471-017-0128-6

Reviews&Tutorials

Tool
PUBMED_LINK
29892602
TITLE
Common Methods for Performing Mendelian Randomization.
Main citation
Teumer A. (2018) Common Methods for Performing Mendelian Randomization. Front Cardiovasc Med, 5 () 51. doi:10.3389/fcvm.2018.00051. PMID 29892602
ABSTRACT
Mendelian randomization (MR) is a framework for assessing causal inference using cross-sectional data in combination with genetic information. This paper summarizes statistical methods commonly applied and strait forward to use for conducting MR analyses including those taking advantage of the rich dataset of SNP-trait associations that were revealed in the last decade through large-scale genome-wide association studies. Using these data, powerful MR studies are possible. However, the causal estimate may be biased in case the assumptions of MR are violated. The source and the type of this bias are described while providing a summary of the mathematical formulas that should help estimating the magnitude and direction of the potential bias depending on the specific research setting. Finally, methods for relaxing the assumptions and for conducting sensitivity analyses are discussed. Future researches in the field of MR include the assessment of non-linear causal effects, and automatic detection of invalid instruments.
DOI
10.3389/fcvm.2018.00051

Reviews&Tutorials

Tool
PUBMED_LINK
29164242
TITLE
Mendelian Randomization.
Main citation
Emdin CA, Khera AV, Kathiresan S. (2017) Mendelian Randomization. JAMA, 318 (19) 1925-1926. doi:10.1001/jama.2017.17219. PMID 29164242
DOI
10.1001/jama.2017.17219

Reviews&Tutorials

Tool
PUBMED_LINK
37325194
TITLE
Mendelian randomization.
Main citation
Sanderson E, Glymour MM, Holmes MV, Kang H, ...&, Smith GD. (2022) Mendelian randomization. Nat Rev Methods Primers, 2 () . doi:10.1038/s43586-021-00092-5. PMID 37325194
ABSTRACT
Mendelian randomization (MR) is a term that applies to the use of genetic variation to address causal questions about how modifiable exposures influence different outcomes. The principles of MR are based on Mendel's laws of inheritance and instrumental variable estimation methods, which enable the inference of causal effects in the presence of unobserved confounding. In this Primer, we outline the principles of MR, the instrumental variable conditions underlying MR estimation and some of the methods used for estimation. We go on to discuss how the assumptions underlying an MR study can be assessed and give methods of estimation that are robust to certain violations of these assumptions. We give examples of a range of studies in which MR has been applied, the limitations of current methods of analysis and the outlook for MR in the future. The difference between the assumptions required for MR analysis and other forms of non-interventional epidemiological studies means that MR can be used as part of a triangulation across multiple sources of evidence for causal inference.
DOI
10.1038/s43586-021-00092-5

Reviews&Tutorials

Tool
PUBMED_LINK
30002074
TITLE
Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians.
Main citation
Davies NM, Holmes MV, Davey Smith G. (2018) Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ, 362 () k601. doi:10.1136/bmj.k601. PMID 30002074
ABSTRACT
Mendelian randomisation uses genetic variation as a natural experiment to investigate the causal relations between potentially modifiable risk factors and health outcomes in observational data. As with all epidemiological approaches, findings from Mendelian randomisation studies depend on specific assumptions. We provide explanations of the information typically reported in Mendelian randomisation studies that can be used to assess the plausibility of these assumptions and guidance on how to interpret findings from Mendelian randomisation studies in the context of other sources of evidence
DOI
10.1136/bmj.k601

Reviews&Tutorials

Tool
PUBMED_LINK
27427429
TITLE
Commentary: Two-sample Mendelian randomization: opportunities and challenges.
Main citation
Lawlor DA. (2016) Commentary: Two-sample Mendelian randomization: opportunities and challenges. Int J Epidemiol, 45 (3) 908-15. doi:10.1093/ije/dyw127. PMID 27427429
DOI
10.1093/ije/dyw127

Tools

SMR

Tool
PUBMED_LINK
27019110
FULL NAME
Summary-data-based Mendelian Randomization
DESCRIPTION
The SMR software tool was originally developed to implement the SMR & HEIDI methods to test for pleiotropic association between the expression level of a gene and a complex trait of interest using summary-level data from GWAS and expression quantitative trait loci (eQTL) studies (Zhu et al. 2016 Nature Genetics). The SMR & HEIDI methodology can be interpreted as an analysis to test if the effect size of a SNP on the phenotype is mediated by gene expression. This tool can therefore be used to prioritize genes underlying GWAS hits for follow-up functional studies. The methods are applicable to all kinds of molecular QTL (xQTL) data, including DNA methylation QTL (mQTL) and protein abundance QTL (pQTL).
URL
https://yanglab.westlake.edu.cn/software/smr/#Overview
KEYWORDS
pleiotropy or causality, xQTL, eQTL, MR, HEIDI, linkage
TITLE
Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.
Main citation
Zhu Z, Zhang F, Hu H, Bakshi A, ...&, Yang J. (2016) Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet, 48 (5) 481-7. doi:10.1038/ng.3538. PMID 27019110
ABSTRACT
Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with human complex traits. However, the genes or functional DNA elements through which these variants exert their effects on the traits are often unknown. We propose a method (called SMR) that integrates summary-level data from GWAS with data from expression quantitative trait locus (eQTL) studies to identify genes whose expression levels are associated with a complex trait because of pleiotropy. We apply the method to five human complex traits using GWAS data on up to 339,224 individuals and eQTL data on 5,311 individuals, and we prioritize 126 genes (for example, TRAF1 and ANKRD55 for rheumatoid arthritis and SNX19 and NMRAL1 for schizophrenia), of which 25 genes are new candidates; 77 genes are not the nearest annotated gene to the top associated GWAS SNP. These genes provide important leads to design future functional studies to understand the mechanism whereby DNA variation leads to complex trait variation.
DOI
10.1038/ng.3538

SMR-multi

Tool
PUBMED_LINK
29500431
TITLE
Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.
Main citation
Wu Y, Zeng J, Zhang F, Zhu Z, ...&, Yang J. (2018) Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat Commun, 9 (1) 918. doi:10.1038/s41467-018-03371-0. PMID 29500431
ABSTRACT
The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.
DOI
10.1038/s41467-018-03371-0

Two-sample MR

Tool
PUBMED_LINK
29846171
DESCRIPTION
Two sample Mendelian randomisation (2SMR) is a method to estimate the causal effect of an exposure on an outcome using only summary statistics from genome wide association studies (GWAS). Though conceptually straightforward, there are a number of steps that are required to perform the analysis properly, and they can be cumbersome
URL
https://mrcieu.github.io/TwoSampleMR/articles/introduction.html
TITLE
The MR-Base platform supports systematic causal inference across the human phenome.
Main citation
Hemani G, Zheng J, Elsworth B, Wade KH, ...&, Haycock PC. (2018) The MR-Base platform supports systematic causal inference across the human phenome. Elife, 7 () . doi:10.7554/eLife.34408. PMID 29846171
ABSTRACT
Results from genome-wide association studies (GWAS) can be used to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR) and bypassing the need for individual-level data. However, 2SMR methods are evolving rapidly and GWAS results are often insufficiently curated, undermining efficient implementation of the approach. We therefore developed MR-Base (<ext-link ext-link-type="uri" xlink:href="http://www.mrbase.org">http://www.mrbase.org</ext-link>): a platform that integrates a curated database of complete GWAS results (no restrictions according to statistical significance) with an application programming interface, web app and R packages that automate 2SMR. The software includes several sensitivity analyses for assessing the impact of horizontal pleiotropy and other violations of assumptions. The database currently comprises 11 billion single nucleotide polymorphism-trait associations from 1673 GWAS and is updated on a regular basis. Integrating data with software ensures more rigorous application of hypothesis-driven analyses and allows millions of potential causal relationships to be efficiently evaluated in phenome-wide association studies.
DOI
10.7554/eLife.34408