Tools Gene set pathway analysis
Curation of Gene set pathway analysis — listings under the GWAS Tools tab.
Summary Table
Click a column header to sort the table.
| NAME | CATEGORY | Main citation | YEAR |
|---|---|---|---|
| FUMA | MISC | Watanabe K et al., Nat Commun, 2017 |
2017 |
| MAGMA | MISC | de Leeuw CA et al., PLoS Comput Biol, 2015 |
2015 |
| PASCAL | MISC | Lamparter D et al., PLoS Comput Biol, 2016 |
2016 |
| VEGAS2 | MISC | Mishra A et al., Twin Res Hum Genet, 2015 |
2015 |
| reviews | Review | White MJ et al., Curr Protoc Hum Genet, 2019 |
2019 |
MISC
FUMA
PUBMED_LINK
DESCRIPTION
FUMA is a platform that can be used to annotate, prioritize, visualize and interpret GWAS results.
URL
TITLE
Functional mapping and annotation of genetic associations with FUMA.
Main citation
Watanabe K, Taskesen E, van Bochoven A, Posthuma D. (2017) Functional mapping and annotation of genetic associations with FUMA. Nat Commun, 8 (1) 1826. doi:10.1038/s41467-017-01261-5. PMID 29184056
ABSTRACT
A main challenge in genome-wide association studies (GWAS) is to pinpoint possible causal variants. Results from GWAS typically do not directly translate into causal variants because the majority of hits are in non-coding or intergenic regions, and the presence of linkage disequilibrium leads to effects being statistically spread out across multiple variants. Post-GWAS annotation facilitates the selection of most likely causal variant(s). Multiple resources are available for post-GWAS annotation, yet these can be time consuming and do not provide integrated visual aids for data interpretation. We, therefore, develop FUMA: an integrative web-based platform using information from multiple biological resources to facilitate functional annotation of GWAS results, gene prioritization and interactive visualization. FUMA accommodates positional, expression quantitative trait loci (eQTL) and chromatin interaction mappings, and provides gene-based, pathway and tissue enrichment results. FUMA results directly aid in generating hypotheses that are testable in functional experiments aimed at proving causal relations.
DOI
10.1038/s41467-017-01261-5
MAGMA
PUBMED_LINK
FULL NAME
Multi-marker Analysis of GenoMic Annotation
DESCRIPTION
MAGMA is a tool for gene analysis and generalized gene-set analysis of GWAS data. It can be used to analyse both raw genotype data as well as summary SNP p-values from a previous GWAS or meta-analysis.
URL
TITLE
MAGMA: generalized gene-set analysis of GWAS data.
Main citation
de Leeuw CA, Mooij JM, Heskes T, Posthuma D. (2015) MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol, 11 (4) e1004219. doi:10.1371/journal.pcbi.1004219. PMID 25885710
ABSTRACT
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
DOI
10.1371/journal.pcbi.1004219
PASCAL
PUBMED_LINK
FULL NAME
Pathway scoring algorithm
DESCRIPTION
Pascal (Pathway scoring algorithm) is an easy-to-use tool for gene scoring and pathway analysis from GWAS results.
URL
TITLE
Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics.
Main citation
Lamparter D, Marbach D, Rueedi R, Kutalik Z, ...&, Bergmann S. (2016) Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics. PLoS Comput Biol, 12 (1) e1004714. doi:10.1371/journal.pcbi.1004714. PMID 26808494
ABSTRACT
Integrating single nucleotide polymorphism (SNP) p-values from genome-wide association studies (GWAS) across genes and pathways is a strategy to improve statistical power and gain biological insight. Here, we present Pascal (Pathway scoring algorithm), a powerful tool for computing gene and pathway scores from SNP-phenotype association summary statistics. For gene score computation, we implemented analytic and efficient numerical solutions to calculate test statistics. We examined in particular the sum and the maximum of chi-squared statistics, which measure the strongest and the average association signals per gene, respectively. For pathway scoring, we use a modified Fisher method, which offers not only significant power improvement over more traditional enrichment strategies, but also eliminates the problem of arbitrary threshold selection inherent in any binary membership based pathway enrichment approach. We demonstrate the marked increase in power by analyzing summary statistics from dozens of large meta-studies for various traits. Our extensive testing indicates that our method not only excels in rigorous type I error control, but also results in more biologically meaningful discoveries.
DOI
10.1371/journal.pcbi.1004714
VEGAS2
PUBMED_LINK
FULL NAME
Versatile Gene-based Association Study - 2 version 2
DESCRIPTION
This is the VEGAS2 web platform. Here, user can perfome the gene-based and pathway-based analyses on GWAS summary data using VEGAS and VEGAS2Pathway approaches respectively. It is publically available for non-commercial use.
URL
TITLE
VEGAS2: Software for More Flexible Gene-Based Testing.
Main citation
Mishra A, Macgregor S. (2015) VEGAS2: Software for More Flexible Gene-Based Testing. Twin Res Hum Genet, 18 (1) 86-91. doi:10.1017/thg.2014.79. PMID 25518859
ABSTRACT
Gene-based tests such as versatile gene-based association study (VEGAS) are commonly used following per-single nucleotide polymorphism (SNP) GWAS (genome-wide association studies) analysis. Two limitations of VEGAS were that the HapMap2 reference set was used to model the correlation between SNPs and only autosomal genes were considered. HapMap2 has now been superseded by the 1,000 Genomes reference set, and whereas early GWASs frequently ignored the X chromosome, it is now commonly included. Here we have developed VEGAS2, an extension that uses 1,000 Genomes data to model SNP correlations across the autosomes and chromosome X. VEGAS2 allows greater flexibility when defining gene boundaries. VEGAS2 offers both a user-friendly, web-based front end and a command line Linux version. The online version of VEGAS2 can be accessed through https://vegas2.qimrberghofer.edu.au/. The command line version can be downloaded from https://vegas2.qimrberghofer.edu.au/zVEGAS2offline.tgz. The command line version is developed in Perl, R and shell scripting languages; source code is available for further development.
DOI
10.1017/thg.2014.79
Review
reviews
PUBMED_LINK
TITLE
Strategies for Pathway Analysis Using GWAS and WGS Data.
Main citation
White MJ, Yaspan BL, Veatch OJ, Goddard P, ...&, Contreras MG. (2019) Strategies for Pathway Analysis Using GWAS and WGS Data. Curr Protoc Hum Genet, 100 (1) e79. doi:10.1002/cphg.79. PMID 30387919
ABSTRACT
Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest p-values are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley & Sons, Inc.
DOI
10.1002/cphg.79