Skip to content

Gene_set_pathway_analysis

Summary Table

NAME CATEGORY CITATION YEAR
FUMA MISC Watanabe K, Taskesen E, van Bochoven A, Posthuma D. (2017) Functional mapping and annotation of genetic associations with FUMA Nat. Commun., 8 (1) 1826. doi:10.1038/s41467-017-01261-5. PMID 29184056 2017
MAGMA MISC de Leeuw CA, Mooij JM, Heskes T, Posthuma D. (2015) MAGMA: generalized gene-set analysis of GWAS data PLoS Comput. Biol., 11 (4) e1004219. doi:10.1371/journal.pcbi.1004219. PMID 25885710 2015
PASCAL MISC Lamparter D, Marbach D, Rueedi R, Kutalik Z, ...&, Bergmann S. (2016) Fast and rigorous computation of gene and pathway scores from SNP-based summary statistics PLoS Comput. Biol., 12 (1) e1004714. doi:10.1371/journal.pcbi.1004714. PMID 26808494 2016
VEGAS2 MISC Mishra A, Macgregor S. (2015) VEGAS2: Software for more flexible gene-based testing Twin Res. Hum. Genet., 18 (1) 86-91. doi:10.1017/thg.2014.79. PMID 25518859 2015
reviews Review White MJ, Yaspan BL, Veatch OJ, Goddard P, ...&, Contreras MG. (2019) Strategies for pathway analysis using GWAS and WGS data Curr. Protoc. Hum. Genet., 100 (1) e79. doi:10.1002/cphg.79. PMID 30387919 2019

MISC

FUMA

  • NAME : FUMA
  • SHORT NAME : FUMA
  • FULL NAME : FUMA
  • DESCRIPTION : FUMA is a platform that can be used to annotate, prioritize, visualize and interpret GWAS results.
  • URL : https://fuma.ctglab.nl/
  • TITLE : Functional mapping and annotation of genetic associations with FUMA
  • DOI : 10.1038/s41467-017-01261-5
  • ABSTRACT : A main challenge in genome-wide association studies (GWAS) is to pinpoint possible causal variants. Results from GWAS typically do not directly translate into causal variants because the majority of hits are in non-coding or intergenic regions, and the presence of linkage disequilibrium leads to effects being statistically spread out across multiple variants. Post-GWAS annotation facilitates the selection of most likely causal variant(s). Multiple resources are available for post-GWAS annotation, yet these can be time consuming and do not provide integrated visual aids for data interpretation. We, therefore, develop FUMA: an integrative web-based platform using information from multiple biological resources to facilitate functional annotation of GWAS results, gene prioritization and interactive visualization. FUMA accommodates positional, expression quantitative trait loci (eQTL) and chromatin interaction mappings, and provides gene-based, pathway and tissue enrichment results. FUMA results directly aid in generating hypotheses that are testable in functional experiments aimed at proving causal relations.
  • CITATION : Watanabe K, Taskesen E, van Bochoven A, Posthuma D. (2017) Functional mapping and annotation of genetic associations with FUMA Nat. Commun., 8 (1) 1826. doi:10.1038/s41467-017-01261-5. PMID 29184056
  • JOURNAL_INFO : Nature communications ; Nat. Commun. ; 2017 ; 8 ; 1 ; 1826
  • PUBMED_LINK : 29184056

MAGMA

  • NAME : MAGMA
  • SHORT NAME : MAGMA
  • FULL NAME : Multi-marker Analysis of GenoMic Annotation
  • DESCRIPTION : MAGMA is a tool for gene analysis and generalized gene-set analysis of GWAS data. It can be used to analyse both raw genotype data as well as summary SNP p-values from a previous GWAS or meta-analysis.
  • URL : https://ctg.cncr.nl/software/magma
  • TITLE : MAGMA: generalized gene-set analysis of GWAS data
  • DOI : 10.1371/journal.pcbi.1004219
  • ABSTRACT : By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
  • COPYRIGHT : http://creativecommons.org/licenses/by/4.0/
  • CITATION : de Leeuw CA, Mooij JM, Heskes T, Posthuma D. (2015) MAGMA: generalized gene-set analysis of GWAS data PLoS Comput. Biol., 11 (4) e1004219. doi:10.1371/journal.pcbi.1004219. PMID 25885710
  • JOURNAL_INFO : PLoS computational biology ; PLoS Comput. Biol. ; 2015 ; 11 ; 4 ; e1004219
  • PUBMED_LINK : 25885710

PASCAL

  • NAME : PASCAL
  • SHORT NAME : PASCAL
  • FULL NAME : Pathway scoring algorithm
  • DESCRIPTION : Pascal (Pathway scoring algorithm) is an easy-to-use tool for gene scoring and pathway analysis from GWAS results.
  • URL : https://www2.unil.ch/cbg/index.php?title=Pascal
  • TITLE : Fast and rigorous computation of gene and pathway scores from SNP-based summary statistics
  • DOI : 10.1371/journal.pcbi.1004714
  • ABSTRACT : Integrating single nucleotide polymorphism (SNP) p-values from genome-wide association studies (GWAS) across genes and pathways is a strategy to improve statistical power and gain biological insight. Here, we present Pascal (Pathway scoring algorithm), a powerful tool for computing gene and pathway scores from SNP-phenotype association summary statistics. For gene score computation, we implemented analytic and efficient numerical solutions to calculate test statistics. We examined in particular the sum and the maximum of chi-squared statistics, which measure the strongest and the average association signals per gene, respectively. For pathway scoring, we use a modified Fisher method, which offers not only significant power improvement over more traditional enrichment strategies, but also eliminates the problem of arbitrary threshold selection inherent in any binary membership based pathway enrichment approach. We demonstrate the marked increase in power by analyzing summary statistics from dozens of large meta-studies for various traits. Our extensive testing indicates that our method not only excels in rigorous type I error control, but also results in more biologically meaningful discoveries.
  • COPYRIGHT : http://creativecommons.org/licenses/by/4.0/
  • CITATION : Lamparter D, Marbach D, Rueedi R, Kutalik Z, ...&, Bergmann S. (2016) Fast and rigorous computation of gene and pathway scores from SNP-based summary statistics PLoS Comput. Biol., 12 (1) e1004714. doi:10.1371/journal.pcbi.1004714. PMID 26808494
  • JOURNAL_INFO : PLoS computational biology ; PLoS Comput. Biol. ; 2016 ; 12 ; 1 ; e1004714
  • PUBMED_LINK : 26808494

VEGAS2

  • NAME : VEGAS2
  • SHORT NAME : VEGAS2
  • FULL NAME : Versatile Gene-based Association Study - 2 version 2
  • DESCRIPTION : This is the VEGAS2 web platform. Here, user can perfome the gene-based and pathway-based analyses on GWAS summary data using VEGAS and VEGAS2Pathway approaches respectively. It is publically available for non-commercial use.
  • URL : https://vegas2.qimrberghofer.edu.au/
  • TITLE : VEGAS2: Software for more flexible gene-based testing
  • DOI : 10.1017/thg.2014.79
  • ABSTRACT : Gene-based tests such as versatile gene-based association study (VEGAS) are commonly used following per-single nucleotide polymorphism (SNP) GWAS (genome-wide association studies) analysis. Two limitations of VEGAS were that the HapMap2 reference set was used to model the correlation between SNPs and only autosomal genes were considered. HapMap2 has now been superseded by the 1,000 Genomes reference set, and whereas early GWASs frequently ignored the X chromosome, it is now commonly included. Here we have developed VEGAS2, an extension that uses 1,000 Genomes data to model SNP correlations across the autosomes and chromosome X. VEGAS2 allows greater flexibility when defining gene boundaries. VEGAS2 offers both a user-friendly, web-based front end and a command line Linux version. The online version of VEGAS2 can be accessed through https://vegas2.qimrberghofer.edu.au/. The command line version can be downloaded from https://vegas2.qimrberghofer.edu.au/zVEGAS2offline.tgz. The command line version is developed in Perl, R and shell scripting languages; source code is available for further development.
  • CITATION : Mishra A, Macgregor S. (2015) VEGAS2: Software for more flexible gene-based testing Twin Res. Hum. Genet., 18 (1) 86-91. doi:10.1017/thg.2014.79. PMID 25518859
  • JOURNAL_INFO : Twin research and human genetics: the official journal of the International Society for Twin Studies ; Twin Res. Hum. Genet. ; 2015 ; 18 ; 1 ; 86-91
  • PUBMED_LINK : 25518859

Review

reviews

  • NAME : reviews
  • TITLE : Strategies for pathway analysis using GWAS and WGS data
  • DOI : 10.1002/cphg.79
  • ABSTRACT : Single-allele study designs, commonly used in genome-wide association studies (GWAS) as well as the more recently developed whole genome sequencing (WGS) studies, are a standard approach for investigating the relationship of common variation within the human genome to a given phenotype of interest. However, single-allele association results published for many GWAS studies represent only the tip of the iceberg for the information that can be extracted from these datasets. The primary analysis strategy for GWAS entails association analysis in which only the single nucleotide polymorphisms (SNPs) with the strongest p-values are declared statistically significant due to issues arising from multiple testing and type I errors. Factors such as locus heterogeneity, epistasis, and multiple genes conferring small effects contribute to the complexity of the genetic models underlying phenotype expression. Thus, many biologically meaningful associations having lower effect sizes at individual genes are overlooked, making it difficult to separate true associations from a sea of false-positive associations. Organizing these individual SNPs into biologically meaningful groups to look at the overall effects of minor perturbations to genes and pathways is desirable. This pathway-based approach provides researchers with insight into the functional foundations of the phenotype being studied and allows testing of various genetic scenarios. © 2018 by John Wiley & Sons, Inc.
  • COPYRIGHT : http://onlinelibrary.wiley.com/termsAndConditions#vor
  • CITATION : White MJ, Yaspan BL, Veatch OJ, Goddard P, ...&, Contreras MG. (2019) Strategies for pathway analysis using GWAS and WGS data Curr. Protoc. Hum. Genet., 100 (1) e79. doi:10.1002/cphg.79. PMID 30387919
  • JOURNAL_INFO : et al [Current protocols in human genetics] ; Curr. Protoc. Hum. Genet. ; 2019 ; 100 ; 1 ; e79
  • PUBMED_LINK : 30387919