Skip to content

Tools Association tests TWAS

Curation of TWAS within Association tests — listings under the GWAS Tools tab.

Summary Table

Click a column header to sort the table.

NAME Main citation YEAR
FUSION
Gusev A et al., Nat Genet, 2016
2016
MetaXcan
Barbeira AN et al., Nat Commun, 2018
2018
MultiXcan
Barbeira AN et al., PLoS Genet, 2019
2019
OTTERS
Dai Q et al., Nat Commun, 2023
2023
PTWAS
Zhang Y et al., Genome Biol, 2020
2020
PrediXcan
Gamazon ER et al., Nat Genet, 2015
2015
S-PrediXcan
Barbeira AN et al., Nat Commun, 2018
2018
TGVIS
Yang Y et al., Nat Commun, 2025
2025
TWAS hub
Mancuso N et al., Am J Hum Genet, 2017
2017
cTWAS
Zhao S et al., Nat Genet, 2024
2024
scTWAS
Lin Z et al., Nat Commun, 2026
2026
webTWAS
Cao C et al., Nucleic Acids Res, 2022
2022

FUSION

Tool
PUBMED_LINK
26854917
FULL NAME
Functional Summary-based Imputation
DESCRIPTION
FUSION is a suite of tools for performing transcriptome-wide and regulome-wide association studies (TWAS and RWAS). FUSION builds predictive models of the genetic component of a functional/molecular phenotype and predicts and tests that component for association with disease using GWAS summary statistics. The goal is to identify associations between a GWAS phenotype and a functional phenotype that was only measured in reference data. We provide precomputed predictive models from multiple studies to facilitate this analysis.
URL
http://gusevlab.org/projects/fusion/
TITLE
Integrative approaches for large-scale transcriptome-wide association studies.
Main citation
Gusev A, Ko A, Shi H, Bhatia G, ...&, Pasaniuc B. (2016) Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet, 48 (3) 245-52. doi:10.1038/ng.3506. PMID 26854917
ABSTRACT
Many genetic variants influence complex traits by modulating gene expression, thus altering the abundance of one or multiple proteins. Here we introduce a powerful strategy that integrates gene expression measurements with summary association statistics from large-scale genome-wide association studies (GWAS) to identify genes whose cis-regulated expression is associated with complex traits. We leverage expression imputation from genetic data to perform a transcriptome-wide association study (TWAS) to identify significant expression-trait associations. We applied our approaches to expression data from blood and adipose tissue measured in ∼ 3,000 individuals overall. We imputed gene expression into GWAS data from over 900,000 phenotype measurements to identify 69 new genes significantly associated with obesity-related traits (BMI, lipids and height). Many of these genes are associated with relevant phenotypes in the Hybrid Mouse Diversity Panel. Our results showcase the power of integrating genotype, gene expression and phenotype to gain insights into the genetic basis of complex traits.
DOI
10.1038/ng.3506

MetaXcan

Tool
PUBMED_LINK
29739930
DESCRIPTION
MetaXcan is a set of tools to integrate genomic information of biological mechanisms with complex traits.
URL
https://github.com/hakyimlab/MetaXcan
TITLE
Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics.
Main citation
Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, ...&, Im HK. (2018) Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun, 9 (1) 1825. doi:10.1038/s41467-018-03621-1. PMID 29739930
ABSTRACT
Scalable, integrative methods to understand mechanisms that link genetic variants with phenotypes are needed. Here we derive a mathematical expression to compute PrediXcan (a gene mapping approach) results using summary data (S-PrediXcan) and show its accuracy and general robustness to misspecified reference sets. We apply this framework to 44 GTEx tissues and 100+ phenotypes from GWAS and meta-analysis studies, creating a growing public catalog of associations that seeks to capture the effects of gene expression variation on human phenotypes. Replication in an independent cohort is shown. Most of the associations are tissue specific, suggesting context specificity of the trait etiology. Colocalized significant associations in unexpected tissues underscore the need for an agnostic scanning of multiple contexts to improve our ability to detect causal regulatory mechanisms. Monogenic disease genes are enriched among significant associations for related traits, suggesting that smaller alterations of these genes may cause a spectrum of milder phenotypes.
DOI
10.1038/s41467-018-03621-1

MultiXcan

Tool
PUBMED_LINK
30668570
DESCRIPTION
an efficient statistical method (MultiXcan) that leverages the substantial sharing of eQTLs across tissues and contexts to improve our ability to identify potential target genes. MultiXcan integrates evidence across multiple panels using multivariate regression, which naturally takes into account the correlation structure.
URL
https://github.com/hakyimlab/MetaXcan
TITLE
Integrating predicted transcriptome from multiple tissues improves association detection.
Main citation
Barbeira AN, Pividori M, Zheng J, Wheeler HE, ...&, Im HK. (2019) Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet, 15 (1) e1007889. doi:10.1371/journal.pgen.1007889. PMID 30668570
ABSTRACT
Integration of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is needed to improve our understanding of the biological mechanisms underlying GWAS hits, and our ability to identify therapeutic targets. Gene-level association methods such as PrediXcan can prioritize candidate targets. However, limited eQTL sample sizes and absence of relevant developmental and disease context restrict our ability to detect associations. Here we propose an efficient statistical method (MultiXcan) that leverages the substantial sharing of eQTLs across tissues and contexts to improve our ability to identify potential target genes. MultiXcan integrates evidence across multiple panels using multivariate regression, which naturally takes into account the correlation structure. We apply our method to simulated and real traits from the UK Biobank and show that, in realistic settings, we can detect a larger set of significantly associated genes than using each panel separately. To improve applicability, we developed a summary result-based extension called S-MultiXcan, which we show yields highly concordant results with the individual level version when LD is well matched. Our multivariate model-based approach allowed us to use the individual level results as a gold standard to calibrate and develop a robust implementation of the summary-based extension. Results from our analysis as well as software and necessary resources to apply our method are publicly available.
DOI
10.1371/journal.pgen.1007889

OTTERS

Tool
PUBMED_LINK
36882394
FULL NAME
Omnibus Transcriptome Test using Expression Reference Summary data
DESCRIPTION
Dai, Q. et al. OTTERS: a powerful TWAS framework leveraging summary-level reference data. Nat. Commun. 14, 1271 (2023).
URL
https://github.com/daiqile96/OTTERS
TITLE
OTTERS: a powerful TWAS framework leveraging summary-level reference data.
Main citation
Dai Q, Zhou G, Zhao H, Võsa U, ...&, Yang J. (2023) OTTERS: a powerful TWAS framework leveraging summary-level reference data. Nat Commun, 14 (1) 1271. doi:10.1038/s41467-023-36862-w. PMID 36882394
ABSTRACT
Most existing TWAS tools require individual-level eQTL reference data and thus are not applicable to summary-level reference eQTL datasets. The development of TWAS methods that can harness summary-level reference data is valuable to enable TWAS in broader settings and enhance power due to increased reference sample size. Thus, we develop a TWAS framework called OTTERS (Omnibus Transcriptome Test using Expression Reference Summary data) that adapts multiple polygenic risk score (PRS) methods to estimate eQTL weights from summary-level eQTL reference data and conducts an omnibus TWAS. We show that OTTERS is a practical and powerful TWAS tool by both simulations and application studies.
DOI
10.1038/s41467-023-36862-w

PTWAS

Tool
PUBMED_LINK
32912253
FULL NAME
probabilistic TWAS
URL
https://github.com/xqwen/ptwas
KEYWORDS
TWAS, instrumental variables
TITLE
PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis.
Main citation
Zhang Y, Quick C, Yu K, Barbeira A, ...&, Wen X. (2020) PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis. Genome Biol, 21 (1) 232. doi:10.1186/s13059-020-02026-y. PMID 32912253
ABSTRACT
We propose a new computational framework, probabilistic transcriptome-wide association study (PTWAS), to investigate causal relationships between gene expressions and complex traits. PTWAS applies the established principles from instrumental variables analysis and takes advantage of probabilistic eQTL annotations to delineate and tackle the unique challenges arising in TWAS. PTWAS not only confers higher power than the existing methods but also provides novel functionalities to evaluate the causal assumptions and estimate tissue- or cell-type-specific gene-to-trait effects. We illustrate the power of PTWAS by analyzing the eQTL data across 49 tissues from GTEx (v8) and GWAS summary statistics from 114 complex traits.
DOI
10.1186/s13059-020-02026-y

PrediXcan

Tool
PUBMED_LINK
26258848
DESCRIPTION
(deprecated) PrediXcan is a gene-based association test that prioritizes genes that are likely to be causal for the phenotype.
URL
https://github.com/hakyimlab/PrediXcan
TITLE
A gene-based association method for mapping traits using reference transcriptome data.
Main citation
Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, ...&, Im HK. (2015) A gene-based association method for mapping traits using reference transcriptome data. Nat Genet, 47 (9) 1091-8. doi:10.1038/ng.3367. PMID 26258848
ABSTRACT
Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual's genetic profile and correlates 'imputed' gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys the benefits of gene-based approaches such as reduced multiple-testing burden and a principled approach to the design of follow-up experiments. Our results demonstrate that PrediXcan can detect known and new genes associated with disease traits and provide insights into the mechanism of these associations.
DOI
10.1038/ng.3367

S-PrediXcan

Tool
PUBMED_LINK
29739930
DESCRIPTION
a mathematical expression to compute PrediXcan (a gene mapping approach) results using summary data (S-PrediXcan)
URL
https://github.com/hakyimlab/MetaXcan
TITLE
Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics.
Main citation
Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, ...&, Im HK. (2018) Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun, 9 (1) 1825. doi:10.1038/s41467-018-03621-1. PMID 29739930
ABSTRACT
Scalable, integrative methods to understand mechanisms that link genetic variants with phenotypes are needed. Here we derive a mathematical expression to compute PrediXcan (a gene mapping approach) results using summary data (S-PrediXcan) and show its accuracy and general robustness to misspecified reference sets. We apply this framework to 44 GTEx tissues and 100+ phenotypes from GWAS and meta-analysis studies, creating a growing public catalog of associations that seeks to capture the effects of gene expression variation on human phenotypes. Replication in an independent cohort is shown. Most of the associations are tissue specific, suggesting context specificity of the trait etiology. Colocalized significant associations in unexpected tissues underscore the need for an agnostic scanning of multiple contexts to improve our ability to detect causal regulatory mechanisms. Monogenic disease genes are enriched among significant associations for related traits, suggesting that smaller alterations of these genes may cause a spectrum of milder phenotypes.
DOI
10.1038/s41467-018-03621-1

TGVIS

TWAS Gene prioritization Fine mapping Tool Summary statistics
PUBMED_LINK
40603866
FULL NAME
Tissue-Gene pairs, direct causal Variants, and Infinitesimal effects selector
DESCRIPTION
Multivariate TWAS approach that prioritizes causal gene–tissue pairs and candidate causal variants from GWAS summary data while explicitly controlling for genome-wide infinitesimal (polygenic) effects that can otherwise inflate false gene discoveries.
URL
https://github.com/harryyiheyang/TGVIS ,https://doi.org/10.1038/s41467-025-61423-8
KEYWORDS
multivariate TWAS, infinitesimal model, causal gene-tissue, eQTL, sQTL
TITLE
Uncovering causal gene-tissue pairs and variants through a multivariate TWAS controlling for infinitesimal effects.
Main citation
Yang Y, Lorincz-Comi N, Zhu X. (2025) Uncovering causal gene-tissue pairs and variants through a multivariate TWAS controlling for infinitesimal effects. Nat Commun, 16 (1) 6098. doi:10.1038/s41467-025-61423-8. PMID 40603866
ABSTRACT
Transcriptome-wide association studies (TWAS) are commonly used to prioritize causal genes underlying associations found in genome-wide association studies (GWAS) and have been extended to identify causal genes through multivariate TWAS methods. However, recent studies have shown that widespread infinitesimal effects due to polygenicity can impair the performance of these methods. In this report, we introduce a multivariate TWAS method named tissue-gene pairs, direct causal variants, and infinitesimal effects selector (TGVIS) to identify tissue-specific causal genes and direct causal variants while accounting for infinitesimal effects. In simulations, TGVIS maintains an accurate prioritization of causal gene-tissue pairs and variants and demonstrates comparable or superior power to existing approaches, regardless of the presence of infinitesimal effects. In the real data analysis of GWAS summary data of 45 cardiometabolic traits and expression/splicing quantitative trait loci from 31 tissues, TGVIS is able to improve causal gene prioritization and identifies novel genes that were missed by conventional TWAS.
DOI
10.1038/s41467-025-61423-8

TWAS hub

Tool
PUBMED_LINK
28238358
DESCRIPTION
TWAS hub is an interactive browser of results from integrative analyses of GWAS and functional data for hundreds of traits and >100k expression models. The aim is facilitate the investigation of individual TWAS associations; pleiotropic disease/trait associations for a given gene of interest; predicted gene associations for a given disease/trait of interest with detailed per-locus statistics; and pleiotropic relationships between traits based on shared associated genes.
URL
http://twas-hub.org
TITLE
Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits.
Main citation
Mancuso N, Shi H, Goddard P, Kichaev G, ...&, Pasaniuc B. (2017) Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits. Am J Hum Genet, 100 (3) 473-487. doi:10.1016/j.ajhg.2017.01.031. PMID 28238358
ABSTRACT
Although genome-wide association studies (GWASs) have identified thousands of risk loci for many complex traits and diseases, the causal variants and genes at these loci remain largely unknown. Here, we introduce a method for estimating the local genetic correlation between gene expression and a complex trait and utilize it to estimate the genetic correlation due to predicted expression between pairs of traits. We integrated gene expression measurements from 45 expression panels with summary GWAS data to perform 30 multi-tissue transcriptome-wide association studies (TWASs). We identified 1,196 genes whose expression is associated with these traits; of these, 168 reside more than 0.5 Mb away from any previously reported GWAS significant variant. We then used our approach to find 43 pairs of traits with significant genetic correlation at the level of predicted expression; of these, eight were not found through genetic correlation at the SNP level. Finally, we used bi-directional regression to find evidence that BMI causally influences triglyceride levels and that triglyceride levels causally influence low-density lipoprotein. Together, our results provide insight into the role of gene expression in the susceptibility of complex traits and diseases.
DOI
10.1016/j.ajhg.2017.01.031

cTWAS

Tool
PUBMED_LINK
38279041
FULL NAME
causal-TWAS
DESCRIPTION
Expression Quantitative Trait Loci (eQTLs) have often been used to nominate candidate genes from Genome-wide association studies (GWAS). However, commonly used methods are susceptible to false positives largely due to Linkage Disequilibrium of eQTLs with causal variants acting on the phenotype directly. Our method, causal-TWAS (cTWAS), addressed this challenge by borrowing ideas from statistical fine-mapping. It is a generalization of Transcriptome-wide association studies (TWAS), but when analyzing any gene, it adjusts for other nearby genes and all nearby genetic variants.
URL
https://xinhe-lab.github.io/ctwas/
KEYWORDS
TWAS, fine-mapping
TITLE
Adjusting for genetic confounders in transcriptome-wide association studies improves discovery of risk genes of complex traits.
Main citation
Zhao S, Crouse W, Qian S, Luo K, ...&, He X. (2024) Adjusting for genetic confounders in transcriptome-wide association studies improves discovery of risk genes of complex traits. Nat Genet, 56 (2) 336-347. doi:10.1038/s41588-023-01648-9. PMID 38279041
ABSTRACT
Many methods have been developed to leverage expression quantitative trait loci (eQTL) data to nominate candidate genes from genome-wide association studies. These methods, including colocalization, transcriptome-wide association studies (TWAS) and Mendelian randomization-based methods; however, all suffer from a key problem-when assessing the role of a gene in a trait using its eQTLs, nearby variants and genetic components of other genes' expression may be correlated with these eQTLs and have direct effects on the trait, acting as potential confounders. Our extensive simulations showed that existing methods fail to account for these 'genetic confounders', resulting in severe inflation of false positives. Our new method, causal-TWAS (cTWAS), borrows ideas from statistical fine-mapping and allows us to adjust all genetic confounders. cTWAS showed calibrated false discovery rates in simulations, and its application on several common traits discovered new candidate genes. In conclusion, cTWAS provides a robust statistical framework for gene discovery.
DOI
10.1038/s41588-023-01648-9

scTWAS

TWAS Single cell scRNA-seq Tool Summary statistics
PUBMED_LINK
41820391
DESCRIPTION
Statistical framework for cell-type-resolved transcriptome-wide association using single-cell RNA-seq: models sparsity and technical noise via latent variables and moment-based estimation to improve genetically regulated expression prediction and gene–trait discovery.
URL
https://github.com/ZhaotongL/scTWAS ,https://doi.org/10.1038/s41467-026-70374-7
KEYWORDS
TWAS, single-cell, cell-type-specific, latent variable, GReX
TITLE
scTWAS: a powerful statistical framework for single-cell transcriptome-wide association studies.
Main citation
Lin Z, Su C. (2026) scTWAS: a powerful statistical framework for single-cell transcriptome-wide association studies. Nat Commun, () . doi:10.1038/s41467-026-70374-7. PMID 41820391
ABSTRACT
Transcriptome-wide association studies (TWAS) have successfully identified genes associated with complex traits and diseases, but most have been performed using bulk gene expression data, which aggregate signals across heterogeneous cell types. Population-scale single-cell RNA sequencing data now make it possible to perform TWAS at the cell-type resolution, but present unique challenges due to strong noises, technical variations, and high sparsity. Here, we propose scTWAS, a statistical method to conduct cell-type-specific TWAS using single-cell data. Leveraging a latent-variable model and moment-based estimation to address the challenges of single-cell data, scTWAS consistently improves the prediction of genetically regulated gene expression across cell types in both blood and brain tissues. Compared to existing methods, scTWAS identifies substantially more gene-trait associations across 29 hematological traits and three immune-related diseases in immune cell types. An application to Alzheimer's disease also reveals cell-subtype-specific associations, including MS4A6A in the disease-associated microglial subtype and PPP1R37 in the inflammatory microglial subtype.
DOI
10.1038/s41467-026-70374-7

webTWAS

Tool
PUBMED_LINK
34669946
DESCRIPTION
a resource for disease candidate susceptibility genes identified by transcriptome-wide association study
URL
http://www.webtwas.net/#/
TITLE
webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study.
Main citation
Cao C, Wang J, Kwok D, Cui F, ...&, Zou Q. (2022) webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Res, 50 (D1) D1123-D1130. doi:10.1093/nar/gkab957. PMID 34669946
ABSTRACT
The development of transcriptome-wide association studies (TWAS) has enabled researchers to better identify and interpret causal genes in many diseases. However, there are currently no resources providing a comprehensive listing of gene-disease associations discovered by TWAS from published GWAS summary statistics. TWAS analyses are also difficult to conduct due to the complexity of TWAS software pipelines. To address these issues, we introduce a new resource called webTWAS, which integrates a database of the most comprehensive disease GWAS datasets currently available with credible sets of potential causal genes identified by multiple TWAS software packages. Specifically, a total of 235 064 gene-diseases associations for a wide range of human diseases are prioritized from 1298 high-quality downloadable European GWAS summary statistics. Associations are calculated with seven different statistical models based on three popular and representative TWAS software packages. Users can explore associations at the gene or disease level, and easily search for related studies or diseases using the MeSH disease tree. Since the effects of diseases are highly tissue-specific, webTWAS applies tissue-specific enrichment analysis to identify significant tissues. A user-friendly web server is also available to run custom TWAS analyses on user-provided GWAS summary statistics data. webTWAS is freely available at http://www.webtwas.net.
DOI
10.1093/nar/gkab957