Skip to content

GWAS

Catalog entries using this tag (links open the entry card on its page):

Entries

IMRP-GxE

GWAS G×E Mendelian randomization Summary statistics Tool
PUBMED_LINK
38649715
FULL NAME
Mendelian randomization-based genome-wide screening for gene–environment interactions
DESCRIPTION
Screens for combined gene–environment interaction and environmental mediation by testing departure of marginal GWAS effects from GWIS main effects using an MR-style statistic (IMRP), applicable to summary statistics from separate GWAS and interaction meta-analyses.
URL
https://www.nature.com/articles/s41467-024-47806-3 ,https://github.com/XiaofengZhuCase/IMRP23 ,https://zenodo.org/records/10815731
KEYWORDS
GWAS, GWIS, G×E, Mendelian randomization, IMRP, summary statistics
TITLE
An approach to identify gene-environment interactions and reveal new biological insight in complex traits.
Main citation
Zhu X, Yang Y, Lorincz-Comi N, Li G, Bentley AR, de Vries PS, ...&, Aschard H. (2024) An approach to identify gene-environment interactions and reveal new biological insight in complex traits. Nat Commun, 15 (1) 3385. doi:10.1038/s41467-024-47806-3. PMID 38649715
ABSTRACT
There is a long-standing debate about the magnitude of the contribution of gene-environment interactions to phenotypic variations of complex traits owing to the low statistical power and few reported interactions to date. To address this issue, the Gene-Lifestyle Interactions Working Group within the Cohorts for Heart and Aging Research in Genetic Epidemiology Consortium has been spearheading efforts to investigate G×E in large and diverse samples through meta-analysis. Here, we present a powerful new approach to screen for interactions across the genome, an approach that shares substantial similarity to the Mendelian randomization framework. We identify and confirm 5 loci (6 independent signals) interacted with either cigarette smoking or alcohol consumption for serum lipids, and empirically demonstrate that interaction and mediation are the major contributors to genetic effect size heterogeneity across populations. The estimated lower bound of the interaction and environmentally mediated heritability is significant (P < 0.02) for low-density lipoprotein cholesterol and triglycerides in Cross-Population data. Our study improves the understanding of the genetic architecture and environmental contributions to complex traits.
DOI
10.1038/s41467-024-47806-3
ARROW_SUMMARY
Inputs: GWAS + GWIS summary stats (per-SNP β/SE; LD-pruned instruments; sample-overlap ρ if needed) → IMRP θ → T_MR-GxE (G×E + mediation)

PP-GWAS

GWAS Privacy-preserving GWAS Tool Summary statistics
PUBMED_LINK
41365878
DESCRIPTION
Privacy-preserving framework for multi-site GWAS on quantitative traits using a distributed linear mixed model and randomized encoding so servers never see raw genotypes or phenotypes—only obfuscated intermediates—while improving speed versus several cryptographic baselines.
URL
https://github.com/mdppml/PP-GWAS ,https://doi.org/10.1038/s41467-025-66771-z
KEYWORDS
Privacy-preserving GWAS, multi-site, quantitative traits, federated analysis
TITLE
PP-GWAS: Privacy Preserving Multi-Site Genome-wide Association Studies.
Main citation
Swaminathan A, Hannemann A, Ünal AB, Pfeifer N, ...&, Akgün M. (2025) PP-GWAS: Privacy Preserving Multi-Site Genome-wide Association Studies. Nat Commun, 16 (1) 11030. doi:10.1038/s41467-025-66771-z. PMID 41365878
ABSTRACT
Genome-wide association studies help uncover genetic influences on complex traits and diseases. Importantly, multi-site data collaborations enhance the statistical power of these studies but pose challenges due to the sensitivity of genomic data. Existing privacy-preserving approaches to performing multi-site genome-wide association studies rely on computationally expensive cryptographic techniques, which limit applicability. To address this, we present PP-GWAS, a privacy-preserving algorithm that improves efficiency and scalability while maintaining data privacy. Our method leverages randomized encoding within a distributed framework to perform stacked ridge regression on a linear mixed model, enabling robust analysis of quantitative phenotypes. We show experimentally using real-world and synthetic data that our approach achieves twice the computational speed of comparable methods while reducing resource consumption.
DOI
10.1038/s41467-025-66771-z

REGENIE

GWAS
PUBMED_LINK
34017140
DESCRIPTION
regenie is a C++ program for whole genome regression modelling of large genome-wide association studies. It is developed and supported by a team of scientists at the Regeneron Genetics Center.
URL
https://github.com/rgcgithub/regenie
KEYWORDS
whole genome regression
TITLE
Computationally efficient whole-genome regression for quantitative and binary traits.
Main citation
Mbatchou J, Barnard L, Backman J, Marcketta A, ...&, Marchini J. (2021) Computationally efficient whole-genome regression for quantitative and binary traits. Nat Genet, 53 (7) 1097-1103. doi:10.1038/s41588-021-00870-7. PMID 34017140
ABSTRACT
Genome-wide association analysis of cohorts with thousands of phenotypes is computationally expensive, particularly when accounting for sample relatedness or population structure. Here we present a novel machine-learning method called REGENIE for fitting a whole-genome regression model for quantitative and binary phenotypes that is substantially faster than alternatives in multi-trait analyses while maintaining statistical efficiency. The method naturally accommodates parallel analysis of multiple phenotypes and requires only local segments of the genotype matrix to be loaded in memory, in contrast to existing alternatives, which must load genome-wide matrices into memory. This results in substantial savings in compute time and memory usage. We introduce a fast, approximate Firth logistic regression test for unbalanced case-control phenotypes. The method is ideally suited to take advantage of distributed computing frameworks. We demonstrate the accuracy and computational benefits of this approach using the UK Biobank dataset with up to 407,746 individuals.
DOI
10.1038/s41588-021-00870-7

seismic

GWAS Single cell scRNA-seq Gene prioritization Tool
PUBMED_LINK
41034207
FULL NAME
Single-cell Expression Integration System for Mapping genetically Implicated Cell types
DESCRIPTION
R framework that links GWAS signals to single-cell-defined cell types via a cell-type gene specificity score (expression magnitude and consistency) and regression on gene-level association statistics, with influential-gene follow-up for interpretability.
URL
https://github.com/ylaboratory/seismic ,https://ylaboratory.github.io/seismic/ ,https://doi.org/10.1038/s41467-025-63753-z
KEYWORDS
GWAS, scRNA-seq, cell type, MAGMA, post-GWAS interpretation
TITLE
Disentangling associations between complex traits and cell types with seismic.
Main citation
Lai Q, Dannenfelser R, Roussarie JP, Yao V. (2025) Disentangling associations between complex traits and cell types with seismic. Nat Commun, 16 (1) 8744. doi:10.1038/s41467-025-63753-z. PMID 41034207
ABSTRACT
Integrating single-cell RNA sequencing with Genome-Wide Association Studies (GWAS) can uncover cell types involved in complex traits and disease. However, current methods often lack scalability, interpretability, and robustness. We present seismic, a framework that computes a novel specificity score capturing both expression magnitude and consistency across cell types and introduces influential gene analysis, an approach to identify genes driving each cell type-trait association. Across over 1000 cell-type characterizations at different granularities and 28 polygenic traits, seismic corroborates known associations and uncovers trait-relevant cell groups not apparent through other methodologies. In Parkinson's and Alzheimer's, seismic unveils both cell- and brain-region-specific differences in pathology. Analyzing a pathology-based Alzheimer's GWAS with seismic enables the identification of vulnerable neuron populations and molecular pathways implicated in their neurodegeneration. In general, seismic is a computationally efficient, powerful, and interpretable approach for mapping the relationships between polygenic traits and cell-type-specific expression, offering new insights into disease mechanisms.
DOI
10.1038/s41467-025-63753-z

AD gut microbiome host genetics (Chinese cohort)

GWAS
PUBMED_LINK
41782023
DESCRIPTION
Joint host whole-genome sequencing and gut microbiome profiling in 252 Chinese individuals with graded cognitive disability. Microbiome GWAS for latent enterosignature (Anaerostipes-enriched) abundance; integrates AD polygenic risk and brain cell-type expression context. Open-access report in Microbiome.
URL
https://link.springer.com/article/10.1186/s40168-026-02342-8
Main citation
Liu J, Cao J, Jia L, et al. (2026) Impacts of host genetics on gut microbiome composition in Alzheimer's disease. Microbiome. doi:10.1186/s40168-026-02342-8. PMID 41782023
MAIN ANCESTRY
EAS
METAGENOME
Gut bacteria

Cole JB-32193382

GWAS
PUBMED_LINK
32193382
DESCRIPTION
GWAS of food-frequency questionnaire traits in UK Biobank; hundreds of associated loci. Summary statistics for ~143 heritable dietary measures (BOLT-LMM); large tarball via Knowledge Portal.
URL
https://www.nature.com/articles/s41467-020-15193-0 ,https://www.kp4cd.org/node/351
Main citation
Cole JB, Florez JC, Hirschhorn JN. (2020) Comprehensive genomic analysis of dietary habits in UK Biobank identifies hundreds of genetic associations. Nat Commun, 11 (1) 1467. doi:10.1038/s41467-020-15193-0. PMID 32193382
DOI
10.1038/s41467-020-15193-0
MAIN ANCESTRY
EUR

Doherty A-30531941

GWAS
PUBMED_LINK
30531941
DESCRIPTION
GWAS of UK Biobank wrist accelerometer phenotypes (91,105 participants); 14 loci. Summary statistics (unadjusted and sex/BMI-adjusted) deposited in Oxford University Research Archive.
URL
https://www.nature.com/articles/s41467-018-07743-4 ,https://ora.ox.ac.uk/objects/uuid:ff479f44-bf35-48b9-9e67-e690a2937b22
Main citation
Doherty A, Smith-Byrne K, Ferreira T, et al. (2018) GWAS identifies 14 loci for device-measured physical activity and sleep duration. Nat Commun, 9 (1) 5257. doi:10.1038/s41467-018-07743-4. PMID 30531941
DOI
10.1038/s41467-018-07743-4
MAIN ANCESTRY
EUR

German gut microbiome GWAS (ABO / FUT2)

GWAS
PUBMED_LINK
33462482
DESCRIPTION
Genome-wide association analysis in 8,956 German individuals links host variation to single-taxon and overall gut microbiome composition; replicates ABO histo-blood group and FUT2 secretor associations with Bacteroides and Faecalibacterium. Includes Mendelian randomization for IBD-relevant microbial effects.
URL
https://www.nature.com/articles/s41588-020-00747-1
Main citation
Rühlemann MC, Hermes BM, Bang C, et al. (2021) Genome-wide association study in 8,956 German individuals identifies influence of ABO histo-blood groups on gut microbiome. Nat Genet, 53:147–155. doi:10.1038/s41588-020-00747-1. PMID 33462482
MAIN ANCESTRY
EUR
METAGENOME
Gut bacteria

Gut microbial structural variation GWAS (Dutch meta-analysis)

GWAS
PUBMED_LINK
38172637
DESCRIPTION
Meta-analysis of genome-wide associations between host genotypes and gut microbial structural variants (deletion dSVs and variable vSVs) from metagenomes in four Dutch cohorts (n = 9,015), with replication in Tanzania (n = 279). After frequency filters, GWAS used 3,552 common SV phenotypes aggregated across 49 bacterial species with sufficient metagenomic coverage. Highlights ABO/FUT2–GalNAc pathways and Faecalibacterium prausnitzii SVs.
URL
https://www.nature.com/articles/s41586-023-06893-w
Main citation
Zhernakova DV, Wang D, Liu L, et al. (2024) Host genetic regulation of human gut microbial structural variation. Nature, 625:813–821. doi:10.1038/s41586-023-06893-w. PMID 38172637
MAIN ANCESTRY
EUR
METAGENOME
Gut bacteria (metagenomic SVs)

Host genetics & gut microbiome (Nat Genet perspective)

GWAS
PUBMED_LINK
35115688
DESCRIPTION
Perspective on microbial GWAS (mbGWAS): state of the art, heterogeneity of microbiome assays, power, and directions for genetic analysis of the gut microbiome.
URL
https://www.nature.com/articles/s41588-021-00983-z
Main citation
Sanna S, Kurilshikov A, van der Graaf A, et al. (2022) Challenges and future directions for studying effects of host genetics on the gut microbiome. Nat Genet, 54:100–106. doi:10.1038/s41588-021-00983-z. PMID 35115688

Japanese gut microbiome–host genetics (Cell Reports)

GWAS
PUBMED_LINK
37935197
DESCRIPTION
Japanese shotgun metagenome and SNP-array GWAS (7,213,469 post-imputation variants, MAF >1%, Minimac4 Rsq >0.7): microbial traits included species, gene orthologs, and pathways. Two study waves—dataset 1 (gut microbiome, plasma metabolome, genotype): n = 300; dataset 2 (gut microbiome, genotype): n = 224—were combined by fixed-effect meta-analysis (total n = 524; Figure S1 / Table S1). Species GWAS used 423 microbial species (study-wide threshold 5×10⁻⁸/423); headline hits include PDE1C–Bacteroides intestinalis and TGIF2 / TGIF2-RAB5IF–B. acidifaciens, plus microbial gene ortholog associations with blood group A conditioned on East Asian FUT2 secretor status. Metabolome arm in dataset 1 supports microbiome–plasma integration. Public data resource.
URL
https://www.cell.com/cell-reports/fulltext/S2211-1247%2823%2901336-0
Main citation
Tomofuji Y, Kishikawa T, Sonehara K, et al. (2023) Analysis of gut microbiome, host genetics, and plasma metabolites reveals gut microbiome-host interactions in the Japanese population. Cell Rep, 42(11):113324. doi:10.1016/j.celrep.2023.113324. PMID 37935197
MAIN ANCESTRY
EAS
METAGENOME
Gut (shotgun metagenome)

Matoba N-31959922

GWAS
PUBMED_LINK
31959922
DESCRIPTION
Genome-wide association studies for 13 dietary habits in Japanese individuals (n = 58,610–165,084) from BioBank Japan; nine genome-wide significant loci reported.
URL
https://www.nature.com/articles/s41562-019-0805-1
Main citation
Matoba N, Akiyama M, Ishigaki K, et al. (2020) GWAS of 165,084 Japanese individuals identified nine loci associated with dietary habits. Nat Hum Behav, 4 (3) 308-316. doi:10.1038/s41562-019-0805-1. PMID 31959922
DOI
10.1038/s41562-019-0805-1
MAIN ANCESTRY
EAS

Merino J-34426670

GWAS
PUBMED_LINK
34426670
DESCRIPTION
Multi-trait GWAS meta-analysis of dietary intake combining UK Biobank and CHARGE; functional and brain-related follow-up. Summary statistics via UK Biobank showcase, dbGaP, and Type 2 Diabetes Knowledge Portal (see paper data availability).
URL
https://www.nature.com/articles/s41562-021-01182-w ,https://www.kp4cd.org/dataset_downloads/t2d
Main citation
Merino J, Dashti HS, Sarnowski C, et al. (2022) Genetic analysis of dietary intake identifies new loci and functional links with metabolic traits. Nat Hum Behav, 6 (1) 155-163. doi:10.1038/s41562-021-01182-w. PMID 34426670
DOI
10.1038/s41562-021-01182-w
MAIN ANCESTRY
EUR

MiBioGen gut microbiome GWAS (multi-cohort)

GWAS
PUBMED_LINK
33462485
DESCRIPTION
MiBioGen consortium meta-analysis: genome-wide host genotypes with 16S fecal microbiome data across 24 cohorts (n = 18,340). N_MICROBES = 410 genus-level groups in the MiBioGen framework (paper: nine genera detected in >95% of samples). Identifies 31 genome-wide significant loci affecting microbial taxa (e.g. lactase LCT, ABO, fucosyltransferase cluster).
URL
https://pmc.ncbi.nlm.nih.gov/articles/PMC8515199/ ,https://www.nature.com/articles/s41588-020-00763-1
Main citation
Kurilshikov A, Medina-Gomez C, Bacigalupe R, et al. (2021) Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat Genet, 53:156–165. doi:10.1038/s41588-020-00763-1. PMID 33462485
MAIN ANCESTRY
Multi-ancestry
METAGENOME
Gut bacteria (16S)

Namba S-41606330

GWAS
PUBMED_LINK
41606330
DESCRIPTION
Cross-population atlas of gene–environment interactions (discovery n ≈ 440k EUR + Japanese; replication n ≈ 540k). Genome-wide G×E summary statistics released at NBDC Human Database (hum0197.v26.374-traits.v1) and GWAS Catalog (GCST90681837–GCST90690020).
URL
https://www.nature.com/articles/s41586-025-10054-6 ,https://humandbs.dbcls.jp/en ,https://www.ebi.ac.uk/gwas
Main citation
Namba S, Sonehara K, Koyanagi YN, et al. (2026) A cross-population compendium of gene-environment interactions. Nature, 651:688-697. doi:10.1038/s41586-025-10054-6. PMID 41606330
DOI
10.1038/s41586-025-10054-6
MAIN ANCESTRY
Multi-ancestry

Oral microbiota GWAS (ADDITION-PRO)

GWAS
PUBMED_LINK
38926497
DESCRIPTION
16S rRNA amplicon-based GWAS of salivary microbiota traits in unrelated Danish adults from the ADDITION-PRO cohort (n = 610). Identifies host SNPs associated with oral bacterial abundance and beta diversity; several variants link to metabolic traits. Oral (not gut) microbiota — listed here for host–microbiome genetics.
URL
https://www.nature.com/articles/s41598-024-65538-8
Main citation
Stankevic E, Kern T, Borisevich D, et al. (2024) Genome-wide association study identifies host genetic variants influencing oral microbiota diversity and metabolic health. Sci Rep, 14:14738. doi:10.1038/s41598-024-65538-8. PMID 38926497
MAIN ANCESTRY
EUR
METAGENOME
Oral (salivary)

Swedish gut metagenome GWAS (SCAPIS / HUNT replication)

GWAS
PUBMED_LINK
41688638
DESCRIPTION
Harmonized shotgun metagenome GWAS in 16,017 adults from four Swedish studies, with replication in 12,652 participants from the Norwegian HUNT study. Reports loci including OR51E1–OR51E2 (microbial richness), LCT, ABO, FUT2, MUC12, CORO7–HMOX2, SLC5A11, FOXP1, FUT3–FUT6, and species-level associations.
URL
https://www.nature.com/articles/s41588-026-02512-2
Main citation
Dekkers KF, Pertiwi K, Baldanzi G, et al. (2026) Genome-wide association analyses highlight the role of the intestinal molecular environment in human gut microbiota variation. Nat Genet, 58:540–549. doi:10.1038/s41588-026-02512-2. PMID 41688638
MAIN ANCESTRY
EUR
METAGENOME
Gut (shotgun metagenome)

UK Biobank family-based GWAS (Guan et al.)

GWAS UK Biobank
PUBMED_LINK
40065166
DESCRIPTION
Family-based GWAS (FGWAS) summary statistics estimating direct genetic effects (DGEs) with unified, robust, Young et al. Mendelian-imputation, and sib-difference estimators implemented in snipar. Unified estimator adds singletons via linear parental imputation (largest DGE effective sample size in homogeneous samples); robust estimator avoids allele-frequency imputation bias under strong structure or admixture. UK Biobank application (n up to ~408k unified White British; ~52k robust with ≥1 genotyped first-degree relative).
URL
https://www.nature.com/articles/s41588-025-02118-0 ,https://thessgac.com/ ,https://github.com/AlexTISYoung/snipar ,https://doi.org/10.5281/zenodo.14270274
Main citation
Guan J, Tan T, Nehzati SM, Bennett M, ...&, Young AS. (2025) Family-based genome-wide association study designs for increased power and robustness. Nat Genet, 57 (4) 1044-1052. doi:10.1038/s41588-025-02118-0. PMID 40065166
DOI
10.1038/s41588-025-02118-0
RELATED_BIOBANK
UK Biobank
MAIN ANCESTRY
EUR (unified / Young et al. in White British); multi-ancestry robust estimator

UK Biobank sleep traits (self-report)

GWAS
PUBMED_LINK
30846698
DESCRIPTION
Self-reported sleep-related GWAS summary statistics hosted on the Knowledge Portal (downloads per trait with READMEs). Landmark papers include Dashti et al. on sleep duration and daytime napping, Wang et al. on daytime sleepiness, and Lane et al. on chronotype and sleep disturbance (see portal publication list).
URL
https://www.kp4cd.org/node/235 ,https://www.nature.com/articles/s41467-019-08917-4
Main citation
Dashti HS, Jones SE, Wood AR, et al. (2019) Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nat Commun, 10 (1) 1100. doi:10.1038/s41467-019-08917-4. PMID 30846698
DOI
10.1038/s41467-019-08917-4
MAIN ANCESTRY
EUR

Wang Z-36071172

GWAS
PUBMED_LINK
36071172
DESCRIPTION
Genome-wide association meta-analysis of physical activity and sedentary behaviour traits across ancestries; 99 associated loci. Supplementary tables and open materials via Nature Genetics and WashU Digital Commons.
URL
https://www.nature.com/articles/s41588-022-01165-1 ,https://digitalcommons.wustl.edu/oa_4/339/
Main citation
Wang Z, Emmerich A, Pillon NJ, et al. (2022) Genome-wide association analyses of physical activity and sedentary behavior provide insights into underlying mechanisms and roles in disease prevention. Nat Genet, 54 (9) 1332-1344. doi:10.1038/s41588-022-01165-1. PMID 36071172
DOI
10.1038/s41588-022-01165-1
MAIN ANCESTRY
Multi-ancestry