GWAS
Catalog entries using this tag (links open the entry card on its page):
- IMRP-GxE — GWAS Tools
- PP-GWAS — GWAS Tools
- REGENIE — GWAS Tools
- seismic — GWAS Tools
- Baseline cohort & array genotyping — Projects
- AD gut microbiome host genetics (Chinese cohort) — Summary statistics
- Cole JB-32193382 — Summary statistics
- Doherty A-30531941 — Summary statistics
- German gut microbiome GWAS (ABO / FUT2) — Summary statistics
- Gut microbial structural variation GWAS (Dutch meta-analysis) — Summary statistics
- Host genetics & gut microbiome (Nat Genet perspective) — Summary statistics
- Japanese gut microbiome–host genetics (Cell Reports) — Summary statistics
- Matoba N-31959922 — Summary statistics
- Merino J-34426670 — Summary statistics
- MiBioGen gut microbiome GWAS (multi-cohort) — Summary statistics
- Namba S-41606330 — Summary statistics
- Oral microbiota GWAS (ADDITION-PRO) — Summary statistics
- Swedish gut metagenome GWAS (SCAPIS / HUNT replication) — Summary statistics
- UK Biobank family-based GWAS (Guan et al.) — Summary statistics
- UK Biobank sleep traits (self-report) — Summary statistics
- Wang Z-36071172 — Summary statistics
Entries
IMRP-GxE
PUBMED_LINK
FULL NAME
Mendelian randomization-based genome-wide screening for gene–environment interactions
DESCRIPTION
Screens for combined gene–environment interaction and environmental mediation by testing departure of marginal GWAS effects from GWIS main effects using an MR-style statistic (IMRP), applicable to summary statistics from separate GWAS and interaction meta-analyses.
URL
KEYWORDS
GWAS, GWIS, G×E, Mendelian randomization, IMRP, summary statistics
TITLE
An approach to identify gene-environment interactions and reveal new biological insight in complex traits.
Main citation
Zhu X, Yang Y, Lorincz-Comi N, Li G, Bentley AR, de Vries PS, ...&, Aschard H. (2024) An approach to identify gene-environment interactions and reveal new biological insight in complex traits. Nat Commun, 15 (1) 3385. doi:10.1038/s41467-024-47806-3. PMID 38649715
ABSTRACT
There is a long-standing debate about the magnitude of the contribution of gene-environment interactions to phenotypic variations of complex traits owing to the low statistical power and few reported interactions to date. To address this issue, the Gene-Lifestyle Interactions Working Group within the Cohorts for Heart and Aging Research in Genetic Epidemiology Consortium has been spearheading efforts to investigate G×E in large and diverse samples through meta-analysis. Here, we present a powerful new approach to screen for interactions across the genome, an approach that shares substantial similarity to the Mendelian randomization framework. We identify and confirm 5 loci (6 independent signals) interacted with either cigarette smoking or alcohol consumption for serum lipids, and empirically demonstrate that interaction and mediation are the major contributors to genetic effect size heterogeneity across populations. The estimated lower bound of the interaction and environmentally mediated heritability is significant (P < 0.02) for low-density lipoprotein cholesterol and triglycerides in Cross-Population data. Our study improves the understanding of the genetic architecture and environmental contributions to complex traits.
DOI
10.1038/s41467-024-47806-3
ARROW_SUMMARY
Inputs: GWAS + GWIS summary stats (per-SNP β/SE; LD-pruned instruments; sample-overlap ρ if needed) → IMRP θ → T_MR-GxE (G×E + mediation)
PP-GWAS
PUBMED_LINK
DESCRIPTION
Privacy-preserving framework for multi-site GWAS on quantitative traits using a distributed linear mixed model and randomized encoding so servers never see raw genotypes or phenotypes—only obfuscated intermediates—while improving speed versus several cryptographic baselines.
URL
KEYWORDS
Privacy-preserving GWAS, multi-site, quantitative traits, federated analysis
TITLE
PP-GWAS: Privacy Preserving Multi-Site Genome-wide Association Studies.
Main citation
Swaminathan A, Hannemann A, Ünal AB, Pfeifer N, ...&, Akgün M. (2025) PP-GWAS: Privacy Preserving Multi-Site Genome-wide Association Studies. Nat Commun, 16 (1) 11030. doi:10.1038/s41467-025-66771-z. PMID 41365878
ABSTRACT
Genome-wide association studies help uncover genetic influences on complex traits and diseases. Importantly, multi-site data collaborations enhance the statistical power of these studies but pose challenges due to the sensitivity of genomic data. Existing privacy-preserving approaches to performing multi-site genome-wide association studies rely on computationally expensive cryptographic techniques, which limit applicability. To address this, we present PP-GWAS, a privacy-preserving algorithm that improves efficiency and scalability while maintaining data privacy. Our method leverages randomized encoding within a distributed framework to perform stacked ridge regression on a linear mixed model, enabling robust analysis of quantitative phenotypes. We show experimentally using real-world and synthetic data that our approach achieves twice the computational speed of comparable methods while reducing resource consumption.
DOI
10.1038/s41467-025-66771-z
REGENIE
PUBMED_LINK
DESCRIPTION
regenie is a C++ program for whole genome regression modelling of large genome-wide association studies. It is developed and supported by a team of scientists at the Regeneron Genetics Center.
URL
KEYWORDS
whole genome regression
TITLE
Computationally efficient whole-genome regression for quantitative and binary traits.
Main citation
Mbatchou J, Barnard L, Backman J, Marcketta A, ...&, Marchini J. (2021) Computationally efficient whole-genome regression for quantitative and binary traits. Nat Genet, 53 (7) 1097-1103. doi:10.1038/s41588-021-00870-7. PMID 34017140
ABSTRACT
Genome-wide association analysis of cohorts with thousands of phenotypes is computationally expensive, particularly when accounting for sample relatedness or population structure. Here we present a novel machine-learning method called REGENIE for fitting a whole-genome regression model for quantitative and binary phenotypes that is substantially faster than alternatives in multi-trait analyses while maintaining statistical efficiency. The method naturally accommodates parallel analysis of multiple phenotypes and requires only local segments of the genotype matrix to be loaded in memory, in contrast to existing alternatives, which must load genome-wide matrices into memory. This results in substantial savings in compute time and memory usage. We introduce a fast, approximate Firth logistic regression test for unbalanced case-control phenotypes. The method is ideally suited to take advantage of distributed computing frameworks. We demonstrate the accuracy and computational benefits of this approach using the UK Biobank dataset with up to 407,746 individuals.
DOI
10.1038/s41588-021-00870-7
seismic
PUBMED_LINK
FULL NAME
Single-cell Expression Integration System for Mapping genetically Implicated Cell types
DESCRIPTION
R framework that links GWAS signals to single-cell-defined cell types via a cell-type gene specificity score (expression magnitude and consistency) and regression on gene-level association statistics, with influential-gene follow-up for interpretability.
URL
KEYWORDS
GWAS, scRNA-seq, cell type, MAGMA, post-GWAS interpretation
TITLE
Disentangling associations between complex traits and cell types with seismic.
Main citation
Lai Q, Dannenfelser R, Roussarie JP, Yao V. (2025) Disentangling associations between complex traits and cell types with seismic. Nat Commun, 16 (1) 8744. doi:10.1038/s41467-025-63753-z. PMID 41034207
ABSTRACT
Integrating single-cell RNA sequencing with Genome-Wide Association Studies (GWAS) can uncover cell types involved in complex traits and disease. However, current methods often lack scalability, interpretability, and robustness. We present seismic, a framework that computes a novel specificity score capturing both expression magnitude and consistency across cell types and introduces influential gene analysis, an approach to identify genes driving each cell type-trait association. Across over 1000 cell-type characterizations at different granularities and 28 polygenic traits, seismic corroborates known associations and uncovers trait-relevant cell groups not apparent through other methodologies. In Parkinson's and Alzheimer's, seismic unveils both cell- and brain-region-specific differences in pathology. Analyzing a pathology-based Alzheimer's GWAS with seismic enables the identification of vulnerable neuron populations and molecular pathways implicated in their neurodegeneration. In general, seismic is a computationally efficient, powerful, and interpretable approach for mapping the relationships between polygenic traits and cell-type-specific expression, offering new insights into disease mechanisms.
DOI
10.1038/s41467-025-63753-z
Baseline cohort & array genotyping
STAGE_PERIOD
2006–2010+
DESCRIPTION
Recruitment of ~500,000 UK adults with questionnaire and physical measures; genome-wide array genotyping on the full cohort underpins most early GWAS and PRS applications.
URL
AD gut microbiome host genetics (Chinese cohort)
PUBMED_LINK
DESCRIPTION
Joint host whole-genome sequencing and gut microbiome profiling in 252 Chinese individuals with graded cognitive disability. Microbiome GWAS for latent enterosignature (Anaerostipes-enriched) abundance; integrates AD polygenic risk and brain cell-type expression context. Open-access report in Microbiome.
URL
Main citation
Liu J, Cao J, Jia L, et al. (2026) Impacts of host genetics on gut microbiome composition in Alzheimer's disease. Microbiome. doi:10.1186/s40168-026-02342-8. PMID 41782023
MAIN ANCESTRY
EAS
METAGENOME
Gut bacteria
Cole JB-32193382
PUBMED_LINK
DESCRIPTION
GWAS of food-frequency questionnaire traits in UK Biobank; hundreds of associated loci. Summary statistics for ~143 heritable dietary measures (BOLT-LMM); large tarball via Knowledge Portal.
URL
Main citation
Cole JB, Florez JC, Hirschhorn JN. (2020) Comprehensive genomic analysis of dietary habits in UK Biobank identifies hundreds of genetic associations. Nat Commun, 11 (1) 1467. doi:10.1038/s41467-020-15193-0. PMID 32193382
DOI
10.1038/s41467-020-15193-0
MAIN ANCESTRY
EUR
Doherty A-30531941
PUBMED_LINK
DESCRIPTION
GWAS of UK Biobank wrist accelerometer phenotypes (91,105 participants); 14 loci. Summary statistics (unadjusted and sex/BMI-adjusted) deposited in Oxford University Research Archive.
URL
Main citation
Doherty A, Smith-Byrne K, Ferreira T, et al. (2018) GWAS identifies 14 loci for device-measured physical activity and sleep duration. Nat Commun, 9 (1) 5257. doi:10.1038/s41467-018-07743-4. PMID 30531941
DOI
10.1038/s41467-018-07743-4
MAIN ANCESTRY
EUR
German gut microbiome GWAS (ABO / FUT2)
PUBMED_LINK
DESCRIPTION
Genome-wide association analysis in 8,956 German individuals links host variation to single-taxon and overall gut microbiome composition; replicates ABO histo-blood group and FUT2 secretor associations with Bacteroides and Faecalibacterium. Includes Mendelian randomization for IBD-relevant microbial effects.
URL
Main citation
Rühlemann MC, Hermes BM, Bang C, et al. (2021) Genome-wide association study in 8,956 German individuals identifies influence of ABO histo-blood groups on gut microbiome. Nat Genet, 53:147–155. doi:10.1038/s41588-020-00747-1. PMID 33462482
MAIN ANCESTRY
EUR
METAGENOME
Gut bacteria
Gut microbial structural variation GWAS (Dutch meta-analysis)
PUBMED_LINK
DESCRIPTION
Meta-analysis of genome-wide associations between host genotypes and gut microbial structural variants (deletion dSVs and variable vSVs) from metagenomes in four Dutch cohorts (n = 9,015), with replication in Tanzania (n = 279). After frequency filters, GWAS used 3,552 common SV phenotypes aggregated across 49 bacterial species with sufficient metagenomic coverage. Highlights ABO/FUT2–GalNAc pathways and Faecalibacterium prausnitzii SVs.
URL
Main citation
Zhernakova DV, Wang D, Liu L, et al. (2024) Host genetic regulation of human gut microbial structural variation. Nature, 625:813–821. doi:10.1038/s41586-023-06893-w. PMID 38172637
MAIN ANCESTRY
EUR
METAGENOME
Gut bacteria (metagenomic SVs)
Host genetics & gut microbiome (Nat Genet perspective)
PUBMED_LINK
DESCRIPTION
Perspective on microbial GWAS (mbGWAS): state of the art, heterogeneity of microbiome assays, power, and directions for genetic analysis of the gut microbiome.
URL
Main citation
Sanna S, Kurilshikov A, van der Graaf A, et al. (2022) Challenges and future directions for studying effects of host genetics on the gut microbiome. Nat Genet, 54:100–106. doi:10.1038/s41588-021-00983-z. PMID 35115688
Japanese gut microbiome–host genetics (Cell Reports)
PUBMED_LINK
DESCRIPTION
Japanese shotgun metagenome and SNP-array GWAS (7,213,469 post-imputation variants, MAF >1%, Minimac4 Rsq >0.7): microbial traits included species, gene orthologs, and pathways. Two study waves—dataset 1 (gut microbiome, plasma metabolome, genotype): n = 300; dataset 2 (gut microbiome, genotype): n = 224—were combined by fixed-effect meta-analysis (total n = 524; Figure S1 / Table S1). Species GWAS used 423 microbial species (study-wide threshold 5×10⁻⁸/423); headline hits include PDE1C–Bacteroides intestinalis and TGIF2 / TGIF2-RAB5IF–B. acidifaciens, plus microbial gene ortholog associations with blood group A conditioned on East Asian FUT2 secretor status. Metabolome arm in dataset 1 supports microbiome–plasma integration. Public data resource.
URL
Main citation
Tomofuji Y, Kishikawa T, Sonehara K, et al. (2023) Analysis of gut microbiome, host genetics, and plasma metabolites reveals gut microbiome-host interactions in the Japanese population. Cell Rep, 42(11):113324. doi:10.1016/j.celrep.2023.113324. PMID 37935197
MAIN ANCESTRY
EAS
METAGENOME
Gut (shotgun metagenome)
Matoba N-31959922
PUBMED_LINK
DESCRIPTION
Genome-wide association studies for 13 dietary habits in Japanese individuals (n = 58,610–165,084) from BioBank Japan; nine genome-wide significant loci reported.
URL
Main citation
Matoba N, Akiyama M, Ishigaki K, et al. (2020) GWAS of 165,084 Japanese individuals identified nine loci associated with dietary habits. Nat Hum Behav, 4 (3) 308-316. doi:10.1038/s41562-019-0805-1. PMID 31959922
DOI
10.1038/s41562-019-0805-1
MAIN ANCESTRY
EAS
Merino J-34426670
PUBMED_LINK
DESCRIPTION
Multi-trait GWAS meta-analysis of dietary intake combining UK Biobank and CHARGE; functional and brain-related follow-up. Summary statistics via UK Biobank showcase, dbGaP, and Type 2 Diabetes Knowledge Portal (see paper data availability).
URL
Main citation
Merino J, Dashti HS, Sarnowski C, et al. (2022) Genetic analysis of dietary intake identifies new loci and functional links with metabolic traits. Nat Hum Behav, 6 (1) 155-163. doi:10.1038/s41562-021-01182-w. PMID 34426670
DOI
10.1038/s41562-021-01182-w
MAIN ANCESTRY
EUR
MiBioGen gut microbiome GWAS (multi-cohort)
PUBMED_LINK
DESCRIPTION
MiBioGen consortium meta-analysis: genome-wide host genotypes with 16S fecal microbiome data across 24 cohorts (n = 18,340). N_MICROBES = 410 genus-level groups in the MiBioGen framework (paper: nine genera detected in >95% of samples). Identifies 31 genome-wide significant loci affecting microbial taxa (e.g. lactase LCT, ABO, fucosyltransferase cluster).
URL
Main citation
Kurilshikov A, Medina-Gomez C, Bacigalupe R, et al. (2021) Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat Genet, 53:156–165. doi:10.1038/s41588-020-00763-1. PMID 33462485
MAIN ANCESTRY
Multi-ancestry
METAGENOME
Gut bacteria (16S)
Namba S-41606330
PUBMED_LINK
DESCRIPTION
Cross-population atlas of gene–environment interactions (discovery n ≈ 440k EUR + Japanese; replication n ≈ 540k). Genome-wide G×E summary statistics released at NBDC Human Database (hum0197.v26.374-traits.v1) and GWAS Catalog (GCST90681837–GCST90690020).
URL
Main citation
Namba S, Sonehara K, Koyanagi YN, et al. (2026) A cross-population compendium of gene-environment interactions. Nature, 651:688-697. doi:10.1038/s41586-025-10054-6. PMID 41606330
DOI
10.1038/s41586-025-10054-6
MAIN ANCESTRY
Multi-ancestry
Oral microbiota GWAS (ADDITION-PRO)
PUBMED_LINK
DESCRIPTION
16S rRNA amplicon-based GWAS of salivary microbiota traits in unrelated Danish adults from the ADDITION-PRO cohort (n = 610). Identifies host SNPs associated with oral bacterial abundance and beta diversity; several variants link to metabolic traits. Oral (not gut) microbiota — listed here for host–microbiome genetics.
URL
Main citation
Stankevic E, Kern T, Borisevich D, et al. (2024) Genome-wide association study identifies host genetic variants influencing oral microbiota diversity and metabolic health. Sci Rep, 14:14738. doi:10.1038/s41598-024-65538-8. PMID 38926497
MAIN ANCESTRY
EUR
METAGENOME
Oral (salivary)
Swedish gut metagenome GWAS (SCAPIS / HUNT replication)
PUBMED_LINK
DESCRIPTION
Harmonized shotgun metagenome GWAS in 16,017 adults from four Swedish studies, with replication in 12,652 participants from the Norwegian HUNT study. Reports loci including OR51E1–OR51E2 (microbial richness), LCT, ABO, FUT2, MUC12, CORO7–HMOX2, SLC5A11, FOXP1, FUT3–FUT6, and species-level associations.
URL
Main citation
Dekkers KF, Pertiwi K, Baldanzi G, et al. (2026) Genome-wide association analyses highlight the role of the intestinal molecular environment in human gut microbiota variation. Nat Genet, 58:540–549. doi:10.1038/s41588-026-02512-2. PMID 41688638
MAIN ANCESTRY
EUR
METAGENOME
Gut (shotgun metagenome)
UK Biobank family-based GWAS (Guan et al.)
PUBMED_LINK
DESCRIPTION
Family-based GWAS (FGWAS) summary statistics estimating direct genetic effects (DGEs) with unified, robust, Young et al. Mendelian-imputation, and sib-difference estimators implemented in snipar. Unified estimator adds singletons via linear parental imputation (largest DGE effective sample size in homogeneous samples); robust estimator avoids allele-frequency imputation bias under strong structure or admixture. UK Biobank application (n up to ~408k unified White British; ~52k robust with ≥1 genotyped first-degree relative).
URL
Main citation
Guan J, Tan T, Nehzati SM, Bennett M, ...&, Young AS. (2025) Family-based genome-wide association study designs for increased power and robustness. Nat Genet, 57 (4) 1044-1052. doi:10.1038/s41588-025-02118-0. PMID 40065166
DOI
10.1038/s41588-025-02118-0
RELATED_BIOBANK
MAIN ANCESTRY
EUR (unified / Young et al. in White British); multi-ancestry robust estimator
UK Biobank sleep traits (self-report)
PUBMED_LINK
DESCRIPTION
Self-reported sleep-related GWAS summary statistics hosted on the Knowledge Portal (downloads per trait with READMEs). Landmark papers include Dashti et al. on sleep duration and daytime napping, Wang et al. on daytime sleepiness, and Lane et al. on chronotype and sleep disturbance (see portal publication list).
URL
Main citation
Dashti HS, Jones SE, Wood AR, et al. (2019) Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nat Commun, 10 (1) 1100. doi:10.1038/s41467-019-08917-4. PMID 30846698
DOI
10.1038/s41467-019-08917-4
MAIN ANCESTRY
EUR
Wang Z-36071172
PUBMED_LINK
DESCRIPTION
Genome-wide association meta-analysis of physical activity and sedentary behaviour traits across ancestries; 99 associated loci. Supplementary tables and open materials via Nature Genetics and WashU Digital Commons.
URL
Main citation
Wang Z, Emmerich A, Pillon NJ, et al. (2022) Genome-wide association analyses of physical activity and sedentary behavior provide insights into underlying mechanisms and roles in disease prevention. Nat Genet, 54 (9) 1332-1344. doi:10.1038/s41588-022-01165-1. PMID 36071172
DOI
10.1038/s41588-022-01165-1
MAIN ANCESTRY
Multi-ancestry