Skip to content

Nat Genet

Catalog entries using this tag (links open the entry card on its page):

Entries

Causal ML for scGenomics (Causal ML sc)

AI GWAS Causal ML Single Cell Machine Learning Nat Genet
PUBMED_LINK
40164735
FULL NAME
Causal Machine Learning for Single-Cell Genomics
DESCRIPTION
A Perspective from Nature Genetics delineating the application of causal machine learning to single-cell genomics. Discusses causal models, challenges in inferring causative roles of genes from single-cell omics data combined with perturbation screens, and the potential for integrating causal ML with GWAS to understand disease mechanisms at single-cell resolution.
TITLE
Causal machine learning for single-cell genomics.
ABSTRACT
Advances in single-cell '-omics' allow unprecedented insights into the transcriptional profiles of individual cells and, when combined with large-scale perturbation screens, enable measuring of the effect of targeted perturbations on the whole transcriptome. In this Perspective, we delineate the application of causal machine learning to single-cell genomics and its associated challenges, presenting the causal model most commonly applied to single-cell biology.
DOI
10.1038/s41588-025-02124-2

DL Thoracic Aorta GWAS (DL Aorta GWAS)

AI GWAS Deep Learning Medical Imaging UK Biobank Nat Genet
PUBMED_LINK
34837083
FULL NAME
Deep Learning Enables Genetic Analysis of the Human Thoracic Aorta
DESCRIPTION
Applied a pretrained CNN (transferred from natural image recognition, e.g. ResNet/Inception-like architecture) to 4.6 million cardiac MRI images from UK Biobank, trained on only 116 manually annotated samples to regress ascending and descending thoracic aorta dimensions. GWAS identified 82 ascending and 47 descending aorta loci. Demonstrates transfer learning from natural images to medical imaging for rapid biobank-scale phenotyping.
KEYWORDS
deep learning, CNN, ImageNet transfer learning, cardiac MRI, thoracic aorta, image regression, UK Biobank
TITLE
Deep learning enables genetic analysis of the human thoracic aorta.
ABSTRACT
Enlargement or aneurysm of the aorta predisposes to dissection, an important cause of sudden death. We trained a deep learning model to evaluate the dimensions of the ascending and descending thoracic aorta in 4.6 million cardiac magnetic resonance images from the UK Biobank. We then conducted genome-wide association studies in 39,688 individuals, identifying 82 loci associated with ascending and 47 with descending thoracic aortic diameter. Transcriptome-wide analyses, rare-variant burden tests and human aortic single nucleus RNA sequencing prioritized genes including FBN1 and MFAP5.
DOI
10.1038/s41588-021-00962-4

MILTON

AI GWAS Machine Learning Disease Prediction UK Biobank Multi-omics Nat Genet
PUBMED_LINK
39261665
FULL NAME
MILTON - Machine Learning with Phenotype Associations for Disease Prediction
DESCRIPTION
MILTON is an ensemble machine learning framework that utilizes biomarkers and multi-omics data to predict 3,213 diseases in the UK Biobank. It predicts incident disease cases undiagnosed at time of recruitment and demonstrates utility in augmenting genetic association discovery by empowering case-control GWAS with predicted phenotypes. Published in Nature Genetics.
TITLE
Disease prediction with multi-omics and biomarkers empowers case-control genetic discoveries in the UK Biobank.
ABSTRACT
The emergence of biobank-level datasets offers new opportunities to discover novel biomarkers and develop predictive algorithms for human disease. Here, we present an ensemble machine-learning framework (machine learning with phenotype associations, MILTON) utilizing a range of biomarkers to predict 3,213 diseases in the UK Biobank. MILTON predicts incident disease cases undiagnosed at time of recruitment, largely outperforming available polygenic risk scores, and augments genetic association discovery.
DOI
10.1038/s41588-024-01898-1

PoPS

AI GWAS Gene Prioritization Machine Learning Polygenic Nat Genet
PUBMED_LINK
37443254
FULL NAME
PoPS - Polygenic Priority Score for Gene Prioritization
DESCRIPTION
PoPS (Polygenic Priority Score) is a method that learns trait-relevant gene features, such as cell-type-specific expression, to prioritize genes at GWAS loci. It leverages polygenic enrichments across multiple gene features to predict causal genes underlying complex traits and diseases. Published in Nature Genetics.
URL
https://github.com/FinucaneLab/pops
TITLE
Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases.
ABSTRACT
Genome-wide association studies (GWASs) are a valuable tool for understanding the biology of complex human traits and diseases, but associated variants rarely point directly to causal genes. In the present study, we introduce a new method, polygenic priority score (PoPS), that learns trait-relevant gene features, such as cell-type-specific expression, to prioritize genes at GWAS loci. PoPS and the closest gene individually outperform other gene prioritization methods.
DOI
10.1038/s41588-023-01443-6

Quickdraws

AI GWAS Variational Inference Mixed Model GPU Nat Genet
PUBMED_LINK
39789286
FULL NAME
Quickdraws - Scalable Variational Inference for Mixed-Model GWAS
DESCRIPTION
Quickdraws is a method that increases association power in quantitative and binary traits for GWAS without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference, and graphics processing unit acceleration. Published in Nature Genetics.
TITLE
A scalable variational inference approach for increased mixed-model association power.
ABSTRACT
The rapid growth of modern biobanks is creating new opportunities for large-scale genome-wide association studies (GWASs) and the analysis of complex traits. However, performing GWASs on millions of samples often leads to trade-offs between computational efficiency and statistical power, reducing the benefits of large-scale data collection efforts. We developed Quickdraws, a method that increases association power in quantitative and binary traits without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference and graphics processing unit acceleration.
DOI
10.1038/s41588-024-02044-7

SynSurr

AI GWAS Machine Learning Phenotype Imputation Synthetic Surrogates Nat Genet
PUBMED_LINK
38872030
FULL NAME
SynSurr - Synthetic Surrogates for GWAS of Missing Phenotypes
DESCRIPTION
SynSurr (Synthetic Surrogate analysis) is a method that makes GWAS on imputed phenotypes robust to imputation errors. Rather than replacing missing values, SynSurr jointly analyzes the observed and imputed data to provide calibrated association statistics, improving power for genome-wide association studies of partially missing phenotypes in population biobanks. Published in Nature Genetics.
TITLE
Synthetic surrogates improve power for genome-wide association studies of partially missing phenotypes in population biobanks.
ABSTRACT
Within population biobanks, incomplete measurement of certain traits limits the power for genetic discovery. Machine learning is increasingly used to impute the missing values from the available data. However, performing GWAS on imputed traits can introduce spurious associations. Here we introduce SynSurr analysis, which makes GWAS on imputed phenotypes robust to imputation errors by jointly analyzing observed and imputed data.
DOI
10.1038/s41588-024-01793-9