Skip to content

Deep Learning

Catalog entries using this tag (links open the entry card on its page):

Entries

Deep EHR

AI Clinical EHR Deep Learning Google Research Digital Medicine
PUBMED_LINK
31304366
FULL NAME
Scalable and Accurate Deep Learning with Electronic Health Records
DESCRIPTION
Pioneering deep learning framework for EHR data by Google Research, using Fast Healthcare Interoperability Resources (FHIR) format to represent patients' raw EHR records. Trained on 216,221 patients across 2 US medical centers with 46.8 billion data points including clinical notes. Achieved AUROC 0.93-0.94 for in-hospital mortality, 0.75-0.76 for 30-day readmission, and 0.90 for final discharge diagnoses, outperforming traditional clinical predictive models. 2,800+ citations, widely considered the landmark paper for deep learning on EHR data.
TITLE
Scalable and accurate deep learning with electronic health records.
Main citation
Rajkomar A, Oren E, Chen K, Dai AM, Hajaj N, Hardt M, Liu PJ, Liu X, Marcus J, Sun M, Sundberg P, Yee H, Zhang K, Zhang Y, Flores G, Duggan GE, Irvine J, Le Q, Litsch K, Mossin A, Tansuwan J, Wang D, Wexler J, Wilson J, Ludwig D, Volchenboum SL, Chou K, Pearson M, Madabushi S, Shah NH, Butte AJ, Howell MD, Cui C, Corrado GS, Dean J. (2018) Scalable and accurate deep learning with electronic health records. npj Digital Medicine, 1:18. doi:10.1038/s41746-018-0029-1. PMID 31304366
ABSTRACT
Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. We propose a representation of patients' entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. Deep learning models achieved high accuracy for predicting in-hospital mortality (AUROC 0.93-0.94), 30-day unplanned readmission (AUROC 0.75-0.76), prolonged length of stay (AUROC 0.85-0.86), and all of a patient's final discharge diagnoses (AUROC 0.90).
DOI
10.1038/s41746-018-0029-1

DeepNull

AI GWAS Deep Learning Covariate Adjustment Statistical Power Nat Commun
PUBMED_LINK
35017556
FULL NAME
DeepNull - Deep Learning for Non-linear Covariate Adjustment in GWAS
DESCRIPTION
DeepNull is a method that identifies and adjusts for non-linear and interactive covariate effects in GWAS using a deep neural network. It maintains tight control of type I error while increasing statistical power by up to 20% in the presence of non-linear covariate effects. Published in Nature Communications.
TITLE
DeepNull models non-linear covariate effects to improve phenotypic prediction and association power.
ABSTRACT
Genome-wide association studies (GWASs) examine the association between genotype and phenotype while adjusting for a set of covariates. Although the covariates may have non-linear or interactive effects, due to the challenge of specifying the model, GWAS often neglect such terms. Here we introduce DeepNull, a method that identifies and adjusts for non-linear and interactive covariate effects using a deep neural network, maintaining tight control of the type I error while increasing statistical power by up to 20%.
DOI
10.1038/s41467-021-27930-0

DL for PRS Survey (DL PRS Survey)

AI GWAS Polygenic Risk Score Deep Learning Survey Review Brief Bioinform
PUBMED_LINK
40802796
FULL NAME
A Survey on Deep Learning for Polygenic Risk Scores
DESCRIPTION
A comprehensive survey of deep learning approaches for polygenic risk scores (PRS). Reviews how neural networks can model non-linear relationships between genetic variants and disease risk, going beyond traditional linear PRS methods, and assesses their performance across different traits and architectures. Published in Briefings in Bioinformatics.
TITLE
A survey on deep learning for polygenic risk scores.
ABSTRACT
Polygenic risk scores (PRS) combine the effects of multiple genetic variants to predict an individual's genetic predisposition to a disease. PRS typically rely on linear models, which assume that all genetic variants act independently. There is growing interest in applying deep learning neural networks to model PRS given their ability to model non-linear relationships. We conducted a survey of the literature to investigate how neural networks are being applied to PRS.
DOI
10.1093/bib/bbaf373

DL Thoracic Aorta GWAS (DL Aorta GWAS)

AI GWAS Deep Learning Medical Imaging UK Biobank Nat Genet
PUBMED_LINK
34837083
FULL NAME
Deep Learning Enables Genetic Analysis of the Human Thoracic Aorta
DESCRIPTION
Applied a pretrained CNN (transferred from natural image recognition, e.g. ResNet/Inception-like architecture) to 4.6 million cardiac MRI images from UK Biobank, trained on only 116 manually annotated samples to regress ascending and descending thoracic aorta dimensions. GWAS identified 82 ascending and 47 descending aorta loci. Demonstrates transfer learning from natural images to medical imaging for rapid biobank-scale phenotyping.
KEYWORDS
deep learning, CNN, ImageNet transfer learning, cardiac MRI, thoracic aorta, image regression, UK Biobank
TITLE
Deep learning enables genetic analysis of the human thoracic aorta.
ABSTRACT
Enlargement or aneurysm of the aorta predisposes to dissection, an important cause of sudden death. We trained a deep learning model to evaluate the dimensions of the ascending and descending thoracic aorta in 4.6 million cardiac magnetic resonance images from the UK Biobank. We then conducted genome-wide association studies in 39,688 individuals, identifying 82 loci associated with ascending and 47 with descending thoracic aortic diameter. Transcriptome-wide analyses, rare-variant burden tests and human aortic single nucleus RNA sequencing prioritized genes including FBN1 and MFAP5.
DOI
10.1038/s41588-021-00962-4

iGWAS

AI GWAS Imaging Deep Learning Self-Supervised Learning Retinal Fundus Phenotyping Contrastive Learning
PUBMED_LINK
38728357
FULL NAME
Image-Based Genome-Wide Association of Self-Supervised Deep Phenotyping of Retina Fundus Images
DESCRIPTION
iGWAS uses self-supervised contrastive learning (SimCLR-style framework with a CNN encoder backbone) to extract a 128-dimensional phenotype vector directly from retinal fundus images without any manual labels. Trained on 40,000 EyePACS images via instance discrimination, then applied to 130,329 UK Biobank fundus images. GWAS on these 128 learned phenotypes identified 14 genome-wide significant loci. First demonstration of unsupervised deep phenotyping for image-based GWAS — discovering genetic associations without predefined human annotations.
KEYWORDS
self-supervised contrastive learning, SimCLR, CNN, retinal fundus, deep phenotyping, image-based GWAS, UK Biobank
TITLE
iGWAS: Image-Based Genome-Wide Association of Self-Supervised Deep Phenotyping of Retina Fundus Images.
Main citation
Xie Z, Zhang T, Kim S, Sun J, Forouzandeh P, Chen R, Zhi D. (2024) iGWAS: Image-Based Genome-Wide Association of Self-Supervised Deep Phenotyping of Retina Fundus Images. PLOS Genetics, 20(5):e1011273. doi:10.1371/journal.pgen.1011273. PMID 38728357
ABSTRACT
Existing imaging genetics studies have been mostly limited in scope by using imaging-derived phenotypes defined by human experts. Here, leveraging new breakthroughs in self-supervised deep representation learning, we propose a new approach, image-based genome-wide association study (iGWAS), for identifying genetic factors associated with phenotypes discovered from medical images using contrastive learning. Using retinal fundus photos, our model extracts a 128-dimensional vector representing features of the retina as phenotypes. We identified 14 loci with genome-wide significance.
DOI
10.1371/journal.pgen.1011273

InsightGWAS (Migraine) (InsightGWAS)

AI GWAS Transformer Deep Learning Migraine Transfer Learning Nat Commun
PUBMED_LINK
41372126
FULL NAME
InsightGWAS - Transformer-based Deep Learning Enhances Migraine GWAS
DESCRIPTION
InsightGWAS is a Transformer-based model that enhances genetic discovery for migraine GWAS by integrating functional annotations and leveraging transfer learning from GWAS datasets of major depressive disorder. It identified 293 previously unreported loci from 53,109 cases and 230,876 controls. Published in Nature Communications.
TITLE
Transformer-based deep learning enhances discovery in migraine GWAS.
ABSTRACT
Migraine is a complex neurological disorder with substantial heritability, yet genome-wide association studies (GWAS) have explained only a fraction of its genetic component. We developed InsightGWAS, a Transformer-based model, to enhance genetic discovery for migraine by integrating functional annotations and leveraging transfer learning from GWAS datasets of major depressive disorder (MDD), identifying 293 previously unreported loci.
DOI
10.1038/s41467-025-65991-7

Khurshid S (DL LV Mass GWAS)

AI GWAS Imaging Deep Learning Cardiac MRI Left Ventricular Mass UK Biobank
PUBMED_LINK
36944631
FULL NAME
Clinical and Genetic Associations of Deep Learning-Derived Cardiac Magnetic Resonance-Based Left Ventricular Mass
DESCRIPTION
Applied a CNN-based segmentation model (U-Net style architecture) to automatically segment left ventricular myocardium from cardiac MRI in 43,230 UK Biobank participants. The segmented contours were used to compute left ventricular mass indexed to body surface area (LVMI), enabling GWAS that identified 12 associations (11 novel) implicating genes associated with cardiac contractility and cardiomyopathy. The LVMI polygenic risk score validated in independent Mass General Brigham cohort.
KEYWORDS
deep learning, cardiac MRI segmentation, U-Net, left ventricular mass, CNN, cardiomyopathy, UK Biobank
TITLE
Clinical and genetic associations of deep learning-derived cardiac magnetic resonance-based left ventricular mass.
Main citation
Khurshid S, Lazarte J, Pirruccello JP, ...&, Lubitz SA. (2023) Clinical and genetic associations of deep learning-derived cardiac magnetic resonance-based left ventricular mass. Nat Commun, 14 (1) 1558. doi:10.1038/s41467-023-37173-w. PMID 36944631
ABSTRACT
Left ventricular mass is a risk marker for cardiovascular events, and may indicate an underlying cardiomyopathy. Cardiac magnetic resonance is the gold-standard for left ventricular mass estimation, but is challenging to obtain at scale. Here, we use deep learning to enable genome-wide association study of cardiac magnetic resonance-derived left ventricular mass indexed to body surface area within 43,230 UK Biobank participants. We identify 12 genome-wide associations (1 known at TTN and 11 novel for left ventricular mass).
DOI
10.1038/s41467-023-37173-w

Liu Y (DL Organ MRI GWAS)

AI GWAS Imaging Deep Learning Abdominal MRI Organ Traits UK Biobank
PUBMED_LINK
34128465
FULL NAME
Genetic Architecture of 11 Organ Traits Derived from Abdominal MRI Using Deep Learning
DESCRIPTION
Applied a U-Net-based CNN segmentation pipeline to over 38,000 abdominal MRI scans from UK Biobank. The deep learning model automatically segmented 7 organs/tissues (liver, pancreas, kidneys, spleen, lungs, visceral adipose tissue, subcutaneous adipose tissue) and quantified their volume, fat content (via signal intensity), and iron content (via T2* mapping). GWAS on these 11 DL-derived traits identified 93 independent genome-wide significant associations (heritability 8-44%), including 4 novel liver trait associations.
KEYWORDS
deep learning, abdominal MRI segmentation, U-Net, organ volume quantification, liver fat, pancreas iron, UK Biobank
TITLE
Genetic architecture of 11 organ traits derived from abdominal MRI using deep learning.
Main citation
Liu Y, Basty N, Whitcher B, Bell JD, ...&, Cule M. (2021) Genetic architecture of 11 organ traits derived from abdominal MRI using deep learning. Elife, 10. doi:10.7554/eLife.65554. PMID 34128465
ABSTRACT
Cardiometabolic diseases are an increasing global health burden. While socioeconomic, environmental, behavioural, and genetic risk factors have been identified, a better understanding of the underlying mechanisms is required to develop more effective interventions. Magnetic resonance imaging (MRI) has been used to assess organ health, but biobank-scale studies are still in their infancy. Using over 38,000 abdominal MRI scans in the UK Biobank, we used deep learning to quantify volume, fat, and iron in seven organs and tissues, and demonstrate that imaging-derived phenotypes reflect health status. We identify 93 independent genome-wide significant associations.
DOI
10.7554/eLife.65554

Ning C (DL LVRWT GWAS)

AI GWAS Imaging Deep Learning Cardiac MRI Left Ventricular Wall Hypertrophic Cardiomyopathy UK Biobank
PUBMED_LINK
38036550
FULL NAME
Genome-Wide Association Analysis of Left Ventricular Imaging-Derived Phenotypes Identifies 72 Risk Loci
DESCRIPTION
Built a CNN-based deep learning algorithm for automated segmentation of left ventricular myocardium from cardiac MRI, enabling precise calculation of 12 regional wall thickness (LVRWT) measurements in 42,194 UK Biobank participants. GWAS of these 12 CNN-derived LVRWT traits identified 72 significant genetic loci involved in heart development and contraction pathways. Mendelian randomization confirmed causal relationships with hypertrophic cardiomyopathy. The PRS of inferoseptal LVRWT enabled identification of high-risk individuals.
KEYWORDS
deep learning, cardiac MRI, CNN, left ventricular wall thickness segmentation, hypertrophic cardiomyopathy, UK Biobank
TITLE
Genome-wide association analysis of left ventricular imaging-derived phenotypes identifies 72 risk loci and yields genetic insights into hypertrophic cardiomyopathy.
Main citation
Ning C, Fan L, Jin M, ...&, Miao X. (2023) Genome-wide association analysis of left ventricular imaging-derived phenotypes identifies 72 risk loci and yields genetic insights into hypertrophic cardiomyopathy. Nat Commun, 14 (1) 7900. doi:10.1038/s41467-023-43771-5. PMID 38036550
ABSTRACT
Left ventricular regional wall thickness (LVRWT) is an independent predictor of morbidity and mortality in cardiovascular diseases (CVDs). To identify specific genetic influences on individual LVRWT, we established a novel deep learning algorithm to calculate 12 LVRWTs accurately in 42,194 individuals from the UK Biobank with cardiac magnetic resonance (CMR) imaging. Genome-wide association studies of CMR-derived 12 LVRWTs identified 72 significant genetic loci associated with at least one LVRWT phenotype.
DOI
10.1038/s41467-023-43771-5

transferGWAS

AI GWAS Imaging Transfer Learning Deep Learning Retinal Fundus Representation Learning
PUBMED_LINK
35640976
FULL NAME
transferGWAS: GWAS of Images Using Deep Transfer Learning
DESCRIPTION
transferGWAS performs GWAS directly on full medical images using deep transfer learning: (1) a pretrained CNN (ResNet-based architecture, pretrained on ImageNet) extracts feature embeddings from raw images; (2) these learned representations are used as quantitative phenotypes for genetic association testing. Applied to UK Biobank retinal fundus images, identified 60 genomic regions including 7 novel candidate loci for eye-related traits. First demonstration of direct GWAS on whole images without predefined phenotype engineering.
URL
https://github.com/mkirchler/transferGWAS/
KEYWORDS
deep transfer learning, pretrained CNN, ResNet, retinal fundus, whole-image GWAS, representation learning, UK Biobank
TITLE
transferGWAS: GWAS of images using deep transfer learning.
Main citation
Kirchler M, Konigorski S, Norden M, Meltendorf C, Kloft M, Schurmann C, Lippert C. (2022) transferGWAS: GWAS of images using deep transfer learning. Bioinformatics, 38(14):3621-3628. doi:10.1093/bioinformatics/btac369. PMID 35640976
ABSTRACT
MOTIVATION: Medical images can provide rich information about diseases and their biology. However, investigating their association with genetic variation requires non-standard methods. We propose transferGWAS, a novel approach to perform genome-wide association studies directly on full medical images. First, we learn semantically meaningful representations of the images based on a transfer learning task, during which a deep neural network is trained on independent but similar data. Then, we perform genetic association tests with these representations. RESULTS: We validate the type I error rates and power of transferGWAS in simulation studies of synthetic images. Then we apply transferGWAS in a genome-wide association study of retinal fundus images from the UK Biobank. This first-of-a-kind GWAS of full imaging data yielded 60 genomic regions associated with retinal fundus images, of which 7 are novel candidate loci for eye-related traits and diseases.
DOI
10.1093/bioinformatics/btac369