Sumstats General
Curation of General — listings under the Summary statistics tab.
Summary Table
Click a column header to sort the table.
Biobanks_Cohorts
Biobank Japan (BBJ) JENGER
PUBMED_LINK
DESCRIPTION
Biobank Japan GWAS summary statistics via the JENGER browser (RIKEN).
URL
TITLE
Population-specific putative causal variants shape quantitative traits.
Main citation
Koyama S, Liu X, Koike Y, Hikino K, ...&, Terao C. (2024) Population-specific putative causal variants shape quantitative traits. Nat Genet, 56 (10) 2027-2035. doi:10.1038/s41588-024-01913-5. PMID 39363016
ABSTRACT
Human genetic variants are associated with many traits through largely unknown mechanisms. Here, combining approximately 260,000 Japanese study participants, a Japanese-specific genotype reference panel and statistical fine-mapping, we identified 4,423 significant loci across 63 quantitative traits, among which 601 were new, and 9,406 putatively causal variants. New associations included Japanese-specific coding, splicing and noncoding variants, exemplified by a damaging missense variant rs730881101 in TNNT2 associated with lower heart function and increased risk for heart failure (P = 1.4 × 10-15 and odds ratio = 4.5, 95% confidence interval = 3.1-6.5). Putative causal noncoding variants were supported by state-of-art in silico functional assays and had comparable effect sizes to coding variants. A plausible example of new mechanisms of causal variants is an enrichment of causal variants in 3' untranslated regions (UTRs), including the Japanese-specific rs13306436 in IL6 associated with pro-inflammatory traits and protection against tuberculosis. We experimentally showed that transcripts with rs13306436 are resistant to mRNA degradation by regnase-1, an RNA-binding protein. Our study provides a list of fine-mapped causal variants to be tested for functionality and underscores the importance of sequencing, genotyping and association efforts in diverse populations.
DOI
10.1038/s41588-024-01913-5
RELATED_BIOBANK
MAIN ANCESTRY
EAS
Biobank Japan (BBJ) Phewebjp
PUBMED_LINK
DESCRIPTION
Japan-wide PheWeb instance for Biobank Japan GWAS summary statistics.
URL
TITLE
Population-specific putative causal variants shape quantitative traits.
Main citation
Koyama S, Liu X, Koike Y, Hikino K, ...&, Terao C. (2024) Population-specific putative causal variants shape quantitative traits. Nat Genet, 56 (10) 2027-2035. doi:10.1038/s41588-024-01913-5. PMID 39363016
ABSTRACT
Human genetic variants are associated with many traits through largely unknown mechanisms. Here, combining approximately 260,000 Japanese study participants, a Japanese-specific genotype reference panel and statistical fine-mapping, we identified 4,423 significant loci across 63 quantitative traits, among which 601 were new, and 9,406 putatively causal variants. New associations included Japanese-specific coding, splicing and noncoding variants, exemplified by a damaging missense variant rs730881101 in TNNT2 associated with lower heart function and increased risk for heart failure (P = 1.4 × 10-15 and odds ratio = 4.5, 95% confidence interval = 3.1-6.5). Putative causal noncoding variants were supported by state-of-art in silico functional assays and had comparable effect sizes to coding variants. A plausible example of new mechanisms of causal variants is an enrichment of causal variants in 3' untranslated regions (UTRs), including the Japanese-specific rs13306436 in IL6 associated with pro-inflammatory traits and protection against tuberculosis. We experimentally showed that transcripts with rs13306436 are resistant to mRNA degradation by regnase-1, an RNA-binding protein. Our study provides a list of fine-mapped causal variants to be tested for functionality and underscores the importance of sequencing, genotyping and association efforts in diverse populations.
DOI
10.1038/s41588-024-01913-5
RELATED_BIOBANK
MAIN ANCESTRY
EAS
Biobank Russia
PUBMED_LINK
DESCRIPTION
GWAS summary statistics from the Russian biobank resource (complex traits in Russian populations).
URL
TITLE
Complex trait susceptibilities and population diversity in a sample of 4,145 Russians.
Main citation
Usoltsev D, Kolosov N, Rotar O, Loboda A, ...&, Artomov M. (2024) Complex trait susceptibilities and population diversity in a sample of 4,145 Russians. Nat Commun, 15 (1) 6212. doi:10.1038/s41467-024-50304-1. PMID 39043636
ABSTRACT
The population of Russia consists of more than 150 local ethnicities. The ethnic diversity and geographic origins, which extend from eastern Europe to Asia, make the population uniquely positioned to investigate the shared properties of inherited disease risks between European and Asian ancestries. We present the analysis of genetic and phenotypic data from a cohort of 4,145 individuals collected in three metro areas in western Russia. We show the presence of multiple admixed genetic ancestry clusters spanning from primarily European to Asian and high identity-by-descent sharing with the Finnish population. As a result, there was notable enrichment of Finnish-specific variants in Russia. We illustrate the utility of Russian-descent cohorts for discovery of novel population-specific genetic associations, as well as replication of previously identified associations that were thought to be population-specific in other cohorts. Finally, we provide access to a database of allele frequencies and GWAS results for 464 phenotypes.
DOI
10.1038/s41467-024-50304-1
MAIN ANCESTRY
EUR
CARTaGENE PheWeb
DESCRIPTION
PheWeb browser for CARTaGENE (Quebec) GWAS summary statistics.
URL
RELATED_BIOBANK
MAIN ANCESTRY
EUR
China Kadoorie Biobank (CKB)
PUBMED_LINK
DESCRIPTION
PheWeb-style GWAS summary statistics for the China Kadoorie Biobank.
URL
TITLE
China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up.
Main citation
Chen Z, Chen J, Collins R, Guo Y, ...&, China Kadoorie Biobank (CKB) collaborative group. (2011) China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol, 40 (6) 1652-66. doi:10.1093/ije/dyr120. PMID 22158673
ABSTRACT
BACKGROUND: Large blood-based prospective studies can provide reliable assessment of the complex interplay of lifestyle, environmental and genetic factors as determinants of chronic disease. METHODS: The baseline survey of the China Kadoorie Biobank took place during 2004-08 in 10 geographically defined regions, with collection of questionnaire data, physical measurements and blood samples. Subsequently, a re-survey of 25,000 randomly selected participants was done (80% responded) using the same methods as in the baseline. All participants are being followed for cause-specific mortality and morbidity, and for any hospital admission through linkages with registries and health insurance (HI) databases. RESULTS: Overall, 512,891 adults aged 30-79 years were recruited, including 41% men, 56% from rural areas and mean age was 52 years. The prevalence of ever-regular smoking was 74% in men and 3% in women. The mean blood pressure was 132/79 mmHg in men and 130/77 mmHg in women. The mean body mass index (BMI) was 23.4 kg/m(2) in men and 23.8 kg/m(2) in women, with only 4% being obese (>30 kg/m(2)), and 3.2% being diabetic. Blood collection was successful in 99.98% and the mean delay from sample collection to processing was 10.6 h. For each of the main baseline variables, there is good reproducibility but large heterogeneity by age, sex and study area. By 1 January 2011, over 10,000 deaths had been recorded, with 91% of surviving participants already linked to HI databases. CONCLUSION: This established large biobank will be a rich and powerful resource for investigating genetic and non-genetic causes of many common chronic diseases in the Chinese population.
DOI
10.1093/ije/dyr120
RELATED_BIOBANK
MAIN ANCESTRY
EAS
FinMetSeq
DESCRIPTION
Finnish metabolic sequencing cohort GWAS results (FinMetSeq) on the Michigan PheWeb.
URL
MAIN ANCESTRY
EUR
FinnGen Kanta 1st Lab values (October 14 2025 )
DESCRIPTION
FinnGen GWAS of laboratory measurements from Finnish register data (first public release, Oct 2025).
URL
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R10 (December 18 2023)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R10 (18 Dec 2023) GWAS summary statistics; flagship FinnGen resource described in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R10-UKBB meta-analysis
PUBMED_LINK
DESCRIPTION
Meta-analysis of FinnGen R10 with UK Biobank GWAS summary statistics (FinnGen distribution).
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R11 (June 24 2024)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R11 (24 Jun 2024) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R12 (November 4 2024)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R12 (4 Nov 2024) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R12-UKBB meta-analysis
PUBMED_LINK
DESCRIPTION
Meta-analysis of FinnGen R12 with UK Biobank GWAS summary statistics (FinnGen distribution).
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R4 (November 30 2020)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R4 (30 Nov 2020) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R5 (May 11 2021)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R5 (11 May 2021) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R6 (January 24 2022)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R6 (24 Jan 2022) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R7 (June 1 2022)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R7 (1 Jun 2022) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R8 (Dec 1 2022)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R8 (1 Dec 2022) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
FinnGen R9 (May 11 2023)
PUBMED_LINK
DESCRIPTION
FinnGen data freeze R9 (11 May 2023) GWAS summary statistics; resource overview in Kurki et al., Nature 2023.
URL
TITLE
FinnGen provides genetic insights from a well-phenotyped isolated population.
Main citation
Kurki MI, Karjalainen J, Palta P, Sipilä TP, ...&, Palotie A. (2023) FinnGen provides genetic insights from a well-phenotyped isolated population. Nature, 613 (7944) 508-518. doi:10.1038/s41586-022-05473-8. PMID 36653562
ABSTRACT
Population isolates such as those in Finland benefit genetic research because deleterious alleles are often concentrated on a small number of low-frequency variants (0.1% ≤ minor allele frequency < 5%). These variants survived the founding bottleneck rather than being distributed over a large number of ultrarare variants. Although this effect is well established in Mendelian genetics, its value in common disease genetics is less explored1,2. FinnGen aims to study the genome and national health register data of 500,000 Finnish individuals. Given the relatively high median age of participants (63 years) and the substantial fraction of hospital-based recruitment, FinnGen is enriched for disease end points. Here we analyse data from 224,737 participants from FinnGen and study 15 diseases that have previously been investigated in large genome-wide association studies (GWASs). We also include meta-analyses of biobank data from Estonia and the United Kingdom. We identified 30 new associations, primarily low-frequency variants, enriched in the Finnish population. A GWAS of 1,932 diseases also identified 2,733 genome-wide significant associations (893 phenome-wide significant (PWS), P < 2.6 × 10-11) at 2,496 (771 PWS) independent loci with 807 (247 PWS) end points. Among these, fine-mapping implicated 148 (73 PWS) coding variants associated with 83 (42 PWS) end points. Moreover, 91 (47 PWS) had an allele frequency of <5% in non-Finnish European individuals, of which 62 (32 PWS) were enriched by more than twofold in Finland. These findings demonstrate the power of bottlenecked populations to find entry points into the biology of common diseases through low-frequency, high impact variants.
DOI
10.1038/s41586-022-05473-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
Generation Scotland
DESCRIPTION
Generation Scotland cohort GWAS summary statistics and related downloads.
URL
MAIN ANCESTRY
EUR
Global Biobank
PUBMED_LINK
DESCRIPTION
Global Biobank Meta-analysis Initiative (GBMI) harmonized GWAS across many biobanks.
URL
TITLE
Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease.
Main citation
Zhou W, Kanai M, Wu KH, Rasheed H, ...&, Neale BM. (2022) Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell Genom, 2 (10) 100192. doi:10.1016/j.xgen.2022.100192. PMID 36777996
ABSTRACT
Biobanks facilitate genome-wide association studies (GWASs), which have mapped genomic loci across a range of human diseases and traits. However, most biobanks are primarily composed of individuals of European ancestry. We introduce the Global Biobank Meta-analysis Initiative (GBMI)-a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWASs generated using harmonized genotypes and phenotypes from member biobanks for 14 exemplar diseases and endpoints. This strategy validates that GWASs conducted in diverse biobanks can be integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics. This collaborative effort improves GWAS power for diseases, benefits understudied diseases, and improves risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.
DOI
10.1016/j.xgen.2022.100192
MAIN ANCESTRY
ALL
KoGES Pheweb
DESCRIPTION
PheWeb instance for KoGES (Korean Genome and Epidemiology Study) GWAS summary statistics.
URL
MAIN ANCESTRY
EAS
KoreanChip
PUBMED_LINK
DESCRIPTION
GWAS summary statistics based on the Korea Biobank Array (KoreanChip / KoGES).
URL
TITLE
The Korea Biobank Array: Design and Identification of Coding Variants Associated with Blood Biochemical Traits.
Main citation
Moon S, Kim YJ, Han S, Hwang MY, ...&, Kim BJ. (2019) The Korea Biobank Array: Design and Identification of Coding Variants Associated with Blood Biochemical Traits. Sci Rep, 9 (1) 1382. doi:10.1038/s41598-018-37832-9. PMID 30718733
ABSTRACT
We introduce the design and implementation of a new array, the Korea Biobank Array (referred to as KoreanChip), optimized for the Korean population and demonstrate findings from GWAS of blood biochemical traits. KoreanChip comprised >833,000 markers including >247,000 rare-frequency or functional variants estimated from >2,500 sequencing data in Koreans. Of the 833 K markers, 208 K functional markers were directly genotyped. Particularly, >89 K markers were presented in East Asians. KoreanChip achieved higher imputation performance owing to the excellent genomic coverage of 95.38% for common and 73.65% for low-frequency variants. From GWAS (Genome-wide association study) using 6,949 individuals, 28 associations were successfully recapitulated. Moreover, 9 missense variants were newly identified, of which we identified new associations between a common population-specific missense variant, rs671 (p.Glu457Lys) of ALDH2, and two traits including aspartate aminotransferase (P = 5.20 × 10-13) and alanine aminotransferase (P = 4.98 × 10-8). Furthermore, two novel missense variants of GPT with rare frequency in East Asians but extreme rarity in other populations were associated with alanine aminotransferase (rs200088103; p.Arg133Trp, P = 2.02 × 10-9 and rs748547625; p.Arg143Cys, P = 1.41 × 10-6). These variants were successfully replicated in 6,000 individuals (P = 5.30 × 10-8 and P = 1.24 × 10-6). GWAS results suggest the promising utility of KoreanChip with a substantial number of damaging variants to identify new population-specific disease-associated rare/functional variants.
DOI
10.1038/s41598-018-37832-9
MAIN ANCESTRY
EAS
MANE PheWeb
PUBMED_LINK
DESCRIPTION
MANE PheWeb — Chinese maternal cohort GWAS summary statistics browser.
URL
TITLE
Genetic analyses of 104 phenotypes in 20,900 Chinese pregnant women reveal pregnancy-specific discoveries.
Main citation
Xiao H, Li L, Yang M, Zhang X, ...&, Jin X. (2024) Genetic analyses of 104 phenotypes in 20,900 Chinese pregnant women reveal pregnancy-specific discoveries. Cell Genom, 4 (10) 100633. doi:10.1016/j.xgen.2024.100633. PMID 39389017
ABSTRACT
Monitoring biochemical phenotypes during pregnancy is vital for maternal and fetal health, allowing early detection and management of pregnancy-related conditions to ensure safety for both. Here, we conducted a genetic analysis of 104 pregnancy phenotypes in 20,900 Chinese women. The genome-wide association study (GWAS) identified a total of 410 trait-locus associations, with 71.71% reported previously. Among the 116 novel hits for 45 phenotypes, 83 were successfully replicated. Among them, 31 were defined as potentially pregnancy-specific associations, including creatine and HELLPAR and neutrophils and ESR1, with subsequent analysis revealing enrichments in estrogen-related pathways and female reproductive tissues. The partitioning heritability underscored the significant roles of fetal blood, embryoid bodies, and female reproductive organs in pregnancy hematology and birth outcomes. Pathway analysis confirmed the intricate interplay of hormone and immune regulation, metabolism, and cell cycle during pregnancy. This study contributes to the understanding of genetic influences on pregnancy phenotypes and their implications for maternal health.
DOI
10.1016/j.xgen.2024.100633
MAIN ANCESTRY
EAS
MGI 1
DESCRIPTION
Michigan Genomics Initiative PheWeb freeze 1 — GWAS summary statistics.
URL
MAIN ANCESTRY
EUR
MGI 2
DESCRIPTION
Michigan Genomics Initiative PheWeb freeze 2 — GWAS summary statistics.
URL
MAIN ANCESTRY
EUR
MGI BioUV
DESCRIPTION
Michigan Genomics Initiative PheWeb BioUV freeze — GWAS summary statistics.
URL
MAIN ANCESTRY
EUR
MVP-Finngen-UKBB meta-analysis
PUBMED_LINK
DESCRIPTION
Cross-biobank GWAS meta-analysis across MVP, FinnGen, and UK Biobank (phenome-wide association resource).
URL
TITLE
Prevalence and disease risks for male and female sex chromosome trisomies: a registry-based phenome-wide association study in 1.5 million participants of MVP, FinnGen, and UK Biobank.
Main citation
Davis SM, Liu A, Teerlink CC, Lapato DM, ...&, Hauger RL. (2025) Prevalence and disease risks for male and female sex chromosome trisomies: a registry-based phenome-wide association study in 1.5 million participants of MVP, FinnGen, and UK Biobank. medRxiv, () . doi:10.1101/2025.01.31.25321488. PMID 39974076
ABSTRACT
Sex chromosome trisomies (SCT) are the most common whole chromosome aneuploidy in humans. Yet, our understanding of the prevalence and associated health outcomes is largely driven by observational studies of clinically diagnosed cases, resulting in a disproportionate focus on 47,XXY and associated hypogonadism. We analyzed microarray intensity data of sex chromosomes for 1.5 million individuals enrolled in three large cohorts-Million Veteran Program, FinnGen, and UK Biobank-to identify individuals with 47,XXY, 47,XYY, and 47,XXX. We examined disease conditions associated with SCTs by performing phenome-wide association studies (PheWAS) using electronic health records (EHR) data for each cohort, followed by meta-analysis across cohorts. Association results are presented for each SCT and also stratified by presence or absence of a documented clinical diagnosis for 47,XXY. We identified 2,769 individuals with (47,XXY: 1,319; 47,XYY: 1,108; 47,XXX: 342), most of whom had no documented clinical diagnosis (47,XXY: 73.8%; 47,XYY: 98.6%; 47,XXX: 93.6%). The identified phenotypic associations with SCT spanned all PheWAS disease categories except neoplasms. Many associations are shared among three SCT subtypes, particularly for vascular diseases (e.g., chronic venous insufficiency (OR [95% CI] for 47,XXY 4.7 [3.9,5.8]; 47,XYY 5.6 [4.5,7.0]; 4 7,XXX 4.6 [2.7,7.6], venous thromboembolism (47,XXY 4.6 [3.7-5.6]; 47,XYY 4.1 [3.3-5.0]; 47,XXX 8.1 [4.2-15.4]), and glaucoma (47,XXY 2.5 [2.1-2.9]; 47,XYY 2.4 [2.0-2.8]; 47,XXX 2.3 [1.4-3.5]). A third sex chromosome confers an increased risk for systemic comorbidities, even if the SCT is not documented. SCT phenotypes largely overlap, suggesting one or more X/Y homolog genes may underlie pathophysiology and comorbidities across SCTs.
DOI
10.1101/2025.01.31.25321488
RELATED_BIOBANK
MAIN ANCESTRY
EUR
PLATLAS
PUBMED_LINK
FULL NAME
PLeiotropic ATLAS
DESCRIPTION
PLATLAS — pleiotropy atlas with GWAS summary statistics across >1000 phenotypes (multi-biobank).
URL
TITLE
Genome-Wide Assessment of Pleiotropy Across >1000 Traits from Global Biobanks.
Main citation
Levin MG, Koyama S, Woerner J, Zhang DY, ...&, Natarajan P. (2025) Genome-Wide Assessment of Pleiotropy Across >1000 Traits from Global Biobanks. medRxiv, () . doi:10.1101/2025.04.18.25326074. PMID 40313291
ABSTRACT
Large-scale genetic association studies have identified thousands of trait-associated risk loci, establishing the polygenic basis for common complex traits and diseases. Although prior studies suggest that many trait-associated loci are pleiotropic, the extent to which this pleiotropy reflects shared causal variants or confounding by linkage disequilibrium remains poorly characterized. To define a set of candidate loci with potentially pleiotropic associations, we performed genome-wide association study (GWAS) meta-analyses of up to 1,167 clinically relevant traits and diseases across 1,789,365 diverse individuals genetically similar to Admixed American (AMR, NMax = 60,756), African (AFR, NMax = 128,361), East Asian (EAS, NMax = 307,465), European (EUR, NMax = 1,283,907), and South Asian (SAS, NMax = 8,876) reference populations from the VA Million Veteran Program (MVP), UK Biobank (UKB), FinnGen, Biobank Japan (BBJ), Tohoku Medical Megabank (ToMMo), and Korean Genome and Epidemiology Study (KoGES). We identified 27,193 genome-wide significant locus-trait pairs (1MB region with PGWAMA < 5 × 10-8) in within-population analysis and 29,139 in multi-population analysis (PMR-MEGA < 5 × 10-8). Among these, 11.5% (n = 3,149) of locus-trait pairs in population-wise and 6.4% (n = 1,875) in multi-population analyses did not reach genome-wide significance in previously published GWAS. In aggregate, the genome-wide significant loci fell within 2,624 non-overlapping autosomal genomic windows on average ~600kb in size. Each locus contained genome-wide significant signals for a median of 6 traits (IQR 2 to 18), including 2,110 (80%) pleiotropic loci associated with >1 trait. Multi-trait colocalization identified 1,902 (72%) loci with high-confidence (posterior probability > 0.9) evidence of a shared causal variant across two or more traits. Variants in pleiotropic loci were significantly enriched for a broad spectrum of functional annotations compared to non-pleiotropic counterparts. Polygenic scores (PGS) developed from these data generally improved prediction compared to existing PGS, and were broadly associated with both primary and pleiotropic phenotypes. These results provide a contemporary map of genetic pleiotropy across the spectrum of human traits/diseases and diverse genetic backgrounds.
DOI
10.1101/2025.04.18.25326074
MAIN ANCESTRY
ALL
Pan-UKB
PUBMED_LINK
DESCRIPTION
Pan-UK Biobank — multi-ancestry GWAS in UK Biobank across thousands of phenotypes.
URL
TITLE
Pan-UK Biobank genome-wide association analyses enhance discovery and resolution of ancestry-enriched effects.
Main citation
Karczewski KJ, Gupta R, Kanai M, Lu W, ...&, Martin AR. (2025) Pan-UK Biobank genome-wide association analyses enhance discovery and resolution of ancestry-enriched effects. Nat Genet, 57 (10) 2408-2417. doi:10.1038/s41588-025-02335-7. PMID 40968291
ABSTRACT
Large biobanks, such as the UK Biobank (UKB), enable massive phenome by genome-wide association studies that elucidate genetic etiology of complex traits. However, people from diverse genetic ancestry groups are often excluded from association analyses due to concerns about population structure introducing false positive associations. Here we generate mixed model associations and meta-analyses across genetic ancestry groups, inclusive of a larger fraction of the UK Biobank than previous efforts, to produce freely available summary statistics for 7,266 traits. We build a quality control and analysis framework informed by genetic architecture. Overall, we identify 14,676 significant loci (P < 5 × 10-8) in the meta-analysis that were not found in the EUR genetic ancestry group alone, including new associations, for example between CAMK2D and triglycerides. We also highlight associations from ancestry-enriched variation, including a known pleiotropic missense variant in G6PD associated with several biomarker traits. We release these results publicly alongside frequently asked questions that describe caveats for interpretation of results, enhancing available resources for interpretation of risk variants across diverse populations.
DOI
10.1038/s41588-025-02335-7
RELATED_BIOBANK
MAIN ANCESTRY
EUR
TPMI PheWeb
PUBMED_LINK
DESCRIPTION
Taiwan Precision Medicine Initiative PheWeb — cohort GWAS summary statistics.
URL
TITLE
The Taiwan Precision Medicine Initiative provides a cohort for large-scale studies.
Main citation
Yang HC, Kwok PY, Li LH, Liu YM, ...&, Wu JY. (2025) The Taiwan Precision Medicine Initiative provides a cohort for large-scale studies. Nature, 648 (8092) 117-127. doi:10.1038/s41586-025-09680-x. PMID 41092961
ABSTRACT
Han Chinese people comprise nearly 20% of the global population but remain under-represented in genetic studies1,2, so there is an urgent need for large-scale cohorts to advance precision medicine. Here we present the Taiwan Precision Medicine Initiative (TPMI), established by Academia Sinica in collaboration with 16 major medical centres around Taiwan, which has recruited 565,390 participants who consent to provide DNA samples for genetic profiling and grant access to their electronic medical records (EMRs) for research. EMR access is both retrospective and prospective, allowing longitudinal studies. Genetic profiling is done with population-optimized arrays of single-nucleotide polymorphisms for people of Han Chinese ancestry, which enable genome-wide association3,4, phenome-wide association5,6 and polygenic risk score7,8 studies to be performed to evaluate common disease risk and pharmacogenetic response. Participants also agreed to be re-contacted for future research and receive personalized genetic risk profiles with health management recommendations. The TPMI has established the TPMI Data Access Platform, a central database and analysis platform that both safeguards the security of the data and facilitates academic research. As a large cohort of individuals with non-European ancestry that merges genetic profiles with EMR data and enables longitudinal follow-up, TPMI provides a unique resource that could be used to validate genetic risk prediction models, perform clinical trials of risk-based health management and inform health policies. Ultimately, the TPMI cohort will contribute to global genetic research and serve as a model for population-based precision medicine.
DOI
10.1038/s41586-025-09680-x
RELATED_BIOBANK
MAIN ANCESTRY
EAS
Taiwan BioBank Pheweb
PUBMED_LINK
DESCRIPTION
Taiwan Biobank PheWeb — GWAS summary statistics for Taiwanese participants.
URL
TITLE
Taiwan Biobank: making cross-database convergence possible in the Big Data era.
Main citation
Lin JC, Fan CT, Liao CC, Chen YS. (2018) Taiwan Biobank: making cross-database convergence possible in the Big Data era. Gigascience, 7 (1) 1-4. doi:10.1093/gigascience/gix110. PMID 29149267
ABSTRACT
The Taiwan Biobank (TWB) is a biomedical research database of biopsy data from 200 000 participants. Access to this database has been granted to research communities taking part in the development of precision medicines; however, this has raised issues surrounding TWB's access to electronic medical records (EMRs). The Personal Data Protection Act of Taiwan restricts access to EMRs for purposes not covered by patients' original consent. This commentary explores possible legal solutions to help ensure that the access TWB has to EMR abides with legal obligations, and with governance frameworks associated with ethical, legal, and social implications. We suggest utilizing "hash function" algorithms to create nonretrospective, anonymized data for the purpose of cross-transmission and/or linkage with EMR.
DOI
10.1093/gigascience/gix110
RELATED_BIOBANK
MAIN ANCESTRY
EAS
Tohoku Medical Megabank (TMM) Jmorp
PUBMED_LINK
DESCRIPTION
Tohoku Medical Megabank / jMorp multi-omics reference and GWAS-related summary data portal.
URL
TITLE
jMorp: Japanese Multi-Omics Reference Panel update report 2023.
Main citation
Tadaka S, Kawashima J, Hishinuma E, Saito S, ...&, Kinoshita K. (2024) jMorp: Japanese Multi-Omics Reference Panel update report 2023. Nucleic Acids Res, 52 (D1) D622-D632. doi:10.1093/nar/gkad978. PMID 37930845
ABSTRACT
Modern medicine is increasingly focused on personalized medicine, and multi-omics data is crucial in understanding biological phenomena and disease mechanisms. Each ethnic group has its unique genetic background with specific genomic variations influencing disease risk and drug response. Therefore, multi-omics data from specific ethnic populations are essential for the effective implementation of personalized medicine. Various prospective cohort studies, such as the UK Biobank, All of Us and Lifelines, have been conducted worldwide. The Tohoku Medical Megabank project was initiated after the Great East Japan Earthquake in 2011. It collects biological specimens and conducts genome and omics analyses to build a basis for personalized medicine. Summary statistical data from these analyses are available in the jMorp web database (https://jmorp.megabank.tohoku.ac.jp), which provides a multidimensional approach to the diversity of the Japanese population. jMorp was launched in 2015 as a public database for plasma metabolome and proteome analyses and has been continuously updated. The current update will significantly expand the scale of the data (metabolome, genome, transcriptome, and metagenome). In addition, the user interface and backend server implementations were rewritten to improve the connectivity between the items stored in jMorp. This paper provides an overview of the new version of the jMorp.
DOI
10.1093/nar/gkad978
RELATED_BIOBANK
MAIN ANCESTRY
EAS
UKB Neale
DESCRIPTION
Neale lab UK Biobank GWAS summary statistics (round-2 style phenome-wide results via PheWeb).
URL
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB TOPMed
PUBMED_LINK
DESCRIPTION
UK Biobank GWAS using TOPMed-imputed genotypes (multi-ancestry imputation panel).
URL
TITLE
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.
Main citation
Taliun D, Harris DN, Kessler MD, Carlson J, ...&, Abecasis GR. (2021) Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature, 590 (7845) 290-299. doi:10.1038/s41586-021-03205-y. PMID 33568819
ABSTRACT
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
DOI
10.1038/s41586-021-03205-y
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB exome
PUBMED_LINK
DESCRIPTION
UK Biobank exome sequence-based GWAS summary statistics (gene- and variant-level association resource).
URL
TITLE
Rare variant contribution to human disease in 281,104 UK Biobank exomes.
Main citation
Wang Q, Dhindsa RS, Carss K, Harper AR, ...&, Petrovski S. (2021) Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature, 597 (7877) 527-532. doi:10.1038/s41586-021-03855-y. PMID 34375979
ABSTRACT
Genome-wide association studies have uncovered thousands of common variants associated with human disease, but the contribution of rare variants to common disease remains relatively unexplored. The UK Biobank contains detailed phenotypic data linked to medical records for approximately 500,000 participants, offering an unprecedented opportunity to evaluate the effect of rare variation on a broad collection of traits1,2. Here we study the relationships between rare protein-coding variants and 17,361 binary and 1,419 quantitative phenotypes using exome sequencing data from 269,171 UK Biobank participants of European ancestry. Gene-based collapsing analyses revealed 1,703 statistically significant gene-phenotype associations for binary traits, with a median odds ratio of 12.4. Furthermore, 83% of these associations were undetectable via single-variant association tests, emphasizing the power of gene-based collapsing analysis in the setting of high allelic heterogeneity. Gene-phenotype associations were also significantly enriched for loss-of-function-mediated traits and approved drug targets. Finally, we performed ancestry-specific and pan-ancestry collapsing analyses using exome sequencing data from 11,933 UK Biobank participants of African, East Asian or South Asian ancestry. Our results highlight a significant contribution of rare variants to common disease. Summary statistics are publicly available through an interactive portal ( http://azphewas.com/ ).
DOI
10.1038/s41586-021-03855-y
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB fastgwa (Imputation)
PUBMED_LINK
DESCRIPTION
UK Biobank GWAS from fastGWA on imputed genotype data (continuous and binary traits).
URL
TITLE
A resource-efficient tool for mixed model association analysis of large-scale data.
Main citation
Jiang L, Zheng Z, Qi T, Kemper KE, ...&, Yang J. (2019) A resource-efficient tool for mixed model association analysis of large-scale data. Nat Genet, 51 (12) 1749-1755. doi:10.1038/s41588-019-0530-8. PMID 31768069
ABSTRACT
The genome-wide association study (GWAS) has been widely used as an experimental design to detect associations between genetic variants and a phenotype. Two major confounding factors, population stratification and relatedness, could potentially lead to inflated GWAS test statistics and hence to spurious associations. Mixed linear model (MLM)-based approaches can be used to account for sample structure. However, genome-wide association (GWA) analyses in biobank samples such as the UK Biobank (UKB) often exceed the capability of most existing MLM-based tools especially if the number of traits is large. Here, we develop an MLM-based tool (fastGWA) that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data. We demonstrate by extensive simulations that fastGWA is reliable, robust and highly resource-efficient. We then apply fastGWA to 2,173 traits on array-genotyped and imputed samples from 456,422 individuals and to 2,048 traits on whole-exome-sequenced samples from 46,191 individuals in the UKB.
DOI
10.1038/s41588-019-0530-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB fastgwa (WES)
PUBMED_LINK
DESCRIPTION
UK Biobank GWAS from fastGWA on whole-exome sequence data.
URL
TITLE
A resource-efficient tool for mixed model association analysis of large-scale data.
Main citation
Jiang L, Zheng Z, Qi T, Kemper KE, ...&, Yang J. (2019) A resource-efficient tool for mixed model association analysis of large-scale data. Nat Genet, 51 (12) 1749-1755. doi:10.1038/s41588-019-0530-8. PMID 31768069
ABSTRACT
The genome-wide association study (GWAS) has been widely used as an experimental design to detect associations between genetic variants and a phenotype. Two major confounding factors, population stratification and relatedness, could potentially lead to inflated GWAS test statistics and hence to spurious associations. Mixed linear model (MLM)-based approaches can be used to account for sample structure. However, genome-wide association (GWA) analyses in biobank samples such as the UK Biobank (UKB) often exceed the capability of most existing MLM-based tools especially if the number of traits is large. Here, we develop an MLM-based tool (fastGWA) that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data. We demonstrate by extensive simulations that fastGWA is reliable, robust and highly resource-efficient. We then apply fastGWA to 2,173 traits on array-genotyped and imputed samples from 456,422 individuals and to 2,048 traits on whole-exome-sequenced samples from 46,191 individuals in the UKB.
DOI
10.1038/s41588-019-0530-8
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB fastgwa-glmm (Binary)
PUBMED_LINK
DESCRIPTION
UK Biobank binary-trait GWAS from SAIGE-style GLMM analysis (fastGWA-glmm pipeline).
URL
TITLE
A generalized linear mixed model association tool for biobank-scale data.
Main citation
Jiang L, Zheng Z, Fang H, Yang J. (2021) A generalized linear mixed model association tool for biobank-scale data. Nat Genet, 53 (11) 1616-1621. doi:10.1038/s41588-021-00954-4. PMID 34737426
ABSTRACT
Compared with linear mixed model-based genome-wide association (GWA) methods, generalized linear mixed model (GLMM)-based methods have better statistical properties when applied to binary traits but are computationally much slower. In the present study, leveraging efficient sparse matrix-based algorithms, we developed a GLMM-based GWA tool, fastGWA-GLMM, that is severalfold to orders of magnitude faster than the state-of-the-art tools when applied to the UK Biobank (UKB) data and scalable to cohorts with millions of individuals. We show by simulation that the fastGWA-GLMM test statistics of both common and rare variants are well calibrated under the null, even for traits with extreme case-control ratios. We applied fastGWA-GLMM to the UKB data of 456,348 individuals, 11,842,647 variants and 2,989 binary traits (full summary statistics available at http://fastgwa.info/ukbimpbin ), and identified 259 rare variants associated with 75 traits, demonstrating the use of imputed genotype data in a large cohort to discover rare variants for binary complex traits.
DOI
10.1038/s41588-021-00954-4
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB gene-based (Genebass)
PUBMED_LINK
DESCRIPTION
UK Biobank gene-based association results from the Genebass / exome analysis resource.
URL
TITLE
Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes.
Main citation
Karczewski KJ, Solomonson M, Chao KR, Goodrich JK, ...&, Neale BM. (2022) Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes. Cell Genom, 2 (9) 100168. doi:10.1016/j.xgen.2022.100168. PMID 36778668
ABSTRACT
Genome-wide association studies have successfully discovered thousands of common variants associated with human diseases and traits, but the landscape of rare variations in human disease has not been explored at scale. Exome-sequencing studies of population biobanks provide an opportunity to systematically evaluate the impact of rare coding variations across a wide range of phenotypes to discover genes and allelic series relevant to human health and disease. Here, we present results from systematic association analyses of 4,529 phenotypes using single-variant and gene tests of 394,841 individuals in the UK Biobank with exome-sequence data. We find that the discovery of genetic associations is tightly linked to frequency and is correlated with metrics of deleteriousness and natural selection. We highlight biological findings elucidated by these data and release the dataset as a public resource alongside the Genebass browser for rapidly exploring rare-variant association results.
DOI
10.1016/j.xgen.2022.100168
RELATED_BIOBANK
MAIN ANCESTRY
EUR
UKB saige
PUBMED_LINK
DESCRIPTION
UK Biobank GWAS with SAIGE (mixed-model association for biobank-scale binary and quantitative traits).
URL
TITLE
Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.
Main citation
Zhou W, Nielsen JB, Fritsche LG, Dey R, ...&, Lee S. (2018) Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat Genet, 50 (9) 1335-1341. doi:10.1038/s41588-018-0184-y. PMID 30104761
ABSTRACT
In genome-wide association studies (GWAS) for thousands of phenotypes in large biobanks, most binary traits have substantially fewer cases than controls. Both of the widely used approaches, the linear mixed model and the recently proposed logistic mixed model, perform poorly; they produce large type I error rates when used to analyze unbalanced case-control phenotypes. Here we propose a scalable and accurate generalized mixed model association test that uses the saddlepoint approximation to calibrate the distribution of score test statistics. This method, SAIGE (Scalable and Accurate Implementation of GEneralized mixed model), provides accurate P values even when case-control ratios are extremely unbalanced. SAIGE uses state-of-art optimization strategies to reduce computational costs; hence, it is applicable to GWAS for thousands of phenotypes by large biobanks. Through the analysis of UK Biobank data of 408,961 samples from white British participants with European ancestry for > 1,400 binary phenotypes, we show that SAIGE can efficiently analyze large sample data, controlling for unbalanced case-control ratios and sample relatedness.
DOI
10.1038/s41588-018-0184-y
RELATED_BIOBANK
MAIN ANCESTRY
EUR
Yang Lab xQTL
PUBMED_LINK
DESCRIPTION
Yang lab SMR/xQTL data resource — public GWAS and molecular QTL summary statistics for integrative analysis.
URL
TITLE
SMR-Portal: an online platform for integrative analysis of GWAS and xQTL data to identify complex trait genes.
Main citation
Guo Y, Xu T, Luo J, Jiang Z, ...&, Yang J. (2025) SMR-Portal: an online platform for integrative analysis of GWAS and xQTL data to identify complex trait genes. Nat Methods, 22 (2) 220-222. doi:10.1038/s41592-024-02561-7. PMID 39623049
DOI
10.1038/s41592-024-02561-7
MAIN ANCESTRY
EUR
Consortiums
DIAGRAM
PUBMED_LINK
DESCRIPTION
Type 2 diabetes GWAS meta-analysis summary statistics from the DIAGRAM consortium.
URL
TITLE
Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes.
Main citation
Morris AP, Voight BF, Teslovich TM, Ferreira T, ...&, DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium. (2012) Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet, 44 (9) 981-90. doi:10.1038/ng.2383. PMID 22885922
ABSTRACT
To extend understanding of the genetic architecture and molecular basis of type 2 diabetes (T2D), we conducted a meta-analysis of genetic variants on the Metabochip, including 34,840 cases and 114,981 controls, overwhelmingly of European descent. We identified ten previously unreported T2D susceptibility loci, including two showing sex-differentiated association. Genome-wide analyses of these data are consistent with a long tail of additional common variant loci explaining much of the variation in susceptibility to T2D. Exploration of the enlarged set of susceptibility loci implicates several processes, including CREBBP-related transcription, adipocytokine signaling and cell cycle regulation, in diabetes pathogenesis.
DOI
10.1038/ng.2383
MAIN ANCESTRY
EUR
GIANT (Genetic Investigation of ANthropometric Traits)
PUBMED_LINK
DESCRIPTION
Anthropometric trait GWAS meta-analysis summary statistics from the GIANT consortium.
URL
TITLE
Hundreds of variants clustered in genomic loci and biological pathways affect human height.
Main citation
Lango Allen H, Estrada K, Lettre G, Berndt SI, ...&, Hirschhorn JN. (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature, 467 (7317) 832-8. doi:10.1038/nature09410. PMID 20881960
ABSTRACT
Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence the phenotype. Genome-wide association (GWA) studies have identified more than 600 variants associated with human traits, but these typically explain small fractions of phenotypic variation, raising questions about the use of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P = 0.016) and that underlie skeletal growth defects (P < 0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented among variants that alter amino-acid structure of proteins and expression levels of nearby genes. Our data explain approximately 10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to approximately 16% of phenotypic variation (approximately 20% of heritable variation). Although additional approaches are needed to dissect the genetic architecture of polygenic human traits fully, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways.
DOI
10.1038/nature09410
MAIN ANCESTRY
Multi-ancestry
GLGC (Global Lipids Genetics Consortium)
PUBMED_LINK
DESCRIPTION
Blood lipid trait GWAS meta-analysis summary statistics from the GLGC.
URL
TITLE
Discovery and refinement of loci associated with lipid levels.
Main citation
Willer CJ, Schmidt EM, Sengupta S, Peloso GM, ...&, Global Lipids Genetics Consortium. (2013) Discovery and refinement of loci associated with lipid levels. Nat Genet, 45 (11) 1274-1283. doi:10.1038/ng.2797. PMID 24097068
ABSTRACT
Levels of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides and total cholesterol are heritable, modifiable risk factors for coronary artery disease. To identify new loci and refine known loci influencing these lipids, we examined 188,577 individuals using genome-wide and custom genotyping arrays. We identify and annotate 157 loci associated with lipid levels at P < 5 × 10(-8), including 62 loci not previously associated with lipid levels in humans. Using dense genotyping in individuals of European, East Asian, South Asian and African ancestry, we narrow association signals in 12 loci. We find that loci associated with blood lipid levels are often associated with cardiovascular and metabolic traits, including coronary artery disease, type 2 diabetes, blood pressure, waist-hip ratio and body mass index. Our results demonstrate the value of using genetic data from individuals of diverse ancestry and provide insights into the biological mechanisms regulating blood lipids to guide future genetic, biological and therapeutic research.
DOI
10.1038/ng.2797
MAIN ANCESTRY
Multi-ancestry
Megastroke
PUBMED_LINK
DESCRIPTION
MEGASTROKE multi-ancestry stroke GWAS meta-analysis summary statistics and portal.
URL
TITLE
Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes.
Main citation
Malik R, Chauhan G, Traylor M, Sargurupremraj M, ...&, Dichgans M. (2018) Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet, 50 (4) 524-537. doi:10.1038/s41588-018-0058-3. PMID 29531354
ABSTRACT
Stroke has multiple etiologies, but the underlying genes and pathways are largely unknown. We conducted a multiancestry genome-wide-association meta-analysis in 521,612 individuals (67,162 cases and 454,450 controls) and discovered 22 new stroke risk loci, bringing the total to 32. We further found shared genetic variation with related vascular traits, including blood pressure, cardiac traits, and venous thromboembolism, at individual loci (n = 18), and using genetic risk scores and linkage-disequilibrium-score regression. Several loci exhibited distinct association and pleiotropy patterns for etiological stroke subtypes. Eleven new susceptibility loci indicate mechanisms not previously implicated in stroke pathophysiology, with prioritization of risk variants and genes accomplished through bioinformatics analyses using extensive functional datasets. Stroke risk loci were significantly enriched in drug targets for antithrombotic therapy.
DOI
10.1038/s41588-018-0058-3
MAIN ANCESTRY
Multi-ancestry
PGC (Psychiatric Genomics Consortium)
PUBMED_LINK
DESCRIPTION
Psychiatric Genomics Consortium meta-analysis summary statistics for psychiatric disorders.
URL
TITLE
Biological insights from 108 schizophrenia-associated genetic loci.
Main citation
Schizophrenia Working Group of the Psychiatric Genomics Consortium. (2014) Biological insights from 108 schizophrenia-associated genetic loci. Nature, 511 (7510) 421-7. doi:10.1038/nature13595. PMID 25056061
ABSTRACT
Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain, providing biological plausibility for the findings. Many findings have the potential to provide entirely new insights into aetiology, but associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that have important roles in immunity, providing support for the speculated link between the immune system and schizophrenia.
DOI
10.1038/nature13595
MAIN ANCESTRY
Multi-ancestry
Database
NBDC (hum0197)
DESCRIPTION
NBDC human database entry hum0197 — metadata and access route for Japanese GWAS / summary statistics.
URL
MAIN ANCESTRY
EAS
Institution
CNCR CTGLAB
DESCRIPTION
Downloadable GWAS summary statistics from the CNCR complex-trait genomics group (Netherlands).
URL
MAIN ANCESTRY
EUR
CNSGENOMICS
DESCRIPTION
Curated links and files for public GWAS summary statistics from the Program in Complex Trait Genomics (Queensland).
URL
MAIN ANCESTRY
Multi-ancestry
Platform
Cardiovascular Disease Knowledge Portal
PUBMED_LINK
DESCRIPTION
Broad/HMS cardiovascular disease knowledge portal with GWAS, gene, and variant views across CV traits.
URL
TITLE
Cardiovascular Disease Knowledge Portal: A Community Resource for Cardiovascular Disease Research.
Main citation
Costanzo MC, Roselli C, Brandes M, Duby M, ...&, Burtt NP. (2023) Cardiovascular Disease Knowledge Portal: A Community Resource for Cardiovascular Disease Research. Circ Genom Precis Med, 16 (6) e004181. doi:10.1161/CIRCGEN.123.004181. PMID 37814896
DOI
10.1161/CIRCGEN.123.004181
MAIN ANCESTRY
Multi-ancestry
GWAS catalog
PUBMED_LINK
DESCRIPTION
NHGRI–EBI GWAS Catalog — curated SNP–trait associations and deposition hub for full summary statistics.
URL
TITLE
The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource.
Main citation
Sollis E, Mosaku A, Abid A, Buniello A, ...&, Harris LW. (2023) The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res, 51 (D1) D977-D985. doi:10.1093/nar/gkac1010. PMID 36350656
ABSTRACT
The NHGRI-EBI GWAS Catalog (www.ebi.ac.uk/gwas) is a FAIR knowledgebase providing detailed, structured, standardised and interoperable genome-wide association study (GWAS) data to >200 000 users per year from academic research, healthcare and industry. The Catalog contains variant-trait associations and supporting metadata for >45 000 published GWAS across >5000 human traits, and >40 000 full P-value summary statistics datasets. Content is curated from publications or acquired via author submission of prepublication summary statistics through a new submission portal and validation tool. GWAS data volume has vastly increased in recent years. We have updated our software to meet this scaling challenge and to enable rapid release of submitted summary statistics. The scope of the repository has expanded to include additional data types of high interest to the community, including sequencing-based GWAS, gene-based analyses and copy number variation analyses. Community outreach has increased the number of shared datasets from under-represented traits, e.g. cancer, and we continue to contribute to awareness of the lack of population diversity in GWAS. Interoperability of the Catalog has been enhanced through links to other resources including the Polygenic Score Catalog and the International Mouse Phenotyping Consortium, refinements to GWAS trait annotation, and the development of a standard format for GWAS data.
DOI
10.1093/nar/gkac1010
MAIN ANCESTRY
Multi-ancestry
Japan Omics Browser
PUBMED_LINK
DESCRIPTION
Japan Omics Browser (JOB) for browsing omics and GWAS-style association results in Japanese cohorts.
URL
TITLE
JOB: Japan Omics Browser provides integrative visualization of multi-omics data.
Main citation
Takahashi Y, Wang QS, Hasegawa T, Namkoong H, ...&, Japan COVID-19 Task Force. (2025) JOB: Japan Omics Browser provides integrative visualization of multi-omics data. BMC Genomics, 26 (1) 451. doi:10.1186/s12864-025-11639-1. PMID 40335902
ABSTRACT
We present the Japan Omics Browser (JOB), which enables integrative analysis of human omics at different layers. JOB offers visualization of per-variant regulatory effects in the human blood at mRNA and protein level distinctively, quantified from statistical fine-mapping of mRNA-expression quantitative loci (eQTL) and protein QTLs (pQTLs) in 1,405 Japanese, together with fine-mapping results of 94 complex traits in UK Biobank. In addition, JOB shows per-tissue regulatory effect prediction score (EMS), trained via multi-task learning. Furthermore, validation scores from Massively Parallel Reporter Assay (MPRA) in two cell types are available for over 10,000 variants. JOB is publicly available at https://japan-omics.jp/ .
DOI
10.1186/s12864-025-11639-1
RELATED_BIOBANK
MAIN ANCESTRY
EAS
OpenGWAS
DESCRIPTION
MRC IEU OpenGWAS database — harmonized GWAS summary statistics and API for MR and related analyses.
URL
PREPRINT_DOI
10.1101/2020.08.10.244293
SERVER
biorxiv
Main citation
Elsworth, B., Lyon, M., Alexander, T., Liu, Y., Matthews, P., Hallett, J., ... & Hemani, G. (2020). The MRC IEU OpenGWAS data infrastructure. BioRxiv, 2020-08.
MAIN ANCESTRY
Multi-ancestry