Skip to content

Sumstats Structural variant

Curation of Structural variant — listings under the Summary statistics tab.

Summary Table

Click a column header to sort the table.

CNV

PGC schizophrenia CNV

Summary statistics
PUBMED_LINK
27869829
URL
https://pgc.unc.edu/forresearchers/download-results/ ,https://pubmed.ncbi.nlm.nih.gov/27869829/
TITLE
Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.
Main citation
Marshall CR, Howrigan DP, Merico D, Thiruvahindrapuram B, Wu W, Greer DS, Antaki D, Shetty A, Holmans PA, Pinto D, Gujral M, Brandler WM, Malhotra D, Wang Z, ...&, CNV and Schizophrenia Working Groups of the Psychiatric Genomics Consortium. (2017) Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet, 49 (1) 27-35. doi:10.1038/ng.3725. PMID 27869829
ABSTRACT
Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (odds ratio (OR) = 1.11, P = 5.7 × 10-15), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7 × 10-6). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 × 10-11) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3 × 10-5). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by nonallelic homologous recombination.
DOI
10.1038/ng.3725
MAIN ANCESTRY
EUR

UKB

Summary statistics
PUBMED_LINK
41639462
URL
https://azphewas.com/
TITLE
Phenome-wide analysis of copy number variants in 470,727 UK Biobank genomes.
Main citation
Zou XZ, Hu F, Lou H, Burren OS, ...&, Carss K. (2026) Phenome-wide analysis of copy number variants in 470,727 UK Biobank genomes. Nature, () . doi:10.1038/s41586-025-10087-x. PMID 41639462
ABSTRACT
Copy number variants (CNVs) are key drivers of human diversity and disease risk1. Here we evaluate the role of CNVs across a broad range of human phenotypes and diseases by analysing CNVs from 470,727 UK Biobank whole-genome sequences and conducting a variant- and gene-level phenome-wide association study (PheWAS) with 2,941 plasma protein abundance measurements, 13,336 binary clinical phenotypes and 1,911 quantitative traits. Proteomic analyses validated functional associations of CNVs with nearby genes (cis-protein quantitative trait loci; cis-pQTLs)-with deletions and duplications typically associated with reduced and increased protein levels, respectively-and uncovered previously unknown protein-protein interactions (trans-pQTLs). Our PheWAS recapitulated known associations and uncovered associations in both coding and non-coding regions. Notably, we identified a rare deletion in ZNF451 associated with increased leukocyte telomere length and a non-coding deletion of a SLC2A9 enhancer associated with reduced gout risk. In addition, by combining CNVs with protein-coding single nucleotide variants and indels, we enhanced the power of our study to detect gene-disease associations. Finally, we leveraged this multiomics dataset to identify several pQTLs that constitute candidate biomarkers, including TMPRSS5 for Charcot-Marie-Tooth disease type 1A. This multiancestry whole-genome-sequence CNV PheWAS offers insights into the roles of CNVs in human health outcomes and could serve as a valuable resource for therapeutic development.
DOI
10.1038/s41586-025-10087-x
RELATED_BIOBANK
UK Biobank
MAIN ANCESTRY
EUR

UKB WES CNV (Loh lab)

Summary statistics
PUBMED_LINK
38548989
URL
https://www.nature.com/articles/s41588-024-01684-z ,https://data.broadinstitute.org/lohlab/UKB_WES_CNV_sumstats/ ,https://doi.org/10.5281/zenodo.10529671
TITLE
Protein-altering variants at copy number-variable regions influence diverse human phenotypes.
Main citation
Hujoel MLA, Handsaker RE, Sherman MA, Kamitaki N, Barton AR, Mukamel RE, Terao C, McCarroll SA, Loh PR. (2024) Protein-altering variants at copy number-variable regions influence diverse human phenotypes. Nat Genet, 56 (4) 569-578. doi:10.1038/s41588-024-01684-z. PMID 38548989
ABSTRACT
Copy number variants (CNVs) are among the largest genetic variants, yet CNVs have not been effectively ascertained in most genetic association studies. Here we ascertained protein-altering CNVs from UK Biobank whole-exome sequencing data (n = 468,570) using haplotype-informed methods capable of detecting subexonic CNVs and variation within segmental duplications. Incorporating CNVs into analyses of rare variants predicted to cause gene loss of function (LOF) identified 100 associations of predicted LOF variants with 41 quantitative traits. A low-frequency partial deletion of RGL3 exon 6 conferred one of the strongest protective effects of gene LOF on hypertension risk (odds ratio = 0.86 (0.82-0.90)). Protein-coding variation in rapidly evolving gene families within segmental duplications-previously invisible to most analysis methods-generated some of the human genome's largest contributions to variation in type 2 diabetes risk, chronotype and blood cell traits. These results illustrate the potential for new genetic insights from genomic variation that has escaped large-scale analysis to date.
DOI
10.1038/s41588-024-01684-z
RELATED_BIOBANK
UK Biobank
MAIN ANCESTRY
EUR

WTCCC CNV

Summary statistics
PUBMED_LINK
20360734
URL
https://www.nature.com/articles/nature08979 ,https://pubmed.ncbi.nlm.nih.gov/20360734/
TITLE
Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls.
Main citation
Wellcome Trust Case Control Consortium. (2010) Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls. Nature, 464 (7289) 713-20. doi:10.1038/nature08979. PMID 20360734
ABSTRACT
Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn's disease, HLA for Crohn's disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases.
DOI
10.1038/nature08979
MAIN ANCESTRY
EUR