Skip to content

Sumstats Transcriptomics Single cell

Curation of Single cell within Transcriptomics — listings under the Summary statistics tab.

Summary Table

Click a column header to sort the table.

NAME CATEGORY Main citation YEAR
PsychENCODE Phase II Brain
Emani PS et al., Science, 2024
2024
AIDA MISC
Kock KH et al., Cell, 2025
2025
Bian MISC
Bian L et al., Cell Genom, 2025
2025
CIMA MISC
Yin J et al., Science, 2026
2026
Fu MISC
Fu Y et al., Cell Genom, 2026
2026
GTEx MISC
Eraslan G et al., Science, 2022
2022
Ishigaki MISC
Ishigaki K et al., Nat Genet, 2017
2017
JCTF MISC
Wang QS et al., Nat Genet, 2024
2024
OneK1k MISC
Yazar S et al., Science, 2022
2022
Ota MISC
Ota M et al., Cell, 2021
2021
SABR MISC
Castel SE et al., Nat Genet, 2025
2025
TenK10k MISC
Cuomo ASE et al., medRxiv, 2025
2025
sc-eQTLGen MISC
van der Wijst M et al., Elife, 2020
2020

Brain

PsychENCODE Phase II

Summary statistics
PUBMED_LINK
38781369
DESCRIPTION
Phase II of the project focused on single-cell and spatial data, culminating in a collection of 14 papers published on May 24, 2024 (9 in Science, 3 in Science Advances, 1 in Scientific Reports, and 1 in Molecular Psychiatry).
URL
https://www.psychencode.org/home
TITLE
Single-cell genomics and regulatory networks for 388 human brains.
Main citation
Emani PS, Liu JJ, Clarke D, Jensen M, ...&, PsychENCODE Consortium. (2024) Single-cell genomics and regulatory networks for 388 human brains. Science, 384 (6698) eadi5199. doi:10.1126/science.adi5199. PMID 38781369
ABSTRACT
Single-cell genomics is a powerful tool for studying heterogeneous tissues such as the brain. Yet little is understood about how genetic variants influence cell-level gene expression. Addressing this, we uniformly processed single-nuclei, multiomics datasets into a resource comprising >2.8 million nuclei from the prefrontal cortex across 388 individuals. For 28 cell types, we assessed population-level variation in expression and chromatin across gene families and drug targets. We identified >550,000 cell type-specific regulatory elements and >1.4 million single-cell expression quantitative trait loci, which we used to build cell-type regulatory and cell-to-cell communication networks. These networks manifest cellular changes in aging and neuropsychiatric disorders. We further constructed an integrative model accurately imputing single-cell expression and simulating perturbations; the model prioritized ~250 disease-risk genes and drug targets with associated cell types.
DOI
10.1126/science.adi5199

MISC

AIDA

Summary statistics
PUBMED_LINK
40112801
DESCRIPTION
Asian Immune Diversity Atlas
URL
https://cellxgene.cziscience.com/collections/ced320a1-29f3-47c1-a735-513c7084d508
TITLE
Asian diversity in human immune cells.
Main citation
Kock KH, Tan LM, Han KY, Ando Y, ...&, Prabhakar S. (2025) Asian diversity in human immune cells. Cell, 188 (8) 2288-2306.e24. doi:10.1016/j.cell.2025.02.017. PMID 40112801
ABSTRACT
The relationships of human diversity with biomedical phenotypes are pervasive yet remain understudied, particularly in a single-cell genomics context. Here, we present the Asian Immune Diversity Atlas (AIDA), a multi-national single-cell RNA sequencing (scRNA-seq) healthy reference atlas of human immune cells. AIDA comprises 1,265,624 circulating immune cells from 619 donors, spanning 7 population groups across 5 Asian countries, and 6 controls. Though population groups are frequently compared at the continental level, we found that sub-continental diversity, age, and sex pervasively impacted cellular and molecular properties of immune cells. These included differential abundance of cell neighborhoods as well as cell populations and genes relevant to disease risk, pathogenesis, and diagnostics. We discovered functional genetic variants influencing cell-type-specific gene expression, which were under-represented in non-Asian populations, and helped contextualize disease-associated variants. AIDA enables analyses of multi-ancestry disease datasets and facilitates the development of precision medicine efforts in Asia and beyond.
DOI
10.1016/j.cell.2025.02.017

Bian

Summary statistics
PUBMED_LINK
40112817
DESCRIPTION
scGaTE
URL
http://ccra.njmu.edu.cn/scgate/
TITLE
Single-cell eQTL mapping reveals cell-type-specific genes associated with the risk of gastric cancer.
Main citation
Bian L, Hu B, Li F, Gu Y, ...&, Jin G. (2025) Single-cell eQTL mapping reveals cell-type-specific genes associated with the risk of gastric cancer. Cell Genom, 5 (4) 100812. doi:10.1016/j.xgen.2025.100812. PMID 40112817
ABSTRACT
Most expression quantitative trait locus (eQTL) analyses have been conducted in heterogeneous gastric tissues, limiting understanding of cell-type-specific regulatory mechanisms. Here, we employed a pooled multiplexing strategy to profile 399,683 gastric cells from 203 Chinese individuals using single-cell RNA sequencing (scRNA-seq). We identified 19 distinct gastric cell types and performed eQTL analyses, uncovering 8,498 independent eQTLs, with a considerable fraction (81%, 6,909/8,498) exhibiting cell-type-specific effects. Integration of these eQTLs with genome-wide association studies for gastric cancer (GC) revealed four co-localization signals in specific cell types. Genetically predicted cell-type-specific gene expression identified 15 genes associated with GC risk, including the upregulation of MUC1 exclusively in parietal cells, linked to decreased GC risk. Our findings highlight substantial heterogeneity in the genetic regulation of gene expression across gastric cell types and provide critical cell-type-specific annotations of genetic variants associated with GC risk, offering new molecular insights underlying GC.
DOI
10.1016/j.xgen.2025.100812

CIMA

Summary statistics
PUBMED_LINK
41505528
DESCRIPTION
Chinese Immune Multi-Omics Atlas
TITLE
Chinese Immune Multi-Omics Atlas.
Main citation
Yin J, Zheng Y, Huang Z, Zhou W, ...&, Liu C. (2026) Chinese Immune Multi-Omics Atlas. Science, 391 (6781) eadt3130. doi:10.1126/science.adt3130. PMID 41505528
ABSTRACT
Human peripheral blood exhibits molecular and cellular heterogeneity across populations, yet the underlying mechanisms remain unclear. We present the Chinese Immune Multi-Omics Atlas (CIMA), characterizing molecular variations linked to sex, age, and genetic variants through multi-omics analysis of more than 10 million circulating immune cells from 428 Chinese adults. CIMA established an enhancer-driven gene regulatory network comprising 237 robust regulons; identified 9600 eGenes and 52,361 caPeaks at cell type resolution; and revealed pleiotropic associations among immune-related disease risk loci, cis-expression quantitative trait loci (QTLs), and chromatin accessibility QTLs. Furthermore, the cell language model CIMA-CLM predicted chromatin accessibility and evaluated the effects of noncoding variants from chromatin sequences and gene expression. CIMA provides a comprehensive reference for immune-related disease research.
DOI
10.1126/science.adt3130

Fu

Summary statistics
PUBMED_LINK
41386230
TITLE
Single-cell eQTL mapping reveals cell-type-specific genetic regulation in lung cancer.
Main citation
Fu Y, Wang Y, Jin C, Zhang C, ...&, Ma H. (2026) Single-cell eQTL mapping reveals cell-type-specific genetic regulation in lung cancer. Cell Genom, 6 (3) 101100. doi:10.1016/j.xgen.2025.101100. PMID 41386230
ABSTRACT
Genome-wide association studies (GWASs) have identified over 50 lung cancer risk loci; however, the precise cellular context of these genetic mechanisms remains unclear due to limitations in bulk tissue expression quantitative trait locus (eQTL) analyses. Here, we present the largest single-cell eQTL (sc-eQTL) atlas of human lung tissue to date, profiling 222 donors using multiplexed single-cell RNA sequencing (scRNA-seq). We identified 4,341 independent eQTLs across 17 cell types, with over 60% of sc-eQTLs and 51% of eGenes being cell-type specific, and fewer than 52% were detectable in paired bulk datasets. Integration with GWASs for non-small cell lung cancer highlighted epithelial and immune cells as key contributors to genetic susceptibility, identifying 28 candidate genes within known risk loci and 24 in novel regions. Notably, 47% of established non-small cell lung cancer (NSCLC) susceptibility loci exhibited cell-type-specific pleiotropic genetic regulation. This study provides a valuable resource of lung sc-eQTLs and illuminates how genetic variation modulates gene expression in a cell-type-specific fashion, contributing to lung cancer susceptibility.
DOI
10.1016/j.xgen.2025.101100

GTEx

Summary statistics
PUBMED_LINK
35549429
DESCRIPTION
V9 snRNA-Seq
TITLE
Single-nucleus cross-tissue molecular reference maps toward understanding disease gene function.
Main citation
Eraslan G, Drokhlyansky E, Anand S, Fiskin E, ...&, Regev A. (2022) Single-nucleus cross-tissue molecular reference maps toward understanding disease gene function. Science, 376 (6594) eabl4290. doi:10.1126/science.abl4290. PMID 35549429
ABSTRACT
Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.
DOI
10.1126/science.abl4290

Ishigaki

Summary statistics
PUBMED_LINK
28553958
TITLE
Polygenic burdens on cell-specific pathways underlie the risk of rheumatoid arthritis.
Main citation
Ishigaki K, Kochi Y, Suzuki A, Tsuchida Y, ...&, Yamamoto K. (2017) Polygenic burdens on cell-specific pathways underlie the risk of rheumatoid arthritis. Nat Genet, 49 (7) 1120-1125. doi:10.1038/ng.3885. PMID 28553958
ABSTRACT
Recent evidence suggests that a substantial portion of complex disease risk alleles modify gene expression in a cell-specific manner. To identify candidate causal genes and biological pathways of immune-related complex diseases, we conducted expression quantitative trait loci (eQTL) analysis on five subsets of immune cells (CD4+ T cells, CD8+ T cells, B cells, natural killer (NK) cells and monocytes) and unfractionated peripheral blood from 105 healthy Japanese volunteers. We developed a three-step analytical pipeline comprising (i) prediction of individual gene expression using our eQTL database and public epigenomic data, (ii) gene-level association analysis and (iii) prediction of cell-specific pathway activity by integrating the direction of eQTL effects. By applying this pipeline to rheumatoid arthritis data sets, we identified candidate causal genes and a cytokine pathway (upregulation of tumor necrosis factor (TNF) in CD4+ T cells). Our approach is an efficient way to characterize the polygenic contributions and potential biological mechanisms of complex diseases.
DOI
10.1038/ng.3885

JCTF

Summary statistics
PUBMED_LINK
39317738
DESCRIPTION
Japan COVID-19 Task Force
TITLE
Statistically and functionally fine-mapped blood eQTLs and pQTLs from 1,405 humans reveal distinct regulation patterns and disease relevance.
Main citation
Wang QS, Hasegawa T, Namkoong H, Saiki R, ...&, Japan COVID-19 Task Force. (2024) Statistically and functionally fine-mapped blood eQTLs and pQTLs from 1,405 humans reveal distinct regulation patterns and disease relevance. Nat Genet, 56 (10) 2054-2067. doi:10.1038/s41588-024-01896-3. PMID 39317738
ABSTRACT
Studying the genetic regulation of protein expression (through protein quantitative trait loci (pQTLs)) offers a deeper understanding of regulatory variants uncharacterized by mRNA expression regulation (expression QTLs (eQTLs)) studies. Here we report cis-eQTL and cis-pQTL statistical fine-mapping from 1,405 genotyped samples with blood mRNA and 2,932 plasma samples of protein expression, as part of the Japan COVID-19 Task Force (JCTF). Fine-mapped eQTLs (n = 3,464) were enriched for 932 variants validated with a massively parallel reporter assay. Fine-mapped pQTLs (n = 582) were enriched for missense variations on structured and extracellular domains, although the possibility of epitope-binding artifacts remains. Trans-eQTL and trans-pQTL analysis highlighted associations of class I HLA allele variation with KIR genes. We contrast the multi-tissue origin of plasma protein with blood mRNA, contributing to the limited colocalization level, distinct regulatory mechanisms and trait relevance of eQTLs and pQTLs. We report a negative correlation between ABO mRNA and protein expression because of linkage disequilibrium between distinct nearby eQTLs and pQTLs.
DOI
10.1038/s41588-024-01896-3

OneK1k

Summary statistics
PUBMED_LINK
35389779
URL
https://onek1k.org/
TITLE
Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease.
Main citation
Yazar S, Alquicira-Hernandez J, Wing K, Senabouth A, ...&, Powell JE. (2022) Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease. Science, 376 (6589) eabf3041. doi:10.1126/science.abf3041. PMID 35389779
ABSTRACT
The human immune system displays substantial variation between individuals, leading to differences in susceptibility to autoimmune disease. We present single-cell RNA sequencing (scRNA-seq) data from 1,267,758 peripheral blood mononuclear cells from 982 healthy human subjects. For 14 cell types, we identified 26,597 independent cis-expression quantitative trait loci (eQTLs) and 990 trans-eQTLs, with most showing cell type-specific effects on gene expression. We subsequently show how eQTLs have dynamic allelic effects in B cells that are transitioning from naïve to memory states and demonstrate how commonly segregating alleles lead to interindividual variation in immune function. Finally, using a Mendelian randomization approach, we identify the causal route by which 305 risk loci contribute to autoimmune disease at the cellular level. This work brings together genetic epidemiology with scRNA-seq to uncover drivers of interindividual variation in the immune system.
DOI
10.1126/science.abf3041

Ota

Summary statistics
PUBMED_LINK
33930287
TITLE
Dynamic landscape of immune cell-specific gene regulation in immune-mediated diseases.
Main citation
Ota M, Nagafuchi Y, Hatano H, Ishigaki K, ...&, Fujio K. (2021) Dynamic landscape of immune cell-specific gene regulation in immune-mediated diseases. Cell, 184 (11) 3006-3021.e17. doi:10.1016/j.cell.2021.03.056. PMID 33930287
ABSTRACT
Genetic studies have revealed many variant loci that are associated with immune-mediated diseases. To elucidate the disease pathogenesis, it is essential to understand the function of these variants, especially under disease-associated conditions. Here, we performed a large-scale immune cell gene-expression analysis, together with whole-genome sequence analysis. Our dataset consists of 28 distinct immune cell subsets from 337 patients diagnosed with 10 categories of immune-mediated diseases and 79 healthy volunteers. Our dataset captured distinctive gene-expression profiles across immune cell types and diseases. Expression quantitative trait loci (eQTL) analysis revealed dynamic variations of eQTL effects in the context of immunological conditions, as well as cell types. These cell-type-specific and context-dependent eQTLs showed significant enrichment in immune disease-associated genetic variants, and they implicated the disease-relevant cell types, genes, and environment. This atlas deepens our understanding of the immunogenetic functions of disease-associated variants under in vivo disease conditions.
DOI
10.1016/j.cell.2021.03.056

SABR

Summary statistics
PUBMED_LINK
40500424
DESCRIPTION
South African Blood Regulatory
URL
https://zenodo.org/records/15334125
TITLE
A map of blood regulatory variation in South Africans enables GWAS interpretation.
Main citation
Castel SE, Tluway FD, Emde AK, Smyth N, ...&, Ramsay M. (2025) A map of blood regulatory variation in South Africans enables GWAS interpretation. Nat Genet, 57 (7) 1628-1637. doi:10.1038/s41588-025-02223-0. PMID 40500424
ABSTRACT
Functional genomics resources are critical for interpreting human genetic studies, but currently they are predominantly from European-ancestry individuals. Here we present the South African Blood Regulatory (SABR) resource, a map of blood regulatory variation that includes three South Eastern Bantu-speaking groups. Using paired whole-genome and blood transcriptome data from over 600 individuals, we map the genetic architecture of 40 blood cell traits derived from deconvolution analysis, as well as expression, splice and cell-type interaction quantitative trait loci. We comprehensively compare SABR to the Genotype Tissue Expression Project and characterize thousands of regulatory variants only observed in African-ancestry individuals. Finally, we demonstrate the increased utility of SABR for interpreting African-ancestry association studies by identifying putatively causal genes and molecular mechanisms through colocalization analysis of blood-relevant traits from the Pan-UK Biobank. Importantly, we make full SABR summary statistics publicly available to support the African genomics community.
DOI
10.1038/s41588-025-02223-0

TenK10k

Summary statistics
DESCRIPTION
Phase 1: matched WGS and scRNA-seq in ~1.9k individuals; common and rare variant sc-eQTLs in 28 immune cell types (SAIGE-QTL).
URL
https://www.medrxiv.org/content/10.1101/2025.03.20.25324352v2
TITLE
Impact of rare and common genetic variation on cell type-specific gene expression in human blood.
Main citation
Cuomo ASE, Spenceley E, Tanudisastro HA, Bowen B, ...&, Powell JE. (2025) Impact of rare and common genetic variation on cell type-specific gene expression in human blood. medRxiv, () . doi:10.1101/2025.03.20.25324352
ABSTRACT
Understanding the genetic basis of gene expression can shed light on the regulatory mechanisms underlying complex traits and diseases. Single cell-resolved measures of RNA levels and single-cell expression quantitative trait loci (sc-eQTLs) have revealed genetic regulation that drives sub-tissue cell states and types across diverse human tissues. Here, we describe the first phase of TenK10K, the largest-to-date dataset of matched whole-genome sequencing (WGS) and single-cell RNA-sequencing (scRNA-seq). We leverage scRNA-seq data from over 5 million cells across 28 immune cell types, and matched WGS, from 1,925 individuals, which provides power to detect associations between rare and low-frequency genetic variants that have largely been uncharacterised in their impact on cell-specific gene expression. We map the effects of both common and rare variants in a cell type-specific manner using a recently introduced method that increases power by modelling single cells directly rather than relying on aggregated ‘pseudobulk’ counts. We identify putative common regulatory variants for 83% of all 21,404 genes tested and cumulative rare variant signals for 47% of genes. We explore how genetic effects vary across cell type and state spectra, develop a framework to determine the degree to which sc-eQTLs are cell type-specific, and show that about half of the effects are observed only in one or a few cell types. By integrating our results with functional annotations and disease information, we also further characterise the likely molecular modes of action for many disease-variant associations. Finally, we explore the effects that genetic variants have on gene expression across continuous cell states and functions, and effects that vary cell state abundance directly.
DOI
10.1101/2025.03.20.25324352

sc-eQTLGen

Summary statistics
PUBMED_LINK
32149610
URL
https://www.eqtlgen.org/sc/
TITLE
The single-cell eQTLGen consortium.
Main citation
van der Wijst M, de Vries DH, Groot HE, Trynka G, ...&, Franke L. (2020) The single-cell eQTLGen consortium. Elife, 9 () . doi:10.7554/eLife.52155. PMID 32149610
ABSTRACT
In recent years, functional genomics approaches combining genetic information with bulk RNA-sequencing data have identified the downstream expression effects of disease-associated genetic risk factors through so-called expression quantitative trait locus (eQTL) analysis. Single-cell RNA-sequencing creates enormous opportunities for mapping eQTLs across different cell types and in dynamic processes, many of which are obscured when using bulk methods. Rapid increase in throughput and reduction in cost per cell now allow this technology to be applied to large-scale population genetics studies. To fully leverage these emerging data resources, we have founded the single-cell eQTLGen consortium (sc-eQTLGen), aimed at pinpointing the cellular contexts in which disease-causing genetic variants affect gene expression. Here, we outline the goals, approach and potential utility of the sc-eQTLGen consortium. We also provide a set of study design considerations for future single-cell eQTL studies.
DOI
10.7554/eLife.52155