Skip to content

References Functional

Curation of Functional — listings under the References tab.

Summary Table

Click a column header to sort the table.

NAME CATEGORY Main citation YEAR
ENCODE 2004 ENCODE
ENCODE Project Consortium, Science, 2004
2004
ENCODE 2007 ENCODE
ENCODE Project Consortium et al., Nature, 2007
2007
ENCODE 2012 ENCODE
ENCODE Project Consortium, Nature, 2012
2012
ENCODE 2020 ENCODE
ENCODE Project Consortium et al., Nature, 2020
2020
ENCODE 2026 ENCODE
Moore JE et al., Nature, 2026
2026
ENCODE portal ENCODE
Kagda MS et al., Nat Commun, 2025
2025

ENCODE

ENCODE 2004

Reference
PUBMED_LINK
15499007
TITLE
The ENCODE (ENCyclopedia Of DNA Elements) Project.
Main citation
ENCODE Project Consortium. (2004) The ENCODE (ENCyclopedia Of DNA Elements) Project. Science, 306 (5696) 636-40. doi:10.1126/science.1105136. PMID 15499007
ABSTRACT
The ENCyclopedia Of DNA Elements (ENCODE) Project aims to identify all functional elements in the human genome sequence. The pilot phase of the Project is focused on a specified 30 megabases (approximately 1%) of the human genome sequence and is organized as an international consortium of computational and laboratory-based scientists working to develop and apply high-throughput approaches for detecting all sequence elements that confer biological function. The results of this pilot phase will guide future efforts to analyze the entire human genome.
DOI
10.1126/science.1105136

ENCODE 2007

Reference
PUBMED_LINK
17571346
TITLE
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.
Main citation
ENCODE Project Consortium, Birney E, Stamatoyannopoulos JA, Dutta A, ...&, de Jong PJ. (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature, 447 (7146) 799-816. doi:10.1038/nature05874. PMID 17571346
ABSTRACT
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
DOI
10.1038/nature05874

ENCODE 2012

Reference
PUBMED_LINK
22955616
DESCRIPTION
ENCODE Phase II
TITLE
An integrated encyclopedia of DNA elements in the human genome.
Main citation
ENCODE Project Consortium. (2012) An integrated encyclopedia of DNA elements in the human genome. Nature, 489 (7414) 57-74. doi:10.1038/nature11247. PMID 22955616
ABSTRACT
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.
DOI
10.1038/nature11247

ENCODE 2020

Reference
PUBMED_LINK
32728249
DESCRIPTION
ENCODE Phase III
TITLE
Expanded encyclopaedias of DNA elements in the human and mouse genomes.
Main citation
ENCODE Project Consortium, Moore JE, Purcaro MJ, Pratt HE, ...&, Weng Z. (2020) Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature, 583 (7818) 699-710. doi:10.1038/s41586-020-2493-4. PMID 32728249
ABSTRACT
The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.
DOI
10.1038/s41586-020-2493-4

ENCODE 2026

Reference
PUBMED_LINK
41501460
DESCRIPTION
ENCODE Phase IV (ENCODE4)
TITLE
An expanded registry of candidate cis-regulatory elements.
Main citation
Moore JE, Pratt HE, Fan K, Phalke N, ...&, Weng Z. (2026) An expanded registry of candidate cis-regulatory elements. Nature, () . doi:10.1038/s41586-025-09909-9. PMID 41501460
ABSTRACT
Mammalian genomes contain millions of regulatory elements that control the complex patterns of gene expression1. Previously, the ENCODE consortium mapped biochemical signals across hundreds of cell types and tissues and integrated these data to develop a registry containing 0.9 million human and 300,000 mouse candidate cis-regulatory elements (cCREs) annotated with potential functions2. Here we have expanded the registry to include 2.37 million human and 967,000 mouse cCREs, leveraging new ENCODE datasets and enhanced computational methods. This expanded registry covers hundreds of unique cell and tissue types, providing a comprehensive understanding of gene regulation. Functional characterization data from assays such as STARR-seq3, massively parallel reporter assay4, CRISPR perturbation5,6 and transgenic mouse assays7 have profiled more than 90% of human cCREs, revealing complex regulatory functions. We identified thousands of novel silencer cCREs and demonstrated their dual enhancer and silencer roles in different cellular contexts. Integrating the registry with other ENCODE annotations facilitates genetic variation interpretation and trait-associated gene identification, exemplified by the identification of KLF1 as a novel causal gene for red blood cell traits. This expanded registry is a valuable resource for studying the regulatory genome and its impact on health and disease.
DOI
10.1038/s41586-025-09909-9

ENCODE portal

Reference
PUBMED_LINK
41168159
TITLE
Data navigation on the ENCODE portal.
Main citation
Kagda MS, Lam B, Litton C, Small C, ...&, Hitz BC. (2025) Data navigation on the ENCODE portal. Nat Commun, 16 (1) 9592. doi:10.1038/s41467-025-64343-9. PMID 41168159
ABSTRACT
Spanning two decades, the collaborative ENCODE project aims to identify all the functional elements within human and mouse genomes. To best serve the scientific community, the comprehensive ENCODE data including results from 23,000+ functional genomics experiments, 800+ functional elements characterization experiments and 60,000+ results from integrative computational analyses are available on an open-access data-portal ( https://www.encodeproject.org/ ). The final phase of the project includes data from several novel assays aimed at characterization and validation of genomic elements. In addition to developing and maintaining the data portal, the Data Coordination Center (DCC) implemented and utilised uniform processing pipelines to generate uniformly processed data. Here we report recent updates to the data portal including a redesigned home page, an improved search interface, new custom-designed pages highlighting biologically related datasets and an enhanced cart interface for data visualisation plus user-friendly data download options. A summary of data generated using uniform processing pipelines is also provided.
DOI
10.1038/s41467-025-64343-9