Skip to content

Major Databases International

Curation of International — major databases in this region (see the Major databases hub).

Summary Table

Click a column header to sort the table.

International Nucleotide Sequence Database Collaboration (INSDC)

Database
DESCRIPTION
The International Nucleotide Sequence Database Collaboration (INSDC) is the agreement among **GenBank (U.S. NCBI), ENA (European EMBL-EBI), and DDBJ (Japan NIG) to form one globally mirrored nucleotide archive: submissions receive stable accessions and propagate daily between the three sites. The same partnership governs raw read archives (SRA, DRA, and raw reads in ENA) and shared BioProject / BioSample metadata so studies stay traceable from sample through reads to assembled records. Policy, minimal information** expectations, and format updates are coordinated through the INSDC web site and member notices.
GenBank
https://www.ncbi.nlm.nih.gov/genbank/
NCBI member of INSDC for annotated and assembled nucleotide sequences; exchanges accessions with ENA and DDBJ so a DDBJ accession resolves at GenBank and vice versa.
ENA
https://www.ebi.ac.uk/ena/browser/home
European Nucleotide Archive at EMBL-EBI—INSDC entry for annotated sequences and indexed raw reads; mirrors SRA and DRA run metadata under shared INSDC accession rules.
DDBJ
https://www.ddbj.nig.ac.jp/ddbj/index-e.html
DNA Data Bank of Japan at NIG—Japanese INSDC node for DDBJ nucleotide records plus **DRA reads, GEA expression, and JGA** human controlled-access data under national policy.
SRA
https://www.ncbi.nlm.nih.gov/sra/
Sequence Read Archive at NCBI—INSDC raw read partner; run accessions are synchronized with ENA and **DRA** so submitters can retrieve the same logical dataset from any member site.
BioProject
https://www.ncbi.nlm.nih.gov/bioproject/
Study-level accessioning shared across INSDC; a BioProject ID ties together BioSample, read archives, and sequence records regardless of which member database accepted the original submission.
INSDC technical specifications
https://www.insdc.org/technical-specifications/
Coordinated file formats, feature tables, controlled vocabularies, and minimal metadata rules agreed by GenBank, ENA, and DDBJ—first stop when preparing bulk or high-throughput submissions.
URL
https://www.insdc.org/

Edit JSON under json/databases/ (folders international/), then run python main.py from src/.