1000 Genomes

Catalog entries using this tag (links open the entry card on its page):

Entries

b37

Reference 1000 Genomes WGS
FULL NAME
Broad Institute Homo_sapiens_assembly19 (b37)
DESCRIPTION
GRCh37-compatible reference FASTA used across Broad Institute and 1000 Genomes workflows: chromosomes 1-22, X, Y, MT, plus the GL unlocalized and unplaced contigs and the NC_007605 (Epstein-Barr virus) sequence, as in the distributed assembly19 package. Contig names carry no "chr" prefix, and coordinates match the 1KG/b37 ecosystem used by many GWAS imputation and joint-calling pipelines.
URL
https://data.broadinstitute.org/snowman/hg19/
KEYWORDS
GRCh37; 1000 Genomes; Broad; b37; reference FASTA
Main citation
Broad Institute / 1000 Genomes Project. Homo_sapiens_assembly19.fasta (b37). https://data.broadinstitute.org/snowman/hg19/

hs37d5

Reference 1000 Genomes WGS
FULL NAME
1000 Genomes GRCh37 + decoy (hs37d5)
DESCRIPTION
GRCh37 (b37-style) primary chromosomes and contigs plus the hs37d5 decoy sequence set (HuRef/BAC/Fosmid/NA12878-derived sequences) to reduce spurious alignments in short-read mapping. Standard reference for Phase 3-era 1000 Genomes alignment and many imputation and low-pass WGS workflows that target the 1KG coordinate system.
URL
https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/
KEYWORDS
GRCh37; decoy; 1000 Genomes; alignment; hs37d5
Main citation
1000 Genomes Project / Broad Institute. hs37d5 reference (GRCh37 plus decoy sequences). https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/

humanG1Kv37

Reference 1000 Genomes WGS
FULL NAME
1000 Genomes human_g1k_v37 reference
DESCRIPTION
GRCh37-based reference FASTA distributed by the 1000 Genomes Project (human_g1k_v37): chromosomes 1-22, X, Y, MT, plus GL unlocalized/unplaced contigs, without separate haplotype scaffolds or EBV. Commonly used as the Phase 1/Phase 3 alignment reference when harmonizing with public 1KG VCFs and phasing panels.
URL
https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/
KEYWORDS
GRCh37; 1000 Genomes; reference FASTA; human_g1k_v37
Main citation
1000 Genomes Project. human_g1k_v37 reference (GRCh37). https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/
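
Because the three entries above share GRCh37 coordinates and differ mainly in their extra sequences, a contig list is often enough to guess which FASTA a file was built against. The sketch below is illustrative only and not part of any catalog entry. It assumes the EBV sequence is named NC_007605 and the decoy is a single contig named hs37d5; the function names are hypothetical, and the result should be confirmed against the sequence dictionary or checksums of the actual files.

```python
def classify_grch37_reference(contig_names):
    """Guess which b37-family FASTA a list of contig names came from.

    Assumption: the three references are told apart by two extra contigs,
    NC_007605 (EBV) and hs37d5 (decoy). This is a heuristic, not a checksum.
    """
    names = set(contig_names)
    # All three references use unprefixed names ("1", "MT"), not "chr1"/"chrM".
    if "1" not in names or "MT" not in names:
        return "not b37-style"
    if "hs37d5" in names:
        return "hs37d5"          # GRCh37 plus decoy
    if "NC_007605" in names:
        return "b37"             # assembly19 with EBV, no decoy
    return "human_g1k_v37"       # neither EBV nor decoy


def contigs_from_fai(path):
    """Read contig names (first tab-separated column) from a .fai index file."""
    with open(path) as handle:
        return [line.split("\t", 1)[0] for line in handle if line.strip()]
```

Typical use would be `classify_grch37_reference(contigs_from_fai("reference.fasta.fai"))`, where the path is a placeholder for a locally indexed FASTA.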