Skip to content

Biobanks & Cohorts

This is an effort to collect the information on major biobanks or cohorts with genomic data around the world.

Summary Table

Name CONTINENT SAMPLE SIZE Link
Nigerian 100K Genome Project AFRICA ~100k Here
Uganda Genome Resource AFRICA ~6k Here
All of Us AMERICA ~413k Here
Biobank of the Americas AMERICA ~20k Here
BioMe AMERICA ~32k Here
BioVU AMERICA ~120k Here
CanPath - Ontario Health Study AMERICA ~7k Here
CARTaGENE biobank AMERICA ~30K Here
Colorado Center for Personalized Medicine AMERICA ~34k Here
Massachusetts General Brigham Biobank AMERICA ~26K Here
Mexico City Prospective Study AMERICA ~150k Here
Michigan Genomics Initiative AMERICA ~55k Here
Million Veteran Program AMERICA ~900k Here
Penn Medicine Biobank AMERICA ~40k Here
The Canadian Longitudinal Study on Aging AMERICA ~50k Here
UCLA Precision Health Biobank AMERICA ~27k Here
BioBank Japan ASIA ~270k Here
Born in Guangzhou Cohort Study ASIA ~50K Here
China Kadoorie Biobank ASIA ~512k Here
Chinese Millionome Database ASIA ~141k Here
Han Chinese Genome Initiative Phase 1 the Han100K Project ASIA ~114k Here
IndiGenomes ASIA ~10k Here
Korean Genome Project Phase 1 ASIA ~1K Here
Korean Genome Project Phase 2 ASIA ~4K Here
National Biobank of Korea ASIA ~210K Here
National Center Biobank Network ASIA ~120K Here
NyuWa genome resource ASIA ~3k Here
Qatar Biobank ASIA ~80K Here
Qatar Genome Program ASIA ~6K Here
SG10K_Health ASIA ~10k Here
Taiwan Biobank ASIA ~150k Here
Taizhou Imaging Study ASIA ~1K Here
The China Metabolic Analytics Project ASIA ~10k Here
The Hisayama Study ASIA ~8K Here
The Japan Prospective Studies Collaboration for Aging and Dementia ASIA ~11K Here
The Malaysian Cohort ASIA ~100k Here
The Nagahama Study ASIA ~10K Here
The STROMICS genome study ASIA ~10k Here
Tohoku Medical Megabank ASIA ~157k Here
Westlake BioBank for Chinese ASIA ~14k Here
Biobank Graz EUROPE ~1200k Here
deCODE Genetics EUROPE ~250k Here
East London Genes & Health EUROPE ~100k Here
Estonian Biobank EUROPE ~200k Here
Fenland Study EUROPE ~12k Here
FinnGen EUROPE ~500k Here
Generation Scotland EUROPE ~24k Here
INTERVAL Study EUROPE ~50k Here
Lifelines EUROPE ~167k Here
The International Agency for Research on Cancer (IARC) Biobank EUROPE ~560k Here
The Trøndelag Health Study EUROPE ~229k Here
UK Biobank EUROPE ~500k Here
QIMR Berghofer - QIMR Biobank OCIENIA ~17k Here
The Egypt Genome Project AFRICA ~100K Here
Biobank Russia EUROPE ~4K Here

AFRICA

Nigerian 100K Genome Project

  • BIOBANK&COHORT : Nigerian 100K Genome Project
  • CONTINENT : AFRICA
  • REGION : Nigeria
  • ANCESTRY : AFR
  • SAMPLE SIZE : ~100k
  • WGS/WES : ~1K
  • NOTE : First phase: Non-Communicable Diseases Genetic Heritage Study (NCD-GHS)
  • CITATION : Fatumo, S., Yakubu, A., Oyedele, O., Popoola, J., Attipoe, D. A., Eze-Echesi, G., ... & Ene-Obong, A. (2022). Promoting the genomic revolution in Africa through the Nigerian 100K Genome Project. Nature Genetics, 54(5), 531-536.
  • CITATION : Joshi, E., Biddanda, A., Popoola, J., Yakubu, A., Osakwe, O., Attipoe, D., ... & Salako, B. (2023). Whole-genome sequencing across 449 samples spanning 47 ethnolinguistic groups provides insights into genetic diversity in Nigeria. Cell genomics, 3(9).
  • URL : https://allofus.nih.gov/ , https://www.researchallofus.org/register/
  • DESCRIPTION : Genomic studies in African populations provide unique opportunities to understand disease aetiology, human genetic diversity and population history in a regional and a global context. To leverage the relative benefits of different strategies, we undertook a combined approach of genotyping and whole-genome sequencing (WGS) in a population-based study of 6,400 individuals from a geographically defined rural community in South-West Uganda. We present data from 4,778 individuals with genotypes for ~2.2 million SNPs from the Uganda GWAS resource (UGWAS), and sequence data on up to 1,978 individuals spanning 41.5M SNPs and 4.5M indels (UG2G); 343 individuals overlap between the two datasets. We highlight the value of the largest sequence panel from Africa to date as a global resource for variant discovery, imputation and understanding the mutational spectrum and its clinical relevance in African populations. Alongside phenotype data, we provide a rich new genomic resource for researchers in Africa and globally
  • NAME_FOR_TABLE : nigerian-100k-genome-project
  • Name : Nigerian 100K Genome Project
  • Link : Here

The Egypt Genome Project

  • BIOBANK&COHORT : The Egypt Genome Project
  • CONTINENT : AFRICA
  • REGION : Egypt
  • SAMPLE SIZE : ~100K
  • CITATION : Elmonem, M.A., Soliman, N.A., Moustafa, A. et al. The Egypt Genome Project. Nat Genet (2024). https://doi.org/10.1038/s41588-024-01739-1
  • URL : https://egp.sci.eg/
  • DESCRIPTION : EGYPT CENTER FOR RESEARCH AND REGENERATIVE MEDICINE (ECRRM) IS ONE OF THE RESEARCH UNITS ASSOCIATED WITH THE MINISTRY OF DEFENSE. IT WAS ESTABLISHED BY A PRESIDENTIAL DECREE IN 2017 AND HAS A LEGAL ENTITY AFFILIATED WITH THE MINISTRY OF DEFENSE. THE CENTER WILL INITIATE THE EGYPTIAN GENOME REFERENCE PROJECT, UPON THE PRESIDENT AUTHORIZATION, IN COLLABORATION WITH THE ACADEMY OF SCIENTIFIC RESEARCH AND TECHNOLOGY AND THE MINISTRY OF HIGHER EDUCATION, TO HELP IN THE OVERALL ENHANCEMENT OF THE GENERAL HEALTH CARE IN EGYPT.
  • NAME_FOR_TABLE : the-egypt-genome-project
  • Name : The Egypt Genome Project
  • Link : Here

Uganda Genome Resource

  • BIOBANK&COHORT : Uganda Genome Resource
  • ABBREVIATION : UGR
  • CONTINENT : AFRICA
  • REGION : Uganda
  • ANCESTRY : AFR
  • SAMPLE SIZE : ~6k
  • NOTE : (MedicalResearchCouncil (MRC)/UgandaVirusResearchInstitute (UVRI) and LSHTM Uganda Research Unit)
  • CITATION : Gurdasani, D., Carstensen, T., Fatumo, S., Chen, G., Franklin, C. S., Prado-Martinez, J., ... & Sandhu, M. S. (2019). Uganda genome resource enables insights into population history and genomic discovery in Africa. Cell, 179(4), 984-1002.
  • URL : https://www. lshtm.ac.uk/research/units/mrc-uganda
  • DESCRIPTION : The IndiGenomes resource encompasses the genomic data from over 1000 whole genome sequences sequenced from across India as part of the IndiGen programme and represents diverse geographies and ethnicities. The resource provides access to over 55 million genetic variants comprising of single nucleotide variants and indels. The variants are systematically annotated according to the recent Genome Reference Consortium Human Build 38 (GRCh38). Clinically relevant annotations as well as allele frequencies from global populations have also been integrated.
  • NAME_FOR_TABLE : uganda-genome-resource
  • Name : Uganda Genome Resource
  • Link : Here

AMERICA

All of Us

  • BIOBANK&COHORT : All of Us
  • ABBREVIATION : AoU
  • CONTINENT : AMERICA
  • REGION : U.S.
  • ANCESTRY : EUR, AFR, AMR, EAS, SAS, WAS
  • SAMPLE SIZE : ~413k
  • WGS/WES : ~245K
  • CITATION : Investigators, A. U. R. P. (2019). The “All of Us” research program. New England Journal of Medicine, 381(7), 668-676.
  • CITATION : Bick, A. G., Metcalf, G. A., Mayo, K. R., Lichtenstein, L., Rura, S., Carroll, R. J., ... & Denny, J. C. (2024). Genomic data in the All of Us research program. Nature.
  • URL : https://precisionhealth.umich.edu/our-research/michigangenomics/
  • DESCRIPTION : The All of Us Research Program is a historic effort to collect and study data from one million or more people living in the United States. The goal of the program is better health for all of us. The program began national enrollment in 2018 and is expected to last at least 10 years.
  • NAME_FOR_TABLE : all-of-us
  • Name : All of Us
  • Link : Here

BioMe

  • BIOBANK&COHORT : BioMe
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~32k
  • CITATION : Roden, D. M., Pulley, J. M., Basford, M. A., Bernard, G. R., Clayton, E. W., Balser, J. R., & Masys, D. R. (2008). Development of a large‐scale de‐identified DNA biobank to enable personalized medicine. Clinical Pharmacology & Therapeutics, 84(3), 362-369.
  • URL : https://www.vumc.org/dbmi/biovu
  • DESCRIPTION : The Institute for Personalized Medicine at the Icahn School of Medicine at Mount Sinai is leading the movement toward diagnosis and classification of disease according to the patient’s molecular profile. This approach accommodates differences at all possible levels of exposure (genome, environment, and lifestyle) and at all stages of the process, from prevention to post-treatment follow-up. At the center of this effort is BioMe, an electronic medical record-linked biobank that enables researchers to rapidly and efficiently conduct genetic, epidemiologic, molecular, and genomic studies on large collections of research specimens linked with medical information.
  • NAME_FOR_TABLE : biome
  • Name : BioMe
  • Link : Here

BioVU

  • BIOBANK&COHORT : BioVU
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~120k
  • URL : https://bbofa.org/
  • DESCRIPTION : Planning for BioVU began in mid-2004 and the first samples were collected in February 2007. Prior to collecting DNA samples, all aspects of the BioVU project were extensively tested. BioVU now accrues 500-1000 samples per week, totaling more than 275,000 DNA samples as of January 2022. Vanderbilt clinic patients may sign the BioVU Consent Form if they wish to donate their excess blood samples, or not sign the form if they do not wish to participate.
  • NAME_FOR_TABLE : biovu
  • Name : BioVU
  • Link : Here

Biobank of the Americas

  • BIOBANK&COHORT : Biobank of the Americas
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~20k
  • URL : https://www.galatea.bio/#main-biobank
  • DESCRIPTION : Biobank consented samples with associated clinical data from diverse populations from throughout the United States and Latin America via healthcare and biopharma partnerships.
  • NAME_FOR_TABLE : biobank-of-the-americas
  • Name : Biobank of the Americas
  • Link : Here

CARTaGENE biobank

  • BIOBANK&COHORT : CARTaGENE biobank
  • CONTINENT : AMERICA
  • REGION : Canada
  • SAMPLE SIZE : ~30K
  • Array : ~30K
  • WGS/WES : ~2K
  • Transcriptome : ~0.9K
  • DATA ACCESS : https://cartagene.qc.ca/en/researchers.html
  • CITATION : Awadalla, P., Boileau, C., Payette, Y., Idaghdour, Y., Goulet, J. P., Knoppers, B., ... & Laberge, C. (2013). Cohort profile of the CARTaGENE study: Quebec’s population-based biobank for public health and personalized genomics. International journal of epidemiology, 42(5), 1285-1299.
  • URL : https://cartagene.qc.ca/en/
  • DESCRIPTION : CARTaGENE is a public research platform of the CHU Sainte-Justine aiming to accelerate health research. CARTaGENE is made up of both biological samples and data on the health and lifestyle of 43,000 Quebec men and women between the ages of 40 and 69 at recruitment.
  • NAME_FOR_TABLE : cartagene-biobank
  • Name : CARTaGENE biobank
  • Link : Here

CanPath - Ontario Health Study

  • BIOBANK&COHORT : CanPath - Ontario Health Study
  • CONTINENT : AMERICA
  • REGION : Canada
  • SAMPLE SIZE : ~7k
  • CITATION : Kirsh, V. A., Skead, K., McDonald, K., Kreiger, N., Little, J., Menard, K., ... & Awadalla, P. (2022). Cohort Profile: The Ontario Health Study (OHS). International Journal of Epidemiology.
  • URL : https://canpath.ca/cohort/ontario-health-study/
  • DESCRIPTION : The Ontario Health Study (OHS) is a resource for investigating the ways in which lifestyle, the environment and genetics affect people’s health. It is one of the regional cohorts that collectively form the Canadian Partnership for Tomorrow’s Health (CanPath)—a pan-Canadian cohort with >330 000 participants. The linking of Canada’s rich collection of administrative health data with the cohort’s data represents a powerful means to disseminate high-quality, timely data.
  • NAME_FOR_TABLE : canpath-ontario-health-study
  • Name : CanPath - Ontario Health Study
  • Link : Here

Colorado Center for Personalized Medicine

  • BIOBANK&COHORT : Colorado Center for Personalized Medicine
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~34k
  • URL : https://medschool.cuanschutz.edu/cobiobank
  • DESCRIPTION : Established in 2014 as a partnership between UCHealth and University of Colorado Anschutz Medical Campus, the Colorado Center for Personalized Medicine (CCPM) brings together multiple disciplines and institutions to uncover advancements in genomics that can improve diagnosis and treatment of disease, and identify more tailored approaches to population health management.To facilitate discoveries in personalized medicine, CCPM has created a Biobank that aims to be one of the largest academic medicine biospecimen repositories in the mountain and midwest regions of the U.S. The CCPM Biobank is able to link biospecimens and genotype information with patient health information from electronic medical records in an enterprise data warehouse (Health Data Compass) to support a broad range of research, operational, and clinical quality improvement agendas.
  • NAME_FOR_TABLE : colorado-center-for-personalized-medicine
  • Name : Colorado Center for Personalized Medicine
  • Link : Here

Massachusetts General Brigham Biobank

  • BIOBANK&COHORT : Massachusetts General Brigham Biobank
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~26K
  • CITATION : Boutin, N. T., Schecter, S. B., Perez, E. F., Tchamitchian, N. S., Cerretani, X. R., Gainer, V. S., ... & Smoller, J. W. (2022). The Evolution of a Large Biobank at Mass General Brigham. Journal of Personalized Medicine, 12(8), 1323.
  • CITATION : Castro, V. M., Gainer, V., Wattanasin, N., Benoit, B., Cagan, A., Ghosh, B., ... & Murphy, S. N. (2022). The Mass General Brigham Biobank Portal: an i2b2-based data repository linking disparate and high-dimensional patient data to support multimodal analytics. Journal of the American Medical Informatics Association, 29(4), 643-651.
  • URL : https://www.massgeneralbrigham.org/en/research-and-innovation/participate-in-research/biobank
  • DESCRIPTION : The Mass General Brigham Biobank is a large research program designed to help researchers understand how people’s health is affected by their genes, lifestyle, and environment. By participating in the Mass General Brigham Biobank, you can help us better understand, treat, and even prevent the diseases that might affect your health and the health of future generations.
  • NAME_FOR_TABLE : massachusetts-general-brigham-biobank
  • Name : Massachusetts General Brigham Biobank
  • Link : Here

Mexico City Prospective Study

  • BIOBANK&COHORT : Mexico City Prospective Study
  • CONTINENT : AMERICA
  • REGION : Mexico
  • SAMPLE SIZE : ~150k
  • CITATION : Ziyatdinov, A., Torres, J., Alegre-Diaz, J., Backman, J., Mbatchou, J., Turner, M., ... & Tapia-Conyer, R. (2022). Genotyping, sequencing and analysis of 140,000 adults from the Mexico City Prospective Study. bioRxiv.
  • URL : https://www.ctsu.ox.ac.uk/research/prospective-blood-based-study-of-150-000-individuals-in-mexico
  • DESCRIPTION : Between 1998 and 2004, CTSU, in collaboration with the Mexican Ministry of Health, established a study in Mexico City, in which over 150,000 middle-aged adults (including 100,000 women and 50,000 men) provided information about their lifestyle and disease history, had physical measurements recorded (including weight, waist and hip circumference, blood pressure) and had a blood sample taken.
  • NAME_FOR_TABLE : mexico-city-prospective-study
  • Name : Mexico City Prospective Study
  • Link : Here

Michigan Genomics Initiative

  • BIOBANK&COHORT : Michigan Genomics Initiative
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~55k
  • CITATION : Zawistowski, M., Fritsche, L. G., Pandit, A., Vanderwerff, B., Patil, S., Scmidt, E. M., ... & Zoellner, S. (2021). The Michigan Genomics Initiative: a biobank linking genotypes and electronic clinical records in Michigan Medicine patients. medRxiv.
  • URL : https://pmbb.med.upenn.edu/
  • DESCRIPTION : The Michigan Genomics Initiative (MGI) is a collaborative research effort among physicians, researchers, and patients at the University of Michigan (U-M) with the goal of combining patient electronic health record (EHR) data with corresponding genetic data to gain novel biomedical insights. There are currently ~84K consented participants through the MGI and partner studies and the addition of ~10K new participants per year is anticipated. Currently, all MGI participants with available genetic data have received care at the University of Michigan Health System.
  • NAME_FOR_TABLE : michigan-genomics-initiative
  • Name : Michigan Genomics Initiative
  • Link : Here

Million Veteran Program

  • BIOBANK&COHORT : Million Veteran Program
  • ABBREVIATION : MVP
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~900k
  • CITATION : Gaziano, J. M., Concato, J., Brophy, M., Fiore, L., Pyarajan, S., Breeling, J., ... & O'Leary, T. J. (2016). Million Veteran Program: A mega-biobank to study genetic influences on health and disease. Journal of clinical epidemiology, 70, 214-223.
  • CITATION : Hunter-Zinck, H., Shi, Y., Li, M., Gorman, B. R., Ji, S. G., Sun, N., ... & Pyarajan, S. (2020). Genotyping array design and data quality control in the Million Veteran Program. The American Journal of Human Genetics, 106(4), 535-548.
  • URL : https://www.mvp.va.gov/pwa/
  • DESCRIPTION : The Million Veteran Program (MVP) is a national research program to learn how genes, lifestyle, and military exposures affect health and illness. Since launching in 2011, over 900,000 Veteran partners have joined one of the world's largest programs on genetics and health.
  • NAME_FOR_TABLE : million-veteran-program
  • Name : Million Veteran Program
  • Link : Here

Penn Medicine Biobank

  • BIOBANK&COHORT : Penn Medicine Biobank
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~40k
  • URL : https://www.uclahealth.org/precision-health/programs/ucla-atlas-community-health-initiative/ucla-atlas-precision-health-biobank
  • DESCRIPTION : The Penn Medicine BioBank (PMBB) is a research program created to study the causes and treatments of many diseases. Any Penn Medicine patient (age 18 and up) can sign up. The PMBB is a collection of biological samples, such as blood or tissue, that are donated by patient volunteers. These samples are then connected to clinical information, such as diseases or lab measures. These data are then used by researchers to discover new ways to detect, treat, and maybe even prevent or cure disease. Some of these studies may be about how genes affect health and disease. Other studies look at how genes affect response to medicines.
  • NAME_FOR_TABLE : penn-medicine-biobank
  • Name : Penn Medicine Biobank
  • Link : Here

The Canadian Longitudinal Study on Aging

  • BIOBANK&COHORT : The Canadian Longitudinal Study on Aging
  • ABBREVIATION : CLSA
  • CONTINENT : AMERICA
  • REGION : Canada
  • SAMPLE SIZE : ~50k
  • CITATION : Raina, P. S., Wolfson, C., Kirkland, S. A., Griffith, L. E., Oremus, M., Patterson, C., ... & Brazil, K. (2009). The Canadian longitudinal study on aging (CLSA). Canadian Journal on Aging/La Revue canadienne du vieillissement, 28(3), 221-229.
  • URL : https://www.clsa-elcv.ca/
  • DESCRIPTION : The Canadian Longitudinal Study on Aging (CLSA) is a large, national, long-term study that will follow approximately 50,000 individuals who are between the ages of 45 and 85 when recruited, for at least 20 years. The CLSA will collect information on the changing biological, medical, psychological, social, lifestyle and economic aspects of people’s lives. These factors will be studied to understand how, individually and in combination, they have an impact in both maintaining health and in the development of disease and disability as people age.
  • NAME_FOR_TABLE : the-canadian-longitudinal-study-on-aging
  • Name : The Canadian Longitudinal Study on Aging
  • Link : Here

UCLA Precision Health Biobank

  • BIOBANK&COHORT : UCLA Precision Health Biobank
  • CONTINENT : AMERICA
  • REGION : U.S.
  • SAMPLE SIZE : ~27k
  • CITATION : Johnson, R. D., Ding, Y., Bhattacharya, A., Chiu, A., Lajonchere, C., Geschwind, D. H., & Pasaniuc, B. (2022). The UCLA ATLAS Community Health Initiative: promoting precision health research in a diverse biobank. medRxiv.
  • URL : https://icahn.mssm.edu/research/ipm/programs/biome-biobank
  • DESCRIPTION : The UCLA ATLAS Precision Health Biobank, under the supervision of the Translational Pathology Core Laboratory (TCPL), collects biological samples from patients who have consented to participate in the UCLA ATLAS Community Health Initiative. As a collaborator with UCLA ATLAS Community Health Initiative, the UCLA ATLAS Precision Health Biobank manages the collection and distribution of biological samples by removing the personally identifiable information.
  • NAME_FOR_TABLE : ucla-precision-health-biobank
  • Name : UCLA Precision Health Biobank
  • Link : Here

ASIA

BioBank Japan

  • BIOBANK&COHORT : BioBank Japan
  • ABBREVIATION : BBJ
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~270k
  • Array : ~270K
  • WGS/WES : ~14K
  • Metabolome : ~4K
  • CITATION : Nagai, A., Hirata, M., Kamatani, Y., Muto, K., Matsuda, K., Kiyohara, Y., ... & Kubo, M. (2017). Overview of the BioBank Japan Project: study design and profile. Journal of epidemiology, 27(Supplement_III), S2-S8.
  • URL : https://biobankjp.org/
  • DESCRIPTION : In 2003, BioBank Japan (BBJ) started developing one of the world’s largest disease biobanks, creating a foundation for research aimed at achieving medical care tailored to the individual traits of each patient. From a total of 260,000 patients representing 440,000 cases of 51 primarily multifactorial (common) diseases, BBJ has collected DNA, serum, medical records (clinical information), etc. with their consent. No less than 5,800 items of screened information are available for research, including the patients’ survival information, with 95% of the patients tracked over an average of 10 years. In addition to large-scale genomic analyses, omics analyses including whole genome sequencing and metabolome/proteome analyses have been performed on the DNA, serum and other biological samples collected, producing significant research findings. The genomic information acquired through the analyses continues to be used as data. The biological samples and data are widely distributed and used by researchers.
  • NAME_FOR_TABLE : biobank-japan
  • Name : BioBank Japan
  • Link : Here

Born in Guangzhou Cohort Study

  • BIOBANK&COHORT : Born in Guangzhou Cohort Study
  • ABBREVIATION : BIGCS
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~50K
  • WGS/WES : ~4K
  • STUDY TYPE : trio or duo
  • CITATION : Qiu, X., Lu, J. H., He, J. R., Lam, K. B. H., Shen, S. Y., Guo, Y., ... & Xia, H. M. (2017). The born in Guangzhou cohort study (BIGCS). European journal of epidemiology, 32, 337-346.
  • CITATION : Huang, S., Liu, S., Huang, M., He, J. R., Wang, C., Wang, T., ... & Qiu, X. (2024). The Born in Guangzhou Cohort Study enables generational genetic discoveries. Nature, 626(7999), 565-573.
  • URL : http://www.bigcs.com.cn/en_index.html
  • DESCRIPTION : The Born in Guangzhou Cohort Study (BIGCS) is a large-scale prospective observational study investigating the role of social, biological and environmental influences on pregnancy and child health and development in an urban setting in southern China.
  • NAME_FOR_TABLE : born-in-guangzhou-cohort-study
  • Name : Born in Guangzhou Cohort Study
  • Link : Here

China Kadoorie Biobank

  • BIOBANK&COHORT : China Kadoorie Biobank
  • ABBREVIATION : CKB
  • CONTINENT : ASIA
  • REGION : U.K. and China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~512k
  • NOTE : University of Oxford, BJMU, Peking Union Medical College
  • CITATION : Chen, Z., Chen, J., Collins, R., Guo, Y., Peto, R., Wu, F., & Li, L. (2011). China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. International journal of epidemiology, 40(6), 1652-1666.
  • CITATION : Walters, R. G., Millwood, I. Y., Lin, K., Valle, D. S., McDonnell, P., Hacker, A., ... & Chen, Z. (2023). Genotyping and population characteristics of the China Kadoorie Biobank. Cell Genomics, 3(8).
  • URL : https://www.ckbiobank.org/
  • DESCRIPTION : The China Kadoorie Biobank is one of the world’s largest prospective cohort studies. A long-term collaboration between the UK and China, it aims to generate reliable evidence about the lifestyle, environmental and genetic determinants of a wide range of common diseases that can inform disease prevention, risk prediction and treatment worldwide.
  • NAME_FOR_TABLE : china-kadoorie-biobank
  • Name : China Kadoorie Biobank
  • Link : Here

Chinese Millionome Database

  • BIOBANK&COHORT : Chinese Millionome Database
  • ABBREVIATION : CMDB
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~141k
  • NOTE : Chinese Academy of Sciences (CAS) and German Max Planck Society (MPG) partner institute for computational biology
  • CITATION : Li, Z., Jiang, X., Fang, M., Bai, Y., Liu, S., Huang, S., & Jin, X. (2022). CMDB: the comprehensive population genome variation database of China. Nucleic Acids Research.
  • CITATION : Liu, S., Huang, S., Chen, F., Zhao, L., Yuan, Y., Francis, S. S., ... & Xu, X. (2018). Genomic analyses from non-invasive prenatal testing reveal genetic associations, patterns of viral infections, and Chinese population history. Cell, 175(2), 347-359.
  • URL : https://db.cngb.org/cmdb/
  • DESCRIPTION : the largest and the most representative Chinese genome variation database to date. The CMDB database contains 9.04 million single nucleotide variants (SNVs) and the allele frequency information from low-coverage (0.06×–0.1×) WGS data of 141 431 unrelated healthy Chinese individuals.
  • NAME_FOR_TABLE : chinese-millionome-database
  • Name : Chinese Millionome Database
  • Link : Here

Han Chinese Genome Initiative Phase 1 the Han100K Project

  • BIOBANK&COHORT : Han Chinese Genome Initiative Phase 1 the Han100K Project
  • ABBREVIATION : Han100K
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~114k
  • WGS/WES : ~114k
  • NOTE : PGG.Han ShanghaiTech University; University of Chinese Academy of Sciences
  • CITATION : Gao, Y., Zhang, C., Yuan, L., Ling, Y., Wang, X., Liu, C., ... & Xu, S. (2020). PGG. Han: the Han Chinese genome database and analysis platform. Nucleic acids research, 48(D1), D971-D976.
  • URL : https://www.hanchinesegenomes.org/
  • DESCRIPTION : a reference panel of 114 783 Han Chinese individuals (the Han100K), with whole-genome deep-sequenced or high-density genome-wide single-nucleotide variants (SNVs) genotyped or imputed.
  • NAME_FOR_TABLE : han-chinese-genome-initiative-phase-1-the-han100k-project
  • Name : Han Chinese Genome Initiative Phase 1 the Han100K Project
  • Link : Here

IndiGenomes

  • BIOBANK&COHORT : IndiGenomes
  • CONTINENT : ASIA
  • REGION : India
  • ANCESTRY : SAS
  • SAMPLE SIZE : ~10k
  • CITATION : Jain, A., Bhoyar, R. C., Pandhare, K., Mishra, A., Sharma, D., Imran, M., ... & Sivasubbu, S. (2021). IndiGenomes: a comprehensive resource of genetic variants from over 1000 Indian genomes. Nucleic acids research, 49(D1), D1225-D1232.
  • URL : http://clingen.igib.res.in/indigen/
  • DESCRIPTION : The Malaysian Cohort study was initiated in 2005 by the Malaysian government. The top-down approach to this population-based cohort study ensured the allocation of sufficient funding for the project which aimed to recruit 100 000 individuals aged 35–70 years. Participants were recruited from rural and urban areas as well as from various socioeconomic groups. The main objectives of the study were to identify risk factors, to study gene-environment interaction and to discover biomarkers for the early detection of cancers and other diseases.
  • NAME_FOR_TABLE : indigenomes
  • Name : IndiGenomes
  • Link : Here

Korean Genome Project Phase 1

  • BIOBANK&COHORT : Korean Genome Project Phase 1
  • ABBREVIATION : KGP Korea1K
  • CONTINENT : ASIA
  • REGION : Korea
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~1K
  • WGS/WES : ~1K
  • CITATION : Jeon, S., Bhak, Y., Choi, Y., Jeon, Y., Kim, S., Jang, J., ... & Bhak, J. (2020). Korean Genome Project: 1094 Korean personal genomes with clinical information. Science advances, 6(22), eaaz7835.
  • URL : http://koreangenome.org/Main_Page
  • NAME_FOR_TABLE : korean-genome-project-phase-1
  • Name : Korean Genome Project Phase 1
  • Link : Here

Korean Genome Project Phase 2

  • BIOBANK&COHORT : Korean Genome Project Phase 2
  • ABBREVIATION : KGP Korea4K
  • CONTINENT : ASIA
  • REGION : Korea
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~4K
  • WGS/WES : ~4K
  • NOTE : Second phase of Korean Genome Project
  • CITATION : Jeon, S., Choi, H., Jeon, Y., Choi, W. H., Choi, H., An, K., ... & Bhak, J. (2024). Korea4K: whole genome sequences of 4,157 Koreans with 107 phenotypes derived from extensive health check-ups. GigaScience, 13, giae014.
  • URL : http://koreangenome.org/Korea4K_Genomes
  • DESCRIPTION : Korea4K is the second phase data release of the Korean Genome Project (KGP).
  • NAME_FOR_TABLE : korean-genome-project-phase-2
  • Name : Korean Genome Project Phase 2
  • Link : Here

National Biobank of Korea

  • BIOBANK&COHORT : National Biobank of Korea
  • ABBREVIATION : NBK
  • CONTINENT : ASIA
  • REGION : Korea
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~210K
  • DATA ACCESS : https://koges.leelabsg.org/ , https://zenodo.org/record/7042518
  • CITATION : Cho, S. Y., Hong, E. J., Nam, J. M., Han, B., Chu, C., & Park, O. (2012). Opening of the national biobank of Korea as the infrastructure of future biomedical science in Korea. Osong public health and research perspectives, 3(3), 177-184.
  • CITATION : Nam, K., Kim, J., & Lee, S. (2022). Genome-wide study on 72,298 individuals in Korean biobank data for 76 traits. Cell Genomics, 100189.
  • URL : https://nih.go.kr/NIH/cms/content/eng/14/65714_view.html
  • DESCRIPTION : The NBK is the national control center for the collection, management, and utilization of human bioresources in Korea. And NBK manages KBN, it contributes to the development of policies related to human bioresources, standardization of human bioresource management, and advancement of domestic biobanks through developing and providing support for human bioresource technologies. For guaranteeing the fairness in bioresource distribution and development of an efficient distribution system, the NBK also serves as the human bioresource supply hub that supports national healthcare and medical R&D.
  • NAME_FOR_TABLE : national-biobank-of-korea
  • Name : National Biobank of Korea
  • Link : Here

National Center Biobank Network

  • BIOBANK&COHORT : National Center Biobank Network
  • ABBREVIATION : NCBN
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~120K
  • WGS/WES : ~10K
  • CITATION : Omae, Yosuke, Yu-ichi Goto, and Katsushi Tokunaga. "National Center Biobank Network." Human Genome Variation 9.1 (2022): 1-6.
  • URL : https://ncbiobank.org/en/home.php
  • DESCRIPTION : Six National Centers in Japan conduct specialized medical research under the coordination of the National Center Biobank Network (NCBN) and develop therapeutics to improve and protect national health. They actively collaborate to establish a shared biobank and are developing a structure to facilitate industry-academia-government cooperation regarding bioresources through broad joint research. NCBN strives to promote the success of the National Centers and to create bright future for health and human life.
  • NAME_FOR_TABLE : national-center-biobank-network
  • Name : National Center Biobank Network
  • Link : Here

NyuWa genome resource

  • BIOBANK&COHORT : NyuWa genome resource
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~3k
  • WGS/WES : ~3K
  • NOTE : Health Institute of Biophysics, Chinese Academy of Sciences
  • CITATION : Zhang, P., Luo, H., Li, Y., Wang, Y., Wang, J., Zheng, Y., ... & Han100K Initiative. (2021). NyuWa Genome resource: a deep whole-genome sequencing-based variation profile and reference panel for the Chinese population. Cell Reports, 37(7), 110017.
  • URL : http://bigdata.ibp.ac.cn/NyuWa/
  • DESCRIPTION : NyuWa, or NüWa, is the mother goddess who was the creator of the human population in Chinese mythology. Here we presented the NyuWa genome resource based on high depth (median 26X) WGS of 2,999 Chinese individuals from 23 out of 34 administrative divisions in China. NyuWa Genome Resource present in this website mainly contains two parts as NyuWa Chinese Population Variant Database and NyuWa reference panel server.
  • NAME_FOR_TABLE : nyuwa-genome-resource
  • Name : NyuWa genome resource
  • Link : Here

Qatar Biobank

  • BIOBANK&COHORT : Qatar Biobank
  • CONTINENT : ASIA
  • REGION : Qatar
  • SAMPLE SIZE : ~80K
  • CITATION : Al Kuwari, H., Al Thani, A., Al Marri, A., Al Kaabi, A., Abderrahim, H., Afifi, N., ... & Elliott, P. (2015). The Qatar Biobank: background and methods. BMC public health, 15(1), 1-9.
  • URL : : https://www.qatarbiobank.org.qa/
  • DESCRIPTION : KoGES, part of the National Biobank of Korea, is a prospective cohort study with a comprehensive range of phenotypic measures and biological samples, such as DNA, serum, plasma, and urine, collected on approximately 210,000 individuals. KoGES includes the community-based Ansan and Ansung study, the urban community-based health examinee study, and the rural community-based cardiovascular disease association study.
  • NAME_FOR_TABLE : qatar-biobank
  • Name : Qatar Biobank
  • Link : Here

Qatar Genome Program

  • BIOBANK&COHORT : Qatar Genome Program
  • ABBREVIATION : QGP
  • CONTINENT : ASIA
  • REGION : Qatar
  • SAMPLE SIZE : ~6K
  • Array : ~6K
  • NOTE : Part of Qatar Biobank
  • CITATION : RAZALI, Rozaimi Mohamad, et al. Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes. Nature Communications, 2021, 12.1: 5929.
  • URL : https://www.qatargenome.org.qa/
  • NAME_FOR_TABLE : qatar-genome-program
  • Name : Qatar Genome Program
  • Link : Here

SG10K_Health

  • BIOBANK&COHORT : SG10K_Health
  • ABBREVIATION : SG10K
  • CONTINENT : ASIA
  • REGION : Singapore
  • ANCESTRY : EAS, SAS, SEA
  • SAMPLE SIZE : ~10k
  • NOTE : the headline project of the Singapore National Precision Medicine programme (NPM Phase I)
  • CITATION : Chan, Sock Hoai, et al. "Analysis of clinically relevant variants from ancestrally diverse Asian genomes." Nature communications 13.1 (2022): 1-15.
  • URL : https://npm.a-star.edu.sg/
  • DESCRIPTION : SG10K_Health is the headline project of the Singapore National Precision Medicine programme (NPM Phase I). Comprising 10,000 whole-genome sequences from healthy Chinese, Indian, and Malay consented volunteers. SG10K_Health involved a research collaboration across multiple institutions in Singapore, enabling the country to develop the necessary infrastructure and deep capabilities to process, store, and analyse genetic data at the population scale in a safe, secure, and rapid manner. SG10K_Health provides near complete assessment of common genetic variants in Singapore’s three major ethnic groups, which can be used by clinicians to better manage Asian patients with genetic disease and as a control data set to compare against disease studies. Work is ongoing to link the SG10K_Health genomic data to research traits (e.g., height, weight, blood pressure) and clinical records.
  • NAME_FOR_TABLE : sg10khealth
  • Name : SG10K_Health
  • Link : Here

Taiwan Biobank

  • BIOBANK&COHORT : Taiwan Biobank
  • ABBREVIATION : TWB
  • CONTINENT : ASIA
  • REGION : Taiwan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~150k
  • Array : ~109K
  • WGS/WES : ~2K
  • DATA ACCESS : https://taiwanview.twbiobank.org.tw/data_appl (application required)
  • CITATION : Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2021). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. medRxiv.
  • CITATION : Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2022). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. Cell Genomics, 100197.
  • URL : https://www.twbiobank.org.tw/
  • DESCRIPTION : The Taiwan Biobank (TWB) is an ongoing prospective study of over 150,000 individuals aged 30-70 recruited from across Taiwan beginning in 2012. A comprehensive list of phenotypes was collected for each consented participant at recruitment and follow-up visits through structured interviews and physical measurements. Biomarkers and genetic data were also generated for all participants from blood and urine samples.
  • NAME_FOR_TABLE : taiwan-biobank
  • Name : Taiwan Biobank
  • Link : Here

Taizhou Imaging Study

  • BIOBANK&COHORT : Taizhou Imaging Study
  • ABBREVIATION : TIS
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~1K
  • Array : ~1K
  • Metabolome : ~1K
  • Metagenome : ~1K
  • Imaging : ~1K
  • CITATION : Jiang, Y., Cui, M., Tian, W., Zhu, S., Chen, J., Suo, C., ... & Taizhou Imaging Study Group. (2021). Lifestyle, multi‐omics features, and preclinical dementia among Chinese: the Taizhou Imaging Study. Alzheimer's & Dementia, 17(1), 18-28.
  • URL : https://www.fdtzihs.org.cn/dljs
  • NAME_FOR_TABLE : taizhou-imaging-study
  • Name : Taizhou Imaging Study
  • Link : Here

The China Metabolic Analytics Project

  • BIOBANK&COHORT : The China Metabolic Analytics Project
  • ABBREVIATION : ChinaMAP
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~10k
  • NOTE : Shanghai Jiao Tong University
  • CITATION : Cao, Y., Li, L., Xu, M., Feng, Z., Sun, X., Lu, J., ... & Wang, W. (2020). The ChinaMAP analytics of deep whole genome sequences in 10,588 individuals. Cell research, 30(9), 717-731.
  • URL : http://www.mbiobank.com/
  • DESCRIPTION : The ChinaMAP is based on three large-scale cohorts: The China Noncommunicable Disease Surveillance 2010, a nationally representative study with 150,000 participants; the Risk Evaluation of cAncers in Chinese diabeTic Individuals: a lONgitudinal (REACTION) study with 250,000 participants15 and the Community-based Cardiovascular Risk During Urbanization in Shanghai with 50,000 participants.
  • NAME_FOR_TABLE : the-china-metabolic-analytics-project
  • Name : The China Metabolic Analytics Project
  • Link : Here

The Hisayama Study

  • BIOBANK&COHORT : The Hisayama Study
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~8K
  • CITATION : Ninomiya, T. (2018). Japanese legacy cohort studies: the Hisayama Study. Journal of epidemiology, 28(11), 444-451.
  • URL : https://www.hisayama.med.kyushu-u.ac.jp/en/
  • DESCRIPTION : The Hisayama Study is a population-based prospective cohort study that has been conducted in the town of Hisayama, Japan since 1961.
  • NAME_FOR_TABLE : the-hisayama-study
  • Name : The Hisayama Study
  • Link : Here

The Japan Prospective Studies Collaboration for Aging and Dementia

  • BIOBANK&COHORT : The Japan Prospective Studies Collaboration for Aging and Dementia
  • ABBREVIATION : JPSC-AD
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~11K
  • CITATION : JPSFC-AD Study Group. (2020). Study design and baseline characteristics of a population-based prospective cohort study of dementia in Japan: the Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD). Environmental health and preventive medicine, 25(1), 64.
  • URL : https://www.eph.med.kyushu-u.ac.jp/jpsc/en/
  • DESCRIPTION : Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD) study is a collaborative prospective cohort study of approximately 10,000 elderly people from 8 newly-established community-based dementia cohort studies in Japan, in which the data is prospectively collected by using the pre-specified standardized protocol. The purpose of this study is to evaluate quantitatively environmental and genomic risk factors for dementia in Japanese and to establish effective preventive strategies for dementia, in order to realize healthy aging society.
  • NAME_FOR_TABLE : the-japan-prospective-studies-collaboration-for-aging-and-dementia
  • Name : The Japan Prospective Studies Collaboration for Aging and Dementia
  • Link : Here

The Malaysian Cohort

  • BIOBANK&COHORT : The Malaysian Cohort
  • ABBREVIATION : TMC
  • CONTINENT : ASIA
  • REGION : Malaysia
  • SAMPLE SIZE : ~100k
  • STUDY TYPE : community-dwelling individuals aged 65 years or older at 8 sites of Japan
  • CITATION : Jamal, R., Syed Zakaria, S. Z., Kamaruddin, M. A., Abd Jalal, N., Ismail, N., Mohd Kamil, N., ... & Malaysian Cohort Study Group. (2015). Cohort profile: The Malaysian Cohort (TMC) project: a prospective study of non-communicable diseases in a multi-ethnic population. International journal of epidemiology, 44(2), 423-431.
  • URL : https://www.ukm.my/mycohort/ms/
  • DESCRIPTION : Qatar Biobank, a center within Qatar Foundation, was created in collaboration with Hamad Medical Corporation and the Ministry of Public Health to enable local scientists to conduct medical research on prevalent health issues in Qatar.
  • NAME_FOR_TABLE : the-malaysian-cohort
  • Name : The Malaysian Cohort
  • Link : Here

The Nagahama Study

  • BIOBANK&COHORT : The Nagahama Study
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~10K
  • Array : ~9K
  • WGS/WES : ~2K
  • Metabolome : ~9K
  • Proteome : ~2K
  • CITATION : Setoh, K., & Matsuda, F. (2022). Cohort profile: the Nagahama prospective genome cohort for comprehensive human bioscience (The Nagahama Study). Socio-Life Science and the COVID-19 Outbreak: Public Health and Public Policy, 127-143.
  • URL : https://zeroji-cohort.com/english/
  • DESCRIPTION : The Nagahama Primary Prevention Cohort Project is a joint project based on an agreement between Kyoto University Graduate School of Medicine and Nagahama City, Shiga Prefecture, with the cooperation of approximately 10,000 Nagahama residents. In addition, the project conducts follow-up surveys on morbidity and mortality, special tests and surveys on sleep, brain imaging, memory, motor function, skin condition, socioeconomic status, etc., during health checkups and periodic surveys conducted every five years after that. Furthermore, we have completed a multi-omics analysis focusing on genome analysis of approximately 9,000 people (including whole genome sequencing of roughly 2,500 people), comprehensive metabolite analysis of 3-time points, and comprehensive protein analysis of 2,000 people (as of August 2021), and based on these rich and diverse data, we have been searching for health risk Based on these abundant and varied data, we aim to search for health risk factors and elucidate their interactions.
  • NAME_FOR_TABLE : the-nagahama-study
  • Name : The Nagahama Study
  • Link : Here

The STROMICS genome study

  • BIOBANK&COHORT : The STROMICS genome study
  • ABBREVIATION : STROMICS
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~10k
  • Array : ~10K
  • STUDY TYPE : prospective registry for patients presented to hospitals with acute ischaemic cerebrovascular events with long-term follow-up
  • CITATION : Cheng, S., Xu, Z., Bian, S., Chen, X., Shi, Y., Li, Y., ... & Wang, Y. (2023). The STROMICS genome study: deep whole-genome sequencing and analysis of 10K Chinese patients with ischemic stroke reveal complex genetic and phenotypic interplay. Cell Discovery, 9(1), 75.
  • URL : http://www.stromics.org.cn/
  • DESCRIPTION : The Stroke Omics Atlas (STROMICS) is committed to using multi-omics and clinical big data to achieve accurate diagnosis and treatment for stroke patients, reduce treatment costs, and contribute to the health of the people. Using artificial intelligence and cutting-edge high-throughput omics technologies (genomics, transcriptomics, epigenomics, proteomics, metabolomics, metagenomics, etc.), potential drug targets for stroke can be found on a large scale and with high efficiency, providing strong technical support for clinical transformation. Relying on the China National Clinical Research Center for Neurological Diseases and Center of excellence for Omics Research (CORe), STROMICS has realized the interdisciplinary integration of clinical medicine, bioinformatics, and multi-omics, creating a new paradigm of drug research and development.
  • NAME_FOR_TABLE : the-stromics-genome-study
  • Name : The STROMICS genome study
  • Link : Here

Tohoku Medical Megabank

  • BIOBANK&COHORT : Tohoku Medical Megabank
  • ABBREVIATION : TMM
  • CONTINENT : ASIA
  • REGION : Japan
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~157k
  • WGS/WES : ~69K
  • Imaging : ~12K Brain MRI
  • DATA ACCESS : https://jmorp.megabank.tohoku.ac.jp/
  • CITATION : Kuriyama, S., Yaegashi, N., Nagami, F., Arai, T., Kawaguchi, Y., Osumi, N., ... & Tohoku Medical Megabank Project Study Group. (2016). The Tohoku medical megabank project: design and mission. Journal of epidemiology, 26(9), 493-511.
  • URL : https://www.megabank.tohoku.ac.jp/english/
  • DESCRIPTION : Tohoku University Tohoku Medical Megabank Organization was founded to establish an advanced medical system to foster the reconstruction from the Great East Japan Earthquake. The organization has been developing a biobank that combines medical and genome information during the process of rebuilding the community medical system and supporting health and welfare in the Tohoku area. The information from the brand-new biobank will create a new medical system, and, based on the findings of its analysis, the organization aims to attract more medical practitioners from all over the country to the area, promote industry-academic partnerships, create employment in related fields, and restore the medical system in Tohoku.
  • NAME_FOR_TABLE : tohoku-medical-megabank
  • Name : Tohoku Medical Megabank
  • Link : Here

Westlake BioBank for Chinese

  • BIOBANK&COHORT : Westlake BioBank for Chinese
  • ABBREVIATION : WBBC
  • CONTINENT : ASIA
  • REGION : China
  • ANCESTRY : EAS
  • SAMPLE SIZE : ~14k
  • Array : ~6K
  • WGS/WES : ~4K
  • NOTE : Westlake University
  • CITATION : Cong, P. K., Bai, W. Y., Li, J. C., Yang, M. Y., Khederzadeh, S., Gai, S. R., ... & Zheng, H. F. (2022). Genomic analyses of 10,376 individuals in the Westlake BioBank for Chinese (WBBC) pilot project. Nature Communications, 13(1), 1-15.
  • CITATION : Zhu, X. W., Liu, K. Q., Wang, P. Y., Liu, J. Q., Chen, J. Y., Xu, X. J., ... & Zheng, H. F. (2021). Cohort profile: the Westlake BioBank for Chinese (WBBC) pilot project. BMJ open, 11(6), e045564.
  • URL : https://wbbc.westlake.edu.cn/
  • DESCRIPTION : The Westlake BioBank for Chinese (WBBC) cohort is a population-based prospective study with its major purpose to better understand the effect of genetic and environmental factors on growth and development from youngster to elderly. The dataset comprises a wide range of demographics and anthropometric measures, serological tests, physical activity, sleep quality, age at menarche and bone mineral density. WBBC is designed as a prospective cohort study and will recruit at least 100,000 Chinese samples. The pilot project of WBBC has recruited a total of 14,726 participants (4,751 males and 9,975 females) and the baseline survey was carried out from 2017 to 2019.
  • NAME_FOR_TABLE : westlake-biobank-for-chinese
  • Name : Westlake BioBank for Chinese
  • Link : Here

EUROPE

Biobank Graz

  • BIOBANK&COHORT : Biobank Graz
  • CONTINENT : EUROPE
  • REGION : Austria
  • SAMPLE SIZE : ~1200k
  • CITATION : Huppertz, B., Bayer, M., Macheiner, T., & Sargsyan, K. (2016). Biobank Graz: the hub for innovative biomedical research. Open journal of bioresources, 3(1).
  • URL : https://biobank.medunigraz.at/en/?link=http%3A%2F%2F169.254.169.254%2Flatest%2Fmeta-data%2F&cHash=3b3a94b34935e2b8509a838b4a34b0eb
  • DESCRIPTION : Biobank Graz is one of the largest and most well-known clinical biobanks in the world. Around 20 million individual specimens of body fluids and human tissue are stored here. Biobank Graz allows access to these specimens and associated data for scientific research purposes. The common goal is to develop approaches to diagnosing and treating disease.
  • NAME_FOR_TABLE : biobank-graz
  • Name : Biobank Graz
  • Link : Here

Biobank Russia

  • BIOBANK&COHORT : Biobank Russia
  • ABBREVIATION : BBRU
  • CONTINENT : EUROPE
  • REGION : Russia
  • ANCESTRY : EUR
  • SAMPLE SIZE : ~4K
  • Array : ~4K
  • STUDY TYPE : prospective
  • CITATION : Usoltsev, D., Kolosov, N., Rotar, O., Loboda, A., Boyarinova, M., Moguchaya, E., ... & Artomov, M. (2023). Understanding Complex Trait Susceptibilities and Ethnical Diversity in a Sample of 4,145 Russians Through Analysis of Clinical and Genetic Data. bioRxiv, 2023-03.
  • URL : https://biobank.almazovcentre.ru/
  • DESCRIPTION : BioBank Russia (BBRU) is a prospective biobank, managed by V. A. Almazov National Medical Research Center.
  • NAME_FOR_TABLE : biobank-russia
  • Name : Biobank Russia
  • Link : Here

East London Genes & Health

  • BIOBANK&COHORT : East London Genes & Health
  • CONTINENT : EUROPE
  • REGION : U.K.
  • SAMPLE SIZE : ~100k
  • CITATION : Finer, S., Martin, H. C., Khan, A., Hunt, K. A., MacLaughlin, B., Ahmed, Z., ... & van Heel, D. A. (2020). Cohort Profile: East London Genes & Health (ELGH), a community-based population genomics and health study in British Bangladeshi and British Pakistani people. International journal of epidemiology, 49(1), 20-21i.
  • URL : https://www.genesandhealth.org/
  • DESCRIPTION : Genes & Health is a huge long-term study of 100,000 people of Bangladeshi and Pakistani origin. We will link genes with health records, to study disease and treatments. Some volunteers may be invited for further studies. We are inviting volunteers to take part in two regions of the UK: East London (East London Genes & Health) and Bradford (Bradford Genes & Health).
  • NAME_FOR_TABLE : east-london-genes-health
  • Name : East London Genes & Health
  • Link : Here

Estonian Biobank

  • BIOBANK&COHORT : Estonian Biobank
  • CONTINENT : EUROPE
  • REGION : Estonia
  • ANCESTRY : EUR
  • SAMPLE SIZE : ~200k
  • CITATION : Leitsalu, L., Haller, T., Esko, T., Tammesoo, M. L., Alavere, H., Snieder, H., ... & Metspalu, A. (2015). Cohort profile: Estonian biobank of the Estonian genome center, university of Tartu. International journal of epidemiology, 44(4), 1137-1147.
  • URL : https://genomics.ut.ee/en/content/estonian-biobank
  • DESCRIPTION : The Estonian Biobank has established a population-based biobank of Estonia with a current cohort size of more than 200,000 individuals (genotyped with genome-wide arrays), reflecting the age, sex and geographical distribution of the adult Estonian population. Considering the fact that about 20% of Estonia's adult population has joined the programme, it is indeed a database that is very important for the development of medical science both domestically and internationally.
  • NAME_FOR_TABLE : estonian-biobank
  • Name : Estonian Biobank
  • Link : Here

Fenland Study

  • BIOBANK&COHORT : Fenland Study
  • CONTINENT : EUROPE
  • REGION : U.K.
  • SAMPLE SIZE : ~12k
  • CITATION : MRC Epidemiology Unit, University of Cambridge. Fenland Study. [Internet]. Cambridge (UK): MRC Epidemiology Unit; 2017; [cited 2017 July 8]. Available from: http://www.mrc-epid.cam.ac.uk/research/studies/fenland/.
  • URL : https://www.mrc-epid.cam.ac.uk/research/studies/fenland/
  • DESCRIPTION : The Fenland Study investigates the interaction between environmental and genetic factors in determining obesity, type 2 diabetes, and related metabolic disorders. These conditions are a considerable public health concern, but their causes and factors that predict who will be affected by them are not completely understood. What makes the Fenland Study unique is the level of detail it collects about the health and lifestyle of participants, and the objective measurement techniques used in the screening. The first phase of the Fenland Study is now complete, and we are now inviting participants who attended an initial Fenland Study visit between 2005 and 2015 to return for a second visit in Phase 2.
  • NAME_FOR_TABLE : fenland-study
  • Name : Fenland Study
  • Link : Here

FinnGen

  • BIOBANK&COHORT : FinnGen
  • CONTINENT : EUROPE
  • REGION : Finland
  • ANCESTRY : EUR
  • SAMPLE SIZE : ~500k
  • CITATION : Kurki, M. I., Karjalainen, J., Palta, P., Sipilä, T. P., Kristiansson, K., Donner, K., ... & Nelis, M. (2022). FinnGen: Unique genetic insights from combining isolated population and national health register data. medRxiv.
  • URL : https://www.finngen.fi/en
  • DESCRIPTION : FinnGen study launched in Finland in the autumn of 2017 is a unique study that combines genome information with digital health care data. The FinnGen study is an unprecedented global research project representing one of the largest studies of this type. Project aims to improve human health through genetic research, and ultimately identify new therapeutic targets and diagnostics for treating numerous diseases. The collaborative nature of the project is exceptional compare to many ongoing studies, and all the partners are working closely together to ensure appropriate transparency, data security and ownership.
  • NAME_FOR_TABLE : finngen
  • Name : FinnGen
  • Link : Here

Generation Scotland

  • BIOBANK&COHORT : Generation Scotland
  • CONTINENT : EUROPE
  • REGION : Scotland
  • SAMPLE SIZE : ~24k
  • CITATION : Smith, B. H., Campbell, A., Linksted, P., Fitzpatrick, B., Jackson, C., Kerr, S. M., ... & Morris, A. D. (2013). Cohort Profile: Generation Scotland: Scottish Family Health Study (GS: SFHS). The study, its participants and their potential for genetic research on health and illness. International journal of epidemiology, 42(3), 689-700.
  • URL : https://www.ed.ac.uk/generation-scotland
  • DESCRIPTION : Generation Scotland is a research study looking at the health and well-being of volunteers and their families. Generation Scotland combines responses to questionnaires of health and well-being from birth through life. We combine this with NHS health records and innovative laboratory science to understand health trajectories. We work closely with researchers and our volunteers to create a rich evidence base for understanding health. Through this rigorous, ethical and safe approach to research, we seek to enable meaningful change in public health.
  • NAME_FOR_TABLE : generation-scotland
  • Name : Generation Scotland
  • Link : Here

INTERVAL Study

  • BIOBANK&COHORT : INTERVAL Study
  • CONTINENT : EUROPE
  • REGION : U.K.
  • SAMPLE SIZE : ~50k
  • CITATION : Moore, C., Bolton, T., Walker, M., Kaptoge, S., Allen, D., Daynes, M., ... & Thompson, S. G. (2016). Recruitment and representativeness of blood donors in the INTERVAL randomised trial assessing varying inter-donation intervals. Trials, 17(1), 1-12.
  • URL : https://www.intervalstudy.org.uk/
  • DESCRIPTION : Between June 2012 and June 2014, the INTERVAL study recruited about 25,000 men and about 25,000 women at NHS Blood and Transplant (NSHBT) blood donation centres across England. During the study participants are asked to give blood either at usual donation intervals or more frequently. Men donate every 12, 10 or 8 weeks and women every 16, 14 or 12 weeks.
  • NAME_FOR_TABLE : interval-study
  • Name : INTERVAL Study
  • Link : Here

Lifelines

  • BIOBANK&COHORT : Lifelines
  • CONTINENT : EUROPE
  • REGION : Netherlands
  • SAMPLE SIZE : ~167k
  • CITATION : Scholtens, S., Smidt, N., Swertz, M. A., Bakker, S. J., Dotinga, A., Vonk, J. M., ... & Stolk, R. P. (2015). Cohort Profile: LifeLines, a three-generation cohort study and biobank. International journal of epidemiology, 44(4), 1172-1180.
  • URL : https://www.lifelines.nl/researcher
  • DESCRIPTION : Lifelines is a large, multigenerational cohort study that includes over 167,000 participants (10%) from the northern population of the Netherlands. We included participants from three generations, who are followed for at least 30 years, to obtain insight into healthy ageing. The aim of Lifelines is to be a resource for the national and international scientific community.
  • NAME_FOR_TABLE : lifelines
  • Name : Lifelines
  • Link : Here

The International Agency for Research on Cancer (IARC) Biobank

  • BIOBANK&COHORT : The International Agency for Research on Cancer (IARC) Biobank
  • ABBREVIATION : IARC
  • CONTINENT : EUROPE
  • REGION : France
  • SAMPLE SIZE : ~560k
  • CITATION : Mendy, M., Caboux, E., Wild, C. P., & IARC Biobank Steering Committee Members. (2019). Centralization of the IARC biobank: combining multiple sample collections into a common platform. Biopreservation and Biobanking, 17(5), 433-443.
  • URL : https://ibb.iarc.fr/
  • DESCRIPTION : The IARC BioBank (IBB) is one of the largest, most varied and richest International collections of samples in the world. The Biobank is publicly funded, (approximately 60% of its budget is provided by IARC Participating States through the regular budget and the remainder is from research grants) and hosts over 50 different studies, led or coordinated by IARC scientists. The IBB contains both population-based collections from research projects focusing on gene-environment interactions (as in the European Prospective Investigation into Cancer and Nutrition (EPIC) study) and disease-based collections which focus on biomarkers (as in the International Head and Neck Cancer Epidemiology (INHANCE)). Study designs include case-series, prevalence studies, case-control and cohort studies, etc. The IBB contains 5.1 million biological samples from 562,000 individuals. 4 million of the samples are from the EPIC study (over 370,000 individuals) and about one million samples from other collections (close to 200,000 individuals). Most of the samples are body fluids, including plasma, serum and urine as well as extracted DNA samples.
  • NAME_FOR_TABLE : the-international-agency-for-research-on-cancer-iarc-biobank
  • Name : The International Agency for Research on Cancer (IARC) Biobank
  • Link : Here

The Trøndelag Health Study

  • BIOBANK&COHORT : The Trøndelag Health Study
  • ABBREVIATION : HUNT
  • CONTINENT : EUROPE
  • REGION : Norway
  • SAMPLE SIZE : ~229k
  • Array : ~88K
  • WGS/WES : ~2K
  • STUDY TYPE : population-based prospective Norwegian cohort
  • CITATION : Brumpton, B. M., Graham, S., Surakka, I., Skogholt, A. H., Løset, M., Fritsche, L. G., ... & Willer, C. J. (2022). The HUNT Study: a population-based cohort for genetic research. Cell Genomics, 2(10), 100193.
  • URL : https://www.ntnu.edu/hunt/hunt-biobank
  • DESCRIPTION : HUNT Biobank is an established and modern research biobank with high-technology equipment for storage, analysis, sample handling and delivery of samples. Our samples satisfy high quality standards and are stored in accordance with the Data Inspectorates laws and regulations. HUNT Biobank engages in sample handling from The Nord-Trøndelag Health Study (HUNT), Cohort of Norway (CONOR), and can receive samples from other researchers and research projects for storage, analysis and processing of DNA. We do not store samples from private individuals.
  • NAME_FOR_TABLE : the-trndelag-health-study
  • Name : The Trøndelag Health Study
  • Link : Here

UK Biobank

  • BIOBANK&COHORT : UK Biobank
  • ABBREVIATION : UKB
  • CONTINENT : EUROPE
  • REGION : U.K.
  • ANCESTRY : EUR
  • SAMPLE SIZE : ~500k
  • Array : ~500K
  • WGS/WES : ~500k
  • Metabolome : ~120K
  • Proteome : ~54K
  • Imaging : ~100K multimodal
  • CITATION : Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., ... & Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203-209.
  • URL : https://www.ukbiobank.ac.uk/
  • DESCRIPTION : UK Biobank is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants. The database is regularly augmented with additional data and is globally accessible to approved researchers undertaking vital research into the most common and life-threatening diseases. It is a major contributor to the advancement of modern medicine and treatment and has enabled several scientific discoveries that improve human health.
  • NAME_FOR_TABLE : uk-biobank
  • Name : UK Biobank
  • Link : Here

deCODE Genetics

  • BIOBANK&COHORT : deCODE Genetics
  • CONTINENT : EUROPE
  • REGION : Iceland
  • SAMPLE SIZE : ~250k
  • URL : https://www.decode.com/
  • DESCRIPTION : deCODE leads the world in the discovery of genetic risk factors for common diseases. Our gene discovery engine is driven by our unique approach and resources, including detailed genetic and medical information on some 500,000 individuals from around the globe taking part in our discovery work and proprietary statistical algorithms and informatics tools for gathering, analyzing, visualizing and storing large amounts of data.
  • NAME_FOR_TABLE : decode-genetics
  • Name : deCODE Genetics
  • Link : Here

OCIENIA

QIMR Berghofer - QIMR Biobank

  • BIOBANK&COHORT : QIMR Berghofer - QIMR Biobank
  • ABBREVIATION : QSkin and GenEpi
  • CONTINENT : OCIENIA
  • REGION : Australia
  • SAMPLE SIZE : ~17k
  • URL : https://genepi.qimr.edu.au/
  • NAME_FOR_TABLE : qimr-berghofer-qimr-biobank
  • Name : QIMR Berghofer - QIMR Biobank
  • Link : Here