Biobanks & Cohorts

This is an effort to collect the information on major biobanks or cohorts with genomic data around the world.

Summary Table

Name	CONTINENT	SAMPLE SIZE	Link
Nigerian 100K Genome Project	AFRICA	~100k	Here
Uganda Genome Resource	AFRICA	~6k	Here
All of Us	AMERICA	~413k	Here
Biobank of the Americas	AMERICA	~20k	Here
BioMe	AMERICA	~32k	Here
BioVU	AMERICA	~120k	Here
CanPath - Ontario Health Study	AMERICA	~7k	Here
CARTaGENE biobank	AMERICA	~30K	Here
Colorado Center for Personalized Medicine	AMERICA	~34k	Here
Massachusetts General Brigham Biobank	AMERICA	~26K	Here
Mexico City Prospective Study	AMERICA	~150k	Here
Michigan Genomics Initiative	AMERICA	~55k	Here
Million Veteran Program	AMERICA	~900k	Here
Penn Medicine Biobank	AMERICA	~40k	Here
The Canadian Longitudinal Study on Aging	AMERICA	~50k	Here
UCLA Precision Health Biobank	AMERICA	~27k	Here
BioBank Japan	ASIA	~270k	Here
Born in Guangzhou Cohort Study	ASIA	~50K	Here
China Kadoorie Biobank	ASIA	~512k	Here
Chinese Millionome Database	ASIA	~141k	Here
Han Chinese Genome Initiative Phase 1 the Han100K Project	ASIA	~114k	Here
IndiGenomes	ASIA	~10k	Here
Korean Genome Project Phase 1	ASIA	~1K	Here
Korean Genome Project Phase 2	ASIA	~4K	Here
National Biobank of Korea	ASIA	~210K	Here
National Center Biobank Network	ASIA	~120K	Here
NyuWa genome resource	ASIA	~3k	Here
Qatar Biobank	ASIA	~80K	Here
Qatar Genome Program	ASIA	~6K	Here
SG10K_Health	ASIA	~10k	Here
Taiwan Biobank	ASIA	~150k	Here
Taizhou Imaging Study	ASIA	~1K	Here
The China Metabolic Analytics Project	ASIA	~10k	Here
The Hisayama Study	ASIA	~8K	Here
The Japan Prospective Studies Collaboration for Aging and Dementia	ASIA	~11K	Here
The Malaysian Cohort	ASIA	~100k	Here
The Nagahama Study	ASIA	~10K	Here
The STROMICS genome study	ASIA	~10k	Here
Tohoku Medical Megabank	ASIA	~157k	Here
Westlake BioBank for Chinese	ASIA	~14k	Here
Biobank Graz	EUROPE	~1200k	Here
deCODE Genetics	EUROPE	~250k	Here
East London Genes & Health	EUROPE	~100k	Here
Estonian Biobank	EUROPE	~200k	Here
Fenland Study	EUROPE	~12k	Here
FinnGen	EUROPE	~500k	Here
Generation Scotland	EUROPE	~24k	Here
INTERVAL Study	EUROPE	~50k	Here
Lifelines	EUROPE	~167k	Here
The International Agency for Research on Cancer (IARC) Biobank	EUROPE	~560k	Here
The Trøndelag Health Study	EUROPE	~229k	Here
UK Biobank	EUROPE	~500k	Here
QIMR Berghofer - QIMR Biobank	OCIENIA	~17k	Here
The Egypt Genome Project	AFRICA	~100K	Here
Biobank Russia	EUROPE	~4K	Here
BioPortal	AMERICA		Here

AFRICA

Nigerian 100K Genome Project

BIOBANK&COHORT : Nigerian 100K Genome Project
CONTINENT : AFRICA
REGION : Nigeria
ANCESTRY : AFR
SAMPLE SIZE : ~100k
WGS/WES : ~1K
NOTE : First phase: Non-Communicable Diseases Genetic Heritage Study (NCD-GHS)
CITATION : Fatumo, S., Yakubu, A., Oyedele, O., Popoola, J., Attipoe, D. A., Eze-Echesi, G., ... & Ene-Obong, A. (2022). Promoting the genomic revolution in Africa through the Nigerian 100K Genome Project. Nature Genetics, 54(5), 531-536.
CITATION : Joshi, E., Biddanda, A., Popoola, J., Yakubu, A., Osakwe, O., Attipoe, D., ... & Salako, B. (2023). Whole-genome sequencing across 449 samples spanning 47 ethnolinguistic groups provides insights into genetic diversity in Nigeria. Cell genomics, 3(9).
URL : https://allofus.nih.gov/ , https://www.researchallofus.org/register/
DESCRIPTION : Genomic studies in African populations provide unique opportunities to understand disease aetiology, human genetic diversity and population history in a regional and a global context. To leverage the relative benefits of different strategies, we undertook a combined approach of genotyping and whole-genome sequencing (WGS) in a population-based study of 6,400 individuals from a geographically defined rural community in South-West Uganda. We present data from 4,778 individuals with genotypes for ~2.2 million SNPs from the Uganda GWAS resource (UGWAS), and sequence data on up to 1,978 individuals spanning 41.5M SNPs and 4.5M indels (UG2G); 343 individuals overlap between the two datasets. We highlight the value of the largest sequence panel from Africa to date as a global resource for variant discovery, imputation and understanding the mutational spectrum and its clinical relevance in African populations. Alongside phenotype data, we provide a rich new genomic resource for researchers in Africa and globally
NAME_FOR_TABLE : nigerian-100k-genome-project
Name : Nigerian 100K Genome Project
Link : Here

The Egypt Genome Project

BIOBANK&COHORT : The Egypt Genome Project
CONTINENT : AFRICA
REGION : Egypt
SAMPLE SIZE : ~100K
CITATION : Elmonem, M.A., Soliman, N.A., Moustafa, A. et al. The Egypt Genome Project. Nat Genet (2024). https://doi.org/10.1038/s41588-024-01739-1
URL : https://egp.sci.eg/
DESCRIPTION : EGYPT CENTER FOR RESEARCH AND REGENERATIVE MEDICINE (ECRRM) IS ONE OF THE RESEARCH UNITS ASSOCIATED WITH THE MINISTRY OF DEFENSE. IT WAS ESTABLISHED BY A PRESIDENTIAL DECREE IN 2017 AND HAS A LEGAL ENTITY AFFILIATED WITH THE MINISTRY OF DEFENSE. THE CENTER WILL INITIATE THE EGYPTIAN GENOME REFERENCE PROJECT, UPON THE PRESIDENT AUTHORIZATION, IN COLLABORATION WITH THE ACADEMY OF SCIENTIFIC RESEARCH AND TECHNOLOGY AND THE MINISTRY OF HIGHER EDUCATION, TO HELP IN THE OVERALL ENHANCEMENT OF THE GENERAL HEALTH CARE IN EGYPT.
NAME_FOR_TABLE : the-egypt-genome-project
Name : The Egypt Genome Project
Link : Here

Uganda Genome Resource

BIOBANK&COHORT : Uganda Genome Resource
ABBREVIATION : UGR
CONTINENT : AFRICA
REGION : Uganda
ANCESTRY : AFR
SAMPLE SIZE : ~6k
NOTE : (MedicalResearchCouncil (MRC)/UgandaVirusResearchInstitute (UVRI) and LSHTM Uganda Research Unit)
CITATION : Gurdasani, D., Carstensen, T., Fatumo, S., Chen, G., Franklin, C. S., Prado-Martinez, J., ... & Sandhu, M. S. (2019). Uganda genome resource enables insights into population history and genomic discovery in Africa. Cell, 179(4), 984-1002.
URL : https://www. lshtm.ac.uk/research/units/mrc-uganda
DESCRIPTION : The IndiGenomes resource encompasses the genomic data from over 1000 whole genome sequences sequenced from across India as part of the IndiGen programme and represents diverse geographies and ethnicities. The resource provides access to over 55 million genetic variants comprising of single nucleotide variants and indels. The variants are systematically annotated according to the recent Genome Reference Consortium Human Build 38 (GRCh38). Clinically relevant annotations as well as allele frequencies from global populations have also been integrated.
NAME_FOR_TABLE : uganda-genome-resource
Name : Uganda Genome Resource
Link : Here

AMERICA

All of Us

BIOBANK&COHORT : All of Us
ABBREVIATION : AoU
CONTINENT : AMERICA
REGION : U.S.
ANCESTRY : EUR, AFR, AMR, EAS, SAS, WAS
SAMPLE SIZE : ~413k
WGS/WES : ~245K
CITATION : Investigators, A. U. R. P. (2019). The “All of Us” research program. New England Journal of Medicine, 381(7), 668-676.
CITATION : Bick, A. G., Metcalf, G. A., Mayo, K. R., Lichtenstein, L., Rura, S., Carroll, R. J., ... & Denny, J. C. (2024). Genomic data in the All of Us research program. Nature.
URL : https://precisionhealth.umich.edu/our-research/michigangenomics/
DESCRIPTION : The All of Us Research Program is a historic effort to collect and study data from one million or more people living in the United States. The goal of the program is better health for all of us. The program began national enrollment in 2018 and is expected to last at least 10 years.
NAME_FOR_TABLE : all-of-us
Name : All of Us
Link : Here

BioMe

BIOBANK&COHORT : BioMe
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~32k
CITATION : Roden, D. M., Pulley, J. M., Basford, M. A., Bernard, G. R., Clayton, E. W., Balser, J. R., & Masys, D. R. (2008). Development of a large‐scale de‐identified DNA biobank to enable personalized medicine. Clinical Pharmacology & Therapeutics, 84(3), 362-369.
URL : https://www.vumc.org/dbmi/biovu
DESCRIPTION : The Institute for Personalized Medicine at the Icahn School of Medicine at Mount Sinai is leading the movement toward diagnosis and classification of disease according to the patient’s molecular profile. This approach accommodates differences at all possible levels of exposure (genome, environment, and lifestyle) and at all stages of the process, from prevention to post-treatment follow-up. At the center of this effort is BioMe, an electronic medical record-linked biobank that enables researchers to rapidly and efficiently conduct genetic, epidemiologic, molecular, and genomic studies on large collections of research specimens linked with medical information.
NAME_FOR_TABLE : biome
Name : BioMe
Link : Here

BioPortal

BIOBANK&COHORT : BioPortal
CONTINENT : AMERICA
REGION : Canada
URL : https://www.mcgill.ca/genepi/bioportal
DESCRIPTION : BioPortal is a unique research platform at the Jewish General Hospital (JGH)/Lady Davis Institute in Montreal built in partnership with the CERC Chair in Genomic Medicine at McGill.
NAME_FOR_TABLE : bioportal
Name : BioPortal
Link : Here

BioVU

BIOBANK&COHORT : BioVU
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~120k
URL : https://bbofa.org/
DESCRIPTION : Planning for BioVU began in mid-2004 and the first samples were collected in February 2007. Prior to collecting DNA samples, all aspects of the BioVU project were extensively tested. BioVU now accrues 500-1000 samples per week, totaling more than 275,000 DNA samples as of January 2022. Vanderbilt clinic patients may sign the BioVU Consent Form if they wish to donate their excess blood samples, or not sign the form if they do not wish to participate.
NAME_FOR_TABLE : biovu
Name : BioVU
Link : Here

Biobank of the Americas

BIOBANK&COHORT : Biobank of the Americas
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~20k
URL : https://www.galatea.bio/#main-biobank
DESCRIPTION : Biobank consented samples with associated clinical data from diverse populations from throughout the United States and Latin America via healthcare and biopharma partnerships.
NAME_FOR_TABLE : biobank-of-the-americas
Name : Biobank of the Americas
Link : Here

CARTaGENE biobank

BIOBANK&COHORT : CARTaGENE biobank
CONTINENT : AMERICA
REGION : Canada
SAMPLE SIZE : ~30K
Array : ~30K
WGS/WES : ~2K
Transcriptome : ~0.9K
DATA ACCESS : https://cartagene.qc.ca/en/researchers.html
CITATION : Awadalla, P., Boileau, C., Payette, Y., Idaghdour, Y., Goulet, J. P., Knoppers, B., ... & Laberge, C. (2013). Cohort profile of the CARTaGENE study: Quebec’s population-based biobank for public health and personalized genomics. International journal of epidemiology, 42(5), 1285-1299.
URL : https://cartagene.qc.ca/en/
DESCRIPTION : CARTaGENE is a public research platform of the CHU Sainte-Justine aiming to accelerate health research. CARTaGENE is made up of both biological samples and data on the health and lifestyle of 43,000 Quebec men and women between the ages of 40 and 69 at recruitment.
NAME_FOR_TABLE : cartagene-biobank
Name : CARTaGENE biobank
Link : Here

CanPath - Ontario Health Study

BIOBANK&COHORT : CanPath - Ontario Health Study
CONTINENT : AMERICA
REGION : Canada
SAMPLE SIZE : ~7k
CITATION : Kirsh, V. A., Skead, K., McDonald, K., Kreiger, N., Little, J., Menard, K., ... & Awadalla, P. (2022). Cohort Profile: The Ontario Health Study (OHS). International Journal of Epidemiology.
URL : https://canpath.ca/cohort/ontario-health-study/
DESCRIPTION : The Ontario Health Study (OHS) is a resource for investigating the ways in which lifestyle, the environment and genetics affect people’s health. It is one of the regional cohorts that collectively form the Canadian Partnership for Tomorrow’s Health (CanPath)—a pan-Canadian cohort with >330 000 participants. The linking of Canada’s rich collection of administrative health data with the cohort’s data represents a powerful means to disseminate high-quality, timely data.
NAME_FOR_TABLE : canpath-ontario-health-study
Name : CanPath - Ontario Health Study
Link : Here

Colorado Center for Personalized Medicine

BIOBANK&COHORT : Colorado Center for Personalized Medicine
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~34k
URL : https://medschool.cuanschutz.edu/cobiobank
DESCRIPTION : Established in 2014 as a partnership between UCHealth and University of Colorado Anschutz Medical Campus, the Colorado Center for Personalized Medicine (CCPM) brings together multiple disciplines and institutions to uncover advancements in genomics that can improve diagnosis and treatment of disease, and identify more tailored approaches to population health management.To facilitate discoveries in personalized medicine, CCPM has created a Biobank that aims to be one of the largest academic medicine biospecimen repositories in the mountain and midwest regions of the U.S. The CCPM Biobank is able to link biospecimens and genotype information with patient health information from electronic medical records in an enterprise data warehouse (Health Data Compass) to support a broad range of research, operational, and clinical quality improvement agendas.
NAME_FOR_TABLE : colorado-center-for-personalized-medicine
Name : Colorado Center for Personalized Medicine
Link : Here

Massachusetts General Brigham Biobank

BIOBANK&COHORT : Massachusetts General Brigham Biobank
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~26K
CITATION : Boutin, N. T., Schecter, S. B., Perez, E. F., Tchamitchian, N. S., Cerretani, X. R., Gainer, V. S., ... & Smoller, J. W. (2022). The Evolution of a Large Biobank at Mass General Brigham. Journal of Personalized Medicine, 12(8), 1323.
CITATION : Castro, V. M., Gainer, V., Wattanasin, N., Benoit, B., Cagan, A., Ghosh, B., ... & Murphy, S. N. (2022). The Mass General Brigham Biobank Portal: an i2b2-based data repository linking disparate and high-dimensional patient data to support multimodal analytics. Journal of the American Medical Informatics Association, 29(4), 643-651.
URL : https://www.massgeneralbrigham.org/en/research-and-innovation/participate-in-research/biobank
DESCRIPTION : The Mass General Brigham Biobank is a large research program designed to help researchers understand how people’s health is affected by their genes, lifestyle, and environment. By participating in the Mass General Brigham Biobank, you can help us better understand, treat, and even prevent the diseases that might affect your health and the health of future generations.
NAME_FOR_TABLE : massachusetts-general-brigham-biobank
Name : Massachusetts General Brigham Biobank
Link : Here

Mexico City Prospective Study

BIOBANK&COHORT : Mexico City Prospective Study
CONTINENT : AMERICA
REGION : Mexico
SAMPLE SIZE : ~150k
CITATION : Ziyatdinov, A., Torres, J., Alegre-Diaz, J., Backman, J., Mbatchou, J., Turner, M., ... & Tapia-Conyer, R. (2022). Genotyping, sequencing and analysis of 140,000 adults from the Mexico City Prospective Study. bioRxiv.
URL : https://www.ctsu.ox.ac.uk/research/prospective-blood-based-study-of-150-000-individuals-in-mexico
DESCRIPTION : Between 1998 and 2004, CTSU, in collaboration with the Mexican Ministry of Health, established a study in Mexico City, in which over 150,000 middle-aged adults (including 100,000 women and 50,000 men) provided information about their lifestyle and disease history, had physical measurements recorded (including weight, waist and hip circumference, blood pressure) and had a blood sample taken.
NAME_FOR_TABLE : mexico-city-prospective-study
Name : Mexico City Prospective Study
Link : Here

Michigan Genomics Initiative

BIOBANK&COHORT : Michigan Genomics Initiative
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~55k
CITATION : Zawistowski, M., Fritsche, L. G., Pandit, A., Vanderwerff, B., Patil, S., Scmidt, E. M., ... & Zoellner, S. (2021). The Michigan Genomics Initiative: a biobank linking genotypes and electronic clinical records in Michigan Medicine patients. medRxiv.
URL : https://pmbb.med.upenn.edu/
DESCRIPTION : The Michigan Genomics Initiative (MGI) is a collaborative research effort among physicians, researchers, and patients at the University of Michigan (U-M) with the goal of combining patient electronic health record (EHR) data with corresponding genetic data to gain novel biomedical insights. There are currently ~84K consented participants through the MGI and partner studies and the addition of ~10K new participants per year is anticipated. Currently, all MGI participants with available genetic data have received care at the University of Michigan Health System.
NAME_FOR_TABLE : michigan-genomics-initiative
Name : Michigan Genomics Initiative
Link : Here

Million Veteran Program

BIOBANK&COHORT : Million Veteran Program
ABBREVIATION : MVP
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~900k
CITATION : Gaziano, J. M., Concato, J., Brophy, M., Fiore, L., Pyarajan, S., Breeling, J., ... & O'Leary, T. J. (2016). Million Veteran Program: A mega-biobank to study genetic influences on health and disease. Journal of clinical epidemiology, 70, 214-223.
CITATION : Hunter-Zinck, H., Shi, Y., Li, M., Gorman, B. R., Ji, S. G., Sun, N., ... & Pyarajan, S. (2020). Genotyping array design and data quality control in the Million Veteran Program. The American Journal of Human Genetics, 106(4), 535-548.
URL : https://www.mvp.va.gov/pwa/
DESCRIPTION : The Million Veteran Program (MVP) is a national research program to learn how genes, lifestyle, and military exposures affect health and illness. Since launching in 2011, over 900,000 Veteran partners have joined one of the world's largest programs on genetics and health.
NAME_FOR_TABLE : million-veteran-program
Name : Million Veteran Program
Link : Here

Penn Medicine Biobank

BIOBANK&COHORT : Penn Medicine Biobank
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~40k
URL : https://www.uclahealth.org/precision-health/programs/ucla-atlas-community-health-initiative/ucla-atlas-precision-health-biobank
DESCRIPTION : The Penn Medicine BioBank (PMBB) is a research program created to study the causes and treatments of many diseases. Any Penn Medicine patient (age 18 and up) can sign up. The PMBB is a collection of biological samples, such as blood or tissue, that are donated by patient volunteers. These samples are then connected to clinical information, such as diseases or lab measures. These data are then used by researchers to discover new ways to detect, treat, and maybe even prevent or cure disease. Some of these studies may be about how genes affect health and disease. Other studies look at how genes affect response to medicines.
NAME_FOR_TABLE : penn-medicine-biobank
Name : Penn Medicine Biobank
Link : Here

The Canadian Longitudinal Study on Aging

BIOBANK&COHORT : The Canadian Longitudinal Study on Aging
ABBREVIATION : CLSA
CONTINENT : AMERICA
REGION : Canada
SAMPLE SIZE : ~50k
CITATION : Raina, P. S., Wolfson, C., Kirkland, S. A., Griffith, L. E., Oremus, M., Patterson, C., ... & Brazil, K. (2009). The Canadian longitudinal study on aging (CLSA). Canadian Journal on Aging/La Revue canadienne du vieillissement, 28(3), 221-229.
URL : https://www.clsa-elcv.ca/
DESCRIPTION : The Canadian Longitudinal Study on Aging (CLSA) is a large, national, long-term study that will follow approximately 50,000 individuals who are between the ages of 45 and 85 when recruited, for at least 20 years. The CLSA will collect information on the changing biological, medical, psychological, social, lifestyle and economic aspects of people’s lives. These factors will be studied to understand how, individually and in combination, they have an impact in both maintaining health and in the development of disease and disability as people age.
NAME_FOR_TABLE : the-canadian-longitudinal-study-on-aging
Name : The Canadian Longitudinal Study on Aging
Link : Here

UCLA Precision Health Biobank

BIOBANK&COHORT : UCLA Precision Health Biobank
CONTINENT : AMERICA
REGION : U.S.
SAMPLE SIZE : ~27k
CITATION : Johnson, R. D., Ding, Y., Bhattacharya, A., Chiu, A., Lajonchere, C., Geschwind, D. H., & Pasaniuc, B. (2022). The UCLA ATLAS Community Health Initiative: promoting precision health research in a diverse biobank. medRxiv.
URL : https://icahn.mssm.edu/research/ipm/programs/biome-biobank
DESCRIPTION : The UCLA ATLAS Precision Health Biobank, under the supervision of the Translational Pathology Core Laboratory (TCPL), collects biological samples from patients who have consented to participate in the UCLA ATLAS Community Health Initiative. As a collaborator with UCLA ATLAS Community Health Initiative, the UCLA ATLAS Precision Health Biobank manages the collection and distribution of biological samples by removing the personally identifiable information.
NAME_FOR_TABLE : ucla-precision-health-biobank
Name : UCLA Precision Health Biobank
Link : Here

ASIA

BioBank Japan

BIOBANK&COHORT : BioBank Japan
ABBREVIATION : BBJ
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~270k
Array : ~270K
WGS/WES : ~14K
Metabolome : ~4K
CITATION : Nagai, A., Hirata, M., Kamatani, Y., Muto, K., Matsuda, K., Kiyohara, Y., ... & Kubo, M. (2017). Overview of the BioBank Japan Project: study design and profile. Journal of epidemiology, 27(Supplement_III), S2-S8.
URL : https://biobankjp.org/
DESCRIPTION : In 2003, BioBank Japan (BBJ) started developing one of the world’s largest disease biobanks, creating a foundation for research aimed at achieving medical care tailored to the individual traits of each patient. From a total of 260,000 patients representing 440,000 cases of 51 primarily multifactorial (common) diseases, BBJ has collected DNA, serum, medical records (clinical information), etc. with their consent. No less than 5,800 items of screened information are available for research, including the patients’ survival information, with 95% of the patients tracked over an average of 10 years. In addition to large-scale genomic analyses, omics analyses including whole genome sequencing and metabolome/proteome analyses have been performed on the DNA, serum and other biological samples collected, producing significant research findings. The genomic information acquired through the analyses continues to be used as data. The biological samples and data are widely distributed and used by researchers.
NAME_FOR_TABLE : biobank-japan
Name : BioBank Japan
Link : Here

Born in Guangzhou Cohort Study

BIOBANK&COHORT : Born in Guangzhou Cohort Study
ABBREVIATION : BIGCS
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~50K
WGS/WES : ~4K
STUDY TYPE : trio or duo
CITATION : Qiu, X., Lu, J. H., He, J. R., Lam, K. B. H., Shen, S. Y., Guo, Y., ... & Xia, H. M. (2017). The born in Guangzhou cohort study (BIGCS). European journal of epidemiology, 32, 337-346.
CITATION : Huang, S., Liu, S., Huang, M., He, J. R., Wang, C., Wang, T., ... & Qiu, X. (2024). The Born in Guangzhou Cohort Study enables generational genetic discoveries. Nature, 626(7999), 565-573.
URL : http://www.bigcs.com.cn/en_index.html
DESCRIPTION : The Born in Guangzhou Cohort Study (BIGCS) is a large-scale prospective observational study investigating the role of social, biological and environmental influences on pregnancy and child health and development in an urban setting in southern China.
NAME_FOR_TABLE : born-in-guangzhou-cohort-study
Name : Born in Guangzhou Cohort Study
Link : Here

China Kadoorie Biobank

BIOBANK&COHORT : China Kadoorie Biobank
ABBREVIATION : CKB
CONTINENT : ASIA
REGION : U.K. and China
ANCESTRY : EAS
SAMPLE SIZE : ~512k
NOTE : University of Oxford, BJMU, Peking Union Medical College
CITATION : Chen, Z., Chen, J., Collins, R., Guo, Y., Peto, R., Wu, F., & Li, L. (2011). China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. International journal of epidemiology, 40(6), 1652-1666.
CITATION : Walters, R. G., Millwood, I. Y., Lin, K., Valle, D. S., McDonnell, P., Hacker, A., ... & Chen, Z. (2023). Genotyping and population characteristics of the China Kadoorie Biobank. Cell Genomics, 3(8).
URL : https://www.ckbiobank.org/
DESCRIPTION : The China Kadoorie Biobank is one of the world’s largest prospective cohort studies. A long-term collaboration between the UK and China, it aims to generate reliable evidence about the lifestyle, environmental and genetic determinants of a wide range of common diseases that can inform disease prevention, risk prediction and treatment worldwide.
NAME_FOR_TABLE : china-kadoorie-biobank
Name : China Kadoorie Biobank
Link : Here

Chinese Millionome Database

BIOBANK&COHORT : Chinese Millionome Database
ABBREVIATION : CMDB
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~141k
NOTE : Chinese Academy of Sciences (CAS) and German Max Planck Society (MPG) partner institute for computational biology
CITATION : Li, Z., Jiang, X., Fang, M., Bai, Y., Liu, S., Huang, S., & Jin, X. (2022). CMDB: the comprehensive population genome variation database of China. Nucleic Acids Research.
CITATION : Liu, S., Huang, S., Chen, F., Zhao, L., Yuan, Y., Francis, S. S., ... & Xu, X. (2018). Genomic analyses from non-invasive prenatal testing reveal genetic associations, patterns of viral infections, and Chinese population history. Cell, 175(2), 347-359.
URL : https://db.cngb.org/cmdb/
DESCRIPTION : the largest and the most representative Chinese genome variation database to date. The CMDB database contains 9.04 million single nucleotide variants (SNVs) and the allele frequency information from low-coverage (0.06×–0.1×) WGS data of 141 431 unrelated healthy Chinese individuals.
NAME_FOR_TABLE : chinese-millionome-database
Name : Chinese Millionome Database
Link : Here

Han Chinese Genome Initiative Phase 1 the Han100K Project

BIOBANK&COHORT : Han Chinese Genome Initiative Phase 1 the Han100K Project
ABBREVIATION : Han100K
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~114k
WGS/WES : ~114k
NOTE : PGG.Han ShanghaiTech University; University of Chinese Academy of Sciences
CITATION : Gao, Y., Zhang, C., Yuan, L., Ling, Y., Wang, X., Liu, C., ... & Xu, S. (2020). PGG. Han: the Han Chinese genome database and analysis platform. Nucleic acids research, 48(D1), D971-D976.
URL : https://www.hanchinesegenomes.org/
DESCRIPTION : a reference panel of 114 783 Han Chinese individuals (the Han100K), with whole-genome deep-sequenced or high-density genome-wide single-nucleotide variants (SNVs) genotyped or imputed.
NAME_FOR_TABLE : han-chinese-genome-initiative-phase-1-the-han100k-project
Name : Han Chinese Genome Initiative Phase 1 the Han100K Project
Link : Here

IndiGenomes

BIOBANK&COHORT : IndiGenomes
CONTINENT : ASIA
REGION : India
ANCESTRY : SAS
SAMPLE SIZE : ~10k
CITATION : Jain, A., Bhoyar, R. C., Pandhare, K., Mishra, A., Sharma, D., Imran, M., ... & Sivasubbu, S. (2021). IndiGenomes: a comprehensive resource of genetic variants from over 1000 Indian genomes. Nucleic acids research, 49(D1), D1225-D1232.
URL : http://clingen.igib.res.in/indigen/
DESCRIPTION : The Malaysian Cohort study was initiated in 2005 by the Malaysian government. The top-down approach to this population-based cohort study ensured the allocation of sufficient funding for the project which aimed to recruit 100 000 individuals aged 35–70 years. Participants were recruited from rural and urban areas as well as from various socioeconomic groups. The main objectives of the study were to identify risk factors, to study gene-environment interaction and to discover biomarkers for the early detection of cancers and other diseases.
NAME_FOR_TABLE : indigenomes
Name : IndiGenomes
Link : Here

Korean Genome Project Phase 1

BIOBANK&COHORT : Korean Genome Project Phase 1
ABBREVIATION : KGP Korea1K
CONTINENT : ASIA
REGION : Korea
ANCESTRY : EAS
SAMPLE SIZE : ~1K
WGS/WES : ~1K
CITATION : Jeon, S., Bhak, Y., Choi, Y., Jeon, Y., Kim, S., Jang, J., ... & Bhak, J. (2020). Korean Genome Project: 1094 Korean personal genomes with clinical information. Science advances, 6(22), eaaz7835.
URL : http://koreangenome.org/Main_Page
NAME_FOR_TABLE : korean-genome-project-phase-1
Name : Korean Genome Project Phase 1
Link : Here

Korean Genome Project Phase 2

BIOBANK&COHORT : Korean Genome Project Phase 2
ABBREVIATION : KGP Korea4K
CONTINENT : ASIA
REGION : Korea
ANCESTRY : EAS
SAMPLE SIZE : ~4K
WGS/WES : ~4K
NOTE : Second phase of Korean Genome Project
CITATION : Jeon, S., Choi, H., Jeon, Y., Choi, W. H., Choi, H., An, K., ... & Bhak, J. (2024). Korea4K: whole genome sequences of 4,157 Koreans with 107 phenotypes derived from extensive health check-ups. GigaScience, 13, giae014.
URL : http://koreangenome.org/Korea4K_Genomes
DESCRIPTION : Korea4K is the second phase data release of the Korean Genome Project (KGP).
NAME_FOR_TABLE : korean-genome-project-phase-2
Name : Korean Genome Project Phase 2
Link : Here

National Biobank of Korea

BIOBANK&COHORT : National Biobank of Korea
ABBREVIATION : NBK
CONTINENT : ASIA
REGION : Korea
ANCESTRY : EAS
SAMPLE SIZE : ~210K
DATA ACCESS : https://koges.leelabsg.org/ , https://zenodo.org/record/7042518
CITATION : Cho, S. Y., Hong, E. J., Nam, J. M., Han, B., Chu, C., & Park, O. (2012). Opening of the national biobank of Korea as the infrastructure of future biomedical science in Korea. Osong public health and research perspectives, 3(3), 177-184.
CITATION : Nam, K., Kim, J., & Lee, S. (2022). Genome-wide study on 72,298 individuals in Korean biobank data for 76 traits. Cell Genomics, 100189.
URL : https://nih.go.kr/NIH/cms/content/eng/14/65714_view.html
DESCRIPTION : The NBK is the national control center for the collection, management, and utilization of human bioresources in Korea. And NBK manages KBN, it contributes to the development of policies related to human bioresources, standardization of human bioresource management, and advancement of domestic biobanks through developing and providing support for human bioresource technologies. For guaranteeing the fairness in bioresource distribution and development of an efficient distribution system, the NBK also serves as the human bioresource supply hub that supports national healthcare and medical R&D.
NAME_FOR_TABLE : national-biobank-of-korea
Name : National Biobank of Korea
Link : Here

National Center Biobank Network

BIOBANK&COHORT : National Center Biobank Network
ABBREVIATION : NCBN
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~120K
WGS/WES : ~10K
CITATION : Omae, Yosuke, Yu-ichi Goto, and Katsushi Tokunaga. "National Center Biobank Network." Human Genome Variation 9.1 (2022): 1-6.
URL : https://ncbiobank.org/en/home.php
DESCRIPTION : Six National Centers in Japan conduct specialized medical research under the coordination of the National Center Biobank Network (NCBN) and develop therapeutics to improve and protect national health. They actively collaborate to establish a shared biobank and are developing a structure to facilitate industry-academia-government cooperation regarding bioresources through broad joint research. NCBN strives to promote the success of the National Centers and to create bright future for health and human life.
NAME_FOR_TABLE : national-center-biobank-network
Name : National Center Biobank Network
Link : Here

NyuWa genome resource

BIOBANK&COHORT : NyuWa genome resource
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~3k
WGS/WES : ~3K
NOTE : Health Institute of Biophysics, Chinese Academy of Sciences
CITATION : Zhang, P., Luo, H., Li, Y., Wang, Y., Wang, J., Zheng, Y., ... & Han100K Initiative. (2021). NyuWa Genome resource: a deep whole-genome sequencing-based variation profile and reference panel for the Chinese population. Cell Reports, 37(7), 110017.
URL : http://bigdata.ibp.ac.cn/NyuWa/
DESCRIPTION : NyuWa, or NüWa, is the mother goddess who was the creator of the human population in Chinese mythology. Here we presented the NyuWa genome resource based on high depth (median 26X) WGS of 2,999 Chinese individuals from 23 out of 34 administrative divisions in China. NyuWa Genome Resource present in this website mainly contains two parts as NyuWa Chinese Population Variant Database and NyuWa reference panel server.
NAME_FOR_TABLE : nyuwa-genome-resource
Name : NyuWa genome resource
Link : Here

Qatar Biobank

BIOBANK&COHORT : Qatar Biobank
CONTINENT : ASIA
REGION : Qatar
SAMPLE SIZE : ~80K
CITATION : Al Kuwari, H., Al Thani, A., Al Marri, A., Al Kaabi, A., Abderrahim, H., Afifi, N., ... & Elliott, P. (2015). The Qatar Biobank: background and methods. BMC public health, 15(1), 1-9.
URL : : https://www.qatarbiobank.org.qa/
DESCRIPTION : KoGES, part of the National Biobank of Korea, is a prospective cohort study with a comprehensive range of phenotypic measures and biological samples, such as DNA, serum, plasma, and urine, collected on approximately 210,000 individuals. KoGES includes the community-based Ansan and Ansung study, the urban community-based health examinee study, and the rural community-based cardiovascular disease association study.
NAME_FOR_TABLE : qatar-biobank
Name : Qatar Biobank
Link : Here

Qatar Genome Program

BIOBANK&COHORT : Qatar Genome Program
ABBREVIATION : QGP
CONTINENT : ASIA
REGION : Qatar
SAMPLE SIZE : ~6K
Array : ~6K
NOTE : Part of Qatar Biobank
CITATION : RAZALI, Rozaimi Mohamad, et al. Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes. Nature Communications, 2021, 12.1: 5929.
URL : https://www.qatargenome.org.qa/
NAME_FOR_TABLE : qatar-genome-program
Name : Qatar Genome Program
Link : Here

SG10K_Health

BIOBANK&COHORT : SG10K_Health
ABBREVIATION : SG10K
CONTINENT : ASIA
REGION : Singapore
ANCESTRY : EAS, SAS, SEA
SAMPLE SIZE : ~10k
NOTE : the headline project of the Singapore National Precision Medicine programme (NPM Phase I)
CITATION : Chan, Sock Hoai, et al. "Analysis of clinically relevant variants from ancestrally diverse Asian genomes." Nature communications 13.1 (2022): 1-15.
URL : https://npm.a-star.edu.sg/
DESCRIPTION : SG10K_Health is the headline project of the Singapore National Precision Medicine programme (NPM Phase I). Comprising 10,000 whole-genome sequences from healthy Chinese, Indian, and Malay consented volunteers. SG10K_Health involved a research collaboration across multiple institutions in Singapore, enabling the country to develop the necessary infrastructure and deep capabilities to process, store, and analyse genetic data at the population scale in a safe, secure, and rapid manner. SG10K_Health provides near complete assessment of common genetic variants in Singapore’s three major ethnic groups, which can be used by clinicians to better manage Asian patients with genetic disease and as a control data set to compare against disease studies. Work is ongoing to link the SG10K_Health genomic data to research traits (e.g., height, weight, blood pressure) and clinical records.
NAME_FOR_TABLE : sg10khealth
Name : SG10K_Health
Link : Here

Taiwan Biobank

BIOBANK&COHORT : Taiwan Biobank
ABBREVIATION : TWB
CONTINENT : ASIA
REGION : Taiwan
ANCESTRY : EAS
SAMPLE SIZE : ~150k
Array : ~109K
WGS/WES : ~2K
DATA ACCESS : https://taiwanview.twbiobank.org.tw/data_appl (application required)
CITATION : Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2021). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. medRxiv.
CITATION : Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2022). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. Cell Genomics, 100197.
URL : https://www.twbiobank.org.tw/
DESCRIPTION : The Taiwan Biobank (TWB) is an ongoing prospective study of over 150,000 individuals aged 30-70 recruited from across Taiwan beginning in 2012. A comprehensive list of phenotypes was collected for each consented participant at recruitment and follow-up visits through structured interviews and physical measurements. Biomarkers and genetic data were also generated for all participants from blood and urine samples.
NAME_FOR_TABLE : taiwan-biobank
Name : Taiwan Biobank
Link : Here

Taizhou Imaging Study

BIOBANK&COHORT : Taizhou Imaging Study
ABBREVIATION : TIS
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~1K
Array : ~1K
Metabolome : ~1K
Metagenome : ~1K
Imaging : ~1K
CITATION : Jiang, Y., Cui, M., Tian, W., Zhu, S., Chen, J., Suo, C., ... & Taizhou Imaging Study Group. (2021). Lifestyle, multi‐omics features, and preclinical dementia among Chinese: the Taizhou Imaging Study. Alzheimer's & Dementia, 17(1), 18-28.
URL : https://www.fdtzihs.org.cn/dljs
NAME_FOR_TABLE : taizhou-imaging-study
Name : Taizhou Imaging Study
Link : Here

The China Metabolic Analytics Project

BIOBANK&COHORT : The China Metabolic Analytics Project
ABBREVIATION : ChinaMAP
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~10k
NOTE : Shanghai Jiao Tong University
CITATION : Cao, Y., Li, L., Xu, M., Feng, Z., Sun, X., Lu, J., ... & Wang, W. (2020). The ChinaMAP analytics of deep whole genome sequences in 10,588 individuals. Cell research, 30(9), 717-731.
URL : http://www.mbiobank.com/
DESCRIPTION : The ChinaMAP is based on three large-scale cohorts: The China Noncommunicable Disease Surveillance 2010, a nationally representative study with 150,000 participants; the Risk Evaluation of cAncers in Chinese diabeTic Individuals: a lONgitudinal (REACTION) study with 250,000 participants15 and the Community-based Cardiovascular Risk During Urbanization in Shanghai with 50,000 participants.
NAME_FOR_TABLE : the-china-metabolic-analytics-project
Name : The China Metabolic Analytics Project
Link : Here

The Hisayama Study

BIOBANK&COHORT : The Hisayama Study
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~8K
CITATION : Ninomiya, T. (2018). Japanese legacy cohort studies: the Hisayama Study. Journal of epidemiology, 28(11), 444-451.
URL : https://www.hisayama.med.kyushu-u.ac.jp/en/
DESCRIPTION : The Hisayama Study is a population-based prospective cohort study that has been conducted in the town of Hisayama, Japan since 1961.
NAME_FOR_TABLE : the-hisayama-study
Name : The Hisayama Study
Link : Here

The Japan Prospective Studies Collaboration for Aging and Dementia

BIOBANK&COHORT : The Japan Prospective Studies Collaboration for Aging and Dementia
ABBREVIATION : JPSC-AD
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~11K
CITATION : JPSFC-AD Study Group. (2020). Study design and baseline characteristics of a population-based prospective cohort study of dementia in Japan: the Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD). Environmental health and preventive medicine, 25(1), 64.
URL : https://www.eph.med.kyushu-u.ac.jp/jpsc/en/
DESCRIPTION : Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD） study is a collaborative prospective cohort study of approximately 10,000 elderly people from 8 newly-established community-based dementia cohort studies in Japan, in which the data is prospectively collected by using the pre-specified standardized protocol. The purpose of this study is to evaluate quantitatively environmental and genomic risk factors for dementia in Japanese and to establish effective preventive strategies for dementia, in order to realize healthy aging society.
NAME_FOR_TABLE : the-japan-prospective-studies-collaboration-for-aging-and-dementia
Name : The Japan Prospective Studies Collaboration for Aging and Dementia
Link : Here

The Malaysian Cohort

BIOBANK&COHORT : The Malaysian Cohort
ABBREVIATION : TMC
CONTINENT : ASIA
REGION : Malaysia
SAMPLE SIZE : ~100k
STUDY TYPE : community-dwelling individuals aged 65 years or older at 8 sites of Japan
CITATION : Jamal, R., Syed Zakaria, S. Z., Kamaruddin, M. A., Abd Jalal, N., Ismail, N., Mohd Kamil, N., ... & Malaysian Cohort Study Group. (2015). Cohort profile: The Malaysian Cohort (TMC) project: a prospective study of non-communicable diseases in a multi-ethnic population. International journal of epidemiology, 44(2), 423-431.
URL : https://www.ukm.my/mycohort/ms/
DESCRIPTION : Qatar Biobank, a center within Qatar Foundation, was created in collaboration with Hamad Medical Corporation and the Ministry of Public Health to enable local scientists to conduct medical research on prevalent health issues in Qatar.
NAME_FOR_TABLE : the-malaysian-cohort
Name : The Malaysian Cohort
Link : Here

The Nagahama Study

BIOBANK&COHORT : The Nagahama Study
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~10K
Array : ~9K
WGS/WES : ~2K
Metabolome : ~9K
Proteome : ~2K
CITATION : Setoh, K., & Matsuda, F. (2022). Cohort profile: the Nagahama prospective genome cohort for comprehensive human bioscience (The Nagahama Study). Socio-Life Science and the COVID-19 Outbreak: Public Health and Public Policy, 127-143.
URL : https://zeroji-cohort.com/english/
DESCRIPTION : The Nagahama Primary Prevention Cohort Project is a joint project based on an agreement between Kyoto University Graduate School of Medicine and Nagahama City, Shiga Prefecture, with the cooperation of approximately 10,000 Nagahama residents. In addition, the project conducts follow-up surveys on morbidity and mortality, special tests and surveys on sleep, brain imaging, memory, motor function, skin condition, socioeconomic status, etc., during health checkups and periodic surveys conducted every five years after that. Furthermore, we have completed a multi-omics analysis focusing on genome analysis of approximately 9,000 people (including whole genome sequencing of roughly 2,500 people), comprehensive metabolite analysis of 3-time points, and comprehensive protein analysis of 2,000 people (as of August 2021), and based on these rich and diverse data, we have been searching for health risk Based on these abundant and varied data, we aim to search for health risk factors and elucidate their interactions.
NAME_FOR_TABLE : the-nagahama-study
Name : The Nagahama Study
Link : Here

The STROMICS genome study

BIOBANK&COHORT : The STROMICS genome study
ABBREVIATION : STROMICS
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~10k
Array : ~10K
STUDY TYPE : prospective registry for patients presented to hospitals with acute ischaemic cerebrovascular events with long-term follow-up
CITATION : Cheng, S., Xu, Z., Bian, S., Chen, X., Shi, Y., Li, Y., ... & Wang, Y. (2023). The STROMICS genome study: deep whole-genome sequencing and analysis of 10K Chinese patients with ischemic stroke reveal complex genetic and phenotypic interplay. Cell Discovery, 9(1), 75.
URL : http://www.stromics.org.cn/
DESCRIPTION : The Stroke Omics Atlas (STROMICS) is committed to using multi-omics and clinical big data to achieve accurate diagnosis and treatment for stroke patients, reduce treatment costs, and contribute to the health of the people. Using artificial intelligence and cutting-edge high-throughput omics technologies (genomics, transcriptomics, epigenomics, proteomics, metabolomics, metagenomics, etc.), potential drug targets for stroke can be found on a large scale and with high efficiency, providing strong technical support for clinical transformation. Relying on the China National Clinical Research Center for Neurological Diseases and Center of excellence for Omics Research (CORe), STROMICS has realized the interdisciplinary integration of clinical medicine, bioinformatics, and multi-omics, creating a new paradigm of drug research and development.
NAME_FOR_TABLE : the-stromics-genome-study
Name : The STROMICS genome study
Link : Here

Tohoku Medical Megabank

BIOBANK&COHORT : Tohoku Medical Megabank
ABBREVIATION : TMM
CONTINENT : ASIA
REGION : Japan
ANCESTRY : EAS
SAMPLE SIZE : ~157k
WGS/WES : ~69K
Imaging : ~12K Brain MRI
DATA ACCESS : https://jmorp.megabank.tohoku.ac.jp/
CITATION : Kuriyama, S., Yaegashi, N., Nagami, F., Arai, T., Kawaguchi, Y., Osumi, N., ... & Tohoku Medical Megabank Project Study Group. (2016). The Tohoku medical megabank project: design and mission. Journal of epidemiology, 26(9), 493-511.
URL : https://www.megabank.tohoku.ac.jp/english/
DESCRIPTION : Tohoku University Tohoku Medical Megabank Organization was founded to establish an advanced medical system to foster the reconstruction from the Great East Japan Earthquake. The organization has been developing a biobank that combines medical and genome information during the process of rebuilding the community medical system and supporting health and welfare in the Tohoku area. The information from the brand-new biobank will create a new medical system, and, based on the findings of its analysis, the organization aims to attract more medical practitioners from all over the country to the area, promote industry-academic partnerships, create employment in related fields, and restore the medical system in Tohoku.
NAME_FOR_TABLE : tohoku-medical-megabank
Name : Tohoku Medical Megabank
Link : Here

Westlake BioBank for Chinese

BIOBANK&COHORT : Westlake BioBank for Chinese
ABBREVIATION : WBBC
CONTINENT : ASIA
REGION : China
ANCESTRY : EAS
SAMPLE SIZE : ~14k
Array : ~6K
WGS/WES : ~4K
NOTE : Westlake University
CITATION : Cong, P. K., Bai, W. Y., Li, J. C., Yang, M. Y., Khederzadeh, S., Gai, S. R., ... & Zheng, H. F. (2022). Genomic analyses of 10,376 individuals in the Westlake BioBank for Chinese (WBBC) pilot project. Nature Communications, 13(1), 1-15.
CITATION : Zhu, X. W., Liu, K. Q., Wang, P. Y., Liu, J. Q., Chen, J. Y., Xu, X. J., ... & Zheng, H. F. (2021). Cohort profile: the Westlake BioBank for Chinese (WBBC) pilot project. BMJ open, 11(6), e045564.
URL : https://wbbc.westlake.edu.cn/
DESCRIPTION : The Westlake BioBank for Chinese (WBBC) cohort is a population-based prospective study with its major purpose to better understand the effect of genetic and environmental factors on growth and development from youngster to elderly. The dataset comprises a wide range of demographics and anthropometric measures, serological tests, physical activity, sleep quality, age at menarche and bone mineral density. WBBC is designed as a prospective cohort study and will recruit at least 100,000 Chinese samples. The pilot project of WBBC has recruited a total of 14,726 participants (4,751 males and 9,975 females) and the baseline survey was carried out from 2017 to 2019.
NAME_FOR_TABLE : westlake-biobank-for-chinese
Name : Westlake BioBank for Chinese
Link : Here

EUROPE

Biobank Graz

BIOBANK&COHORT : Biobank Graz
CONTINENT : EUROPE
REGION : Austria
SAMPLE SIZE : ~1200k
CITATION : Huppertz, B., Bayer, M., Macheiner, T., & Sargsyan, K. (2016). Biobank Graz: the hub for innovative biomedical research. Open journal of bioresources, 3(1).
URL : https://biobank.medunigraz.at/en/?link=http%3A%2F%2F169.254.169.254%2Flatest%2Fmeta-data%2F&cHash=3b3a94b34935e2b8509a838b4a34b0eb
DESCRIPTION : Biobank Graz is one of the largest and most well-known clinical biobanks in the world. Around 20 million individual specimens of body fluids and human tissue are stored here. Biobank Graz allows access to these specimens and associated data for scientific research purposes. The common goal is to develop approaches to diagnosing and treating disease.
NAME_FOR_TABLE : biobank-graz
Name : Biobank Graz
Link : Here

Biobank Russia

BIOBANK&COHORT : Biobank Russia
ABBREVIATION : BBRU
CONTINENT : EUROPE
REGION : Russia
ANCESTRY : EUR
SAMPLE SIZE : ~4K
Array : ~4K
STUDY TYPE : prospective
CITATION : Usoltsev, D., Kolosov, N., Rotar, O., Loboda, A., Boyarinova, M., Moguchaya, E., ... & Artomov, M. (2023). Understanding Complex Trait Susceptibilities and Ethnical Diversity in a Sample of 4,145 Russians Through Analysis of Clinical and Genetic Data. bioRxiv, 2023-03.
URL : https://biobank.almazovcentre.ru/
DESCRIPTION : BioBank Russia (BBRU) is a prospective biobank, managed by V. A. Almazov National Medical Research Center.
NAME_FOR_TABLE : biobank-russia
Name : Biobank Russia
Link : Here

East London Genes & Health

BIOBANK&COHORT : East London Genes & Health
CONTINENT : EUROPE
REGION : U.K.
SAMPLE SIZE : ~100k
CITATION : Finer, S., Martin, H. C., Khan, A., Hunt, K. A., MacLaughlin, B., Ahmed, Z., ... & van Heel, D. A. (2020). Cohort Profile: East London Genes & Health (ELGH), a community-based population genomics and health study in British Bangladeshi and British Pakistani people. International journal of epidemiology, 49(1), 20-21i.
URL : https://www.genesandhealth.org/
DESCRIPTION : Genes & Health is a huge long-term study of 100,000 people of Bangladeshi and Pakistani origin. We will link genes with health records, to study disease and treatments. Some volunteers may be invited for further studies. We are inviting volunteers to take part in two regions of the UK: East London (East London Genes & Health) and Bradford (Bradford Genes & Health).
NAME_FOR_TABLE : east-london-genes-health
Name : East London Genes & Health
Link : Here

Estonian Biobank

BIOBANK&COHORT : Estonian Biobank
CONTINENT : EUROPE
REGION : Estonia
ANCESTRY : EUR
SAMPLE SIZE : ~200k
CITATION : Leitsalu, L., Haller, T., Esko, T., Tammesoo, M. L., Alavere, H., Snieder, H., ... & Metspalu, A. (2015). Cohort profile: Estonian biobank of the Estonian genome center, university of Tartu. International journal of epidemiology, 44(4), 1137-1147.
URL : https://genomics.ut.ee/en/content/estonian-biobank
DESCRIPTION : The Estonian Biobank has established a population-based biobank of Estonia with a current cohort size of more than 200,000 individuals (genotyped with genome-wide arrays), reflecting the age, sex and geographical distribution of the adult Estonian population. Considering the fact that about 20% of Estonia's adult population has joined the programme, it is indeed a database that is very important for the development of medical science both domestically and internationally.
NAME_FOR_TABLE : estonian-biobank
Name : Estonian Biobank
Link : Here

Fenland Study

BIOBANK&COHORT : Fenland Study
CONTINENT : EUROPE
REGION : U.K.
SAMPLE SIZE : ~12k
CITATION : MRC Epidemiology Unit, University of Cambridge. Fenland Study. [Internet]. Cambridge (UK): MRC Epidemiology Unit; 2017; [cited 2017 July 8]. Available from: http://www.mrc-epid.cam.ac.uk/research/studies/fenland/.
URL : https://www.mrc-epid.cam.ac.uk/research/studies/fenland/
DESCRIPTION : The Fenland Study investigates the interaction between environmental and genetic factors in determining obesity, type 2 diabetes, and related metabolic disorders. These conditions are a considerable public health concern, but their causes and factors that predict who will be affected by them are not completely understood. What makes the Fenland Study unique is the level of detail it collects about the health and lifestyle of participants, and the objective measurement techniques used in the screening. The first phase of the Fenland Study is now complete, and we are now inviting participants who attended an initial Fenland Study visit between 2005 and 2015 to return for a second visit in Phase 2.
NAME_FOR_TABLE : fenland-study
Name : Fenland Study
Link : Here

FinnGen

BIOBANK&COHORT : FinnGen
CONTINENT : EUROPE
REGION : Finland
ANCESTRY : EUR
SAMPLE SIZE : ~500k
CITATION : Kurki, M. I., Karjalainen, J., Palta, P., Sipilä, T. P., Kristiansson, K., Donner, K., ... & Nelis, M. (2022). FinnGen: Unique genetic insights from combining isolated population and national health register data. medRxiv.
URL : https://www.finngen.fi/en
DESCRIPTION : FinnGen study launched in Finland in the autumn of 2017 is a unique study that combines genome information with digital health care data. The FinnGen study is an unprecedented global research project representing one of the largest studies of this type. Project aims to improve human health through genetic research, and ultimately identify new therapeutic targets and diagnostics for treating numerous diseases. The collaborative nature of the project is exceptional compare to many ongoing studies, and all the partners are working closely together to ensure appropriate transparency, data security and ownership.
NAME_FOR_TABLE : finngen
Name : FinnGen
Link : Here

Generation Scotland

BIOBANK&COHORT : Generation Scotland
CONTINENT : EUROPE
REGION : Scotland
SAMPLE SIZE : ~24k
CITATION : Smith, B. H., Campbell, A., Linksted, P., Fitzpatrick, B., Jackson, C., Kerr, S. M., ... & Morris, A. D. (2013). Cohort Profile: Generation Scotland: Scottish Family Health Study (GS: SFHS). The study, its participants and their potential for genetic research on health and illness. International journal of epidemiology, 42(3), 689-700.
URL : https://www.ed.ac.uk/generation-scotland
DESCRIPTION : Generation Scotland is a research study looking at the health and well-being of volunteers and their families. Generation Scotland combines responses to questionnaires of health and well-being from birth through life. We combine this with NHS health records and innovative laboratory science to understand health trajectories. We work closely with researchers and our volunteers to create a rich evidence base for understanding health. Through this rigorous, ethical and safe approach to research, we seek to enable meaningful change in public health.
NAME_FOR_TABLE : generation-scotland
Name : Generation Scotland
Link : Here

INTERVAL Study

BIOBANK&COHORT : INTERVAL Study
CONTINENT : EUROPE
REGION : U.K.
SAMPLE SIZE : ~50k
CITATION : Moore, C., Bolton, T., Walker, M., Kaptoge, S., Allen, D., Daynes, M., ... & Thompson, S. G. (2016). Recruitment and representativeness of blood donors in the INTERVAL randomised trial assessing varying inter-donation intervals. Trials, 17(1), 1-12.
URL : https://www.intervalstudy.org.uk/
DESCRIPTION : Between June 2012 and June 2014, the INTERVAL study recruited about 25,000 men and about 25,000 women at NHS Blood and Transplant (NSHBT) blood donation centres across England. During the study participants are asked to give blood either at usual donation intervals or more frequently. Men donate every 12, 10 or 8 weeks and women every 16, 14 or 12 weeks.
NAME_FOR_TABLE : interval-study
Name : INTERVAL Study
Link : Here

Lifelines

BIOBANK&COHORT : Lifelines
CONTINENT : EUROPE
REGION : Netherlands
SAMPLE SIZE : ~167k
CITATION : Scholtens, S., Smidt, N., Swertz, M. A., Bakker, S. J., Dotinga, A., Vonk, J. M., ... & Stolk, R. P. (2015). Cohort Profile: LifeLines, a three-generation cohort study and biobank. International journal of epidemiology, 44(4), 1172-1180.
URL : https://www.lifelines.nl/researcher
DESCRIPTION : Lifelines is a large, multigenerational cohort study that includes over 167,000 participants (10%) from the northern population of the Netherlands. We included participants from three generations, who are followed for at least 30 years, to obtain insight into healthy ageing. The aim of Lifelines is to be a resource for the national and international scientific community.
NAME_FOR_TABLE : lifelines
Name : Lifelines
Link : Here

The International Agency for Research on Cancer (IARC) Biobank

BIOBANK&COHORT : The International Agency for Research on Cancer (IARC) Biobank
ABBREVIATION : IARC
CONTINENT : EUROPE
REGION : France
SAMPLE SIZE : ~560k
CITATION : Mendy, M., Caboux, E., Wild, C. P., & IARC Biobank Steering Committee Members. (2019). Centralization of the IARC biobank: combining multiple sample collections into a common platform. Biopreservation and Biobanking, 17(5), 433-443.
URL : https://ibb.iarc.fr/
DESCRIPTION : The IARC BioBank (IBB) is one of the largest, most varied and richest International collections of samples in the world. The Biobank is publicly funded, (approximately 60% of its budget is provided by IARC Participating States through the regular budget and the remainder is from research grants) and hosts over 50 different studies, led or coordinated by IARC scientists. The IBB contains both population-based collections from research projects focusing on gene-environment interactions (as in the European Prospective Investigation into Cancer and Nutrition (EPIC) study) and disease-based collections which focus on biomarkers (as in the International Head and Neck Cancer Epidemiology (INHANCE)). Study designs include case-series, prevalence studies, case-control and cohort studies, etc. The IBB contains 5.1 million biological samples from 562,000 individuals. 4 million of the samples are from the EPIC study (over 370,000 individuals) and about one million samples from other collections (close to 200,000 individuals). Most of the samples are body fluids, including plasma, serum and urine as well as extracted DNA samples.
NAME_FOR_TABLE : the-international-agency-for-research-on-cancer-iarc-biobank
Name : The International Agency for Research on Cancer (IARC) Biobank
Link : Here

The Trøndelag Health Study

BIOBANK&COHORT : The Trøndelag Health Study
ABBREVIATION : HUNT
CONTINENT : EUROPE
REGION : Norway
SAMPLE SIZE : ~229k
Array : ~88K
WGS/WES : ~2K
STUDY TYPE : population-based prospective Norwegian cohort
CITATION : Brumpton, B. M., Graham, S., Surakka, I., Skogholt, A. H., Løset, M., Fritsche, L. G., ... & Willer, C. J. (2022). The HUNT Study: a population-based cohort for genetic research. Cell Genomics, 2(10), 100193.
URL : https://www.ntnu.edu/hunt/hunt-biobank
DESCRIPTION : HUNT Biobank is an established and modern research biobank with high-technology equipment for storage, analysis, sample handling and delivery of samples. Our samples satisfy high quality standards and are stored in accordance with the Data Inspectorates laws and regulations. HUNT Biobank engages in sample handling from The Nord-Trøndelag Health Study (HUNT), Cohort of Norway (CONOR), and can receive samples from other researchers and research projects for storage, analysis and processing of DNA. We do not store samples from private individuals.
NAME_FOR_TABLE : the-trndelag-health-study
Name : The Trøndelag Health Study
Link : Here

UK Biobank

BIOBANK&COHORT : UK Biobank
ABBREVIATION : UKB
CONTINENT : EUROPE
REGION : U.K.
ANCESTRY : EUR
SAMPLE SIZE : ~500k
Array : ~500K
WGS/WES : ~500k
Metabolome : ~120K
Proteome : ~54K
Imaging : ~100K multimodal
CITATION : Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., ... & Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203-209.
URL : https://www.ukbiobank.ac.uk/
DESCRIPTION : UK Biobank is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants. The database is regularly augmented with additional data and is globally accessible to approved researchers undertaking vital research into the most common and life-threatening diseases. It is a major contributor to the advancement of modern medicine and treatment and has enabled several scientific discoveries that improve human health.
NAME_FOR_TABLE : uk-biobank
Name : UK Biobank
Link : Here

deCODE Genetics

BIOBANK&COHORT : deCODE Genetics
CONTINENT : EUROPE
REGION : Iceland
SAMPLE SIZE : ~250k
URL : https://www.decode.com/
DESCRIPTION : deCODE leads the world in the discovery of genetic risk factors for common diseases. Our gene discovery engine is driven by our unique approach and resources, including detailed genetic and medical information on some 500,000 individuals from around the globe taking part in our discovery work and proprietary statistical algorithms and informatics tools for gathering, analyzing, visualizing and storing large amounts of data.
NAME_FOR_TABLE : decode-genetics
Name : deCODE Genetics
Link : Here

OCIENIA

QIMR Berghofer - QIMR Biobank

BIOBANK&COHORT : QIMR Berghofer - QIMR Biobank
ABBREVIATION : QSkin and GenEpi
CONTINENT : OCIENIA
REGION : Australia
SAMPLE SIZE : ~17k
URL : https://genepi.qimr.edu.au/
NAME_FOR_TABLE : qimr-berghofer-qimr-biobank
Name : QIMR Berghofer - QIMR Biobank
Link : Here