Skip to content

Biobanks ASIA

Curation of ASIA — listings under the Biobanks tab.

Summary Table

Click a column header to sort the table.

NAME MAIN ANCESTRY PARTICIPANTS CONTINENT SAMPLE SIZE URL
BioBank Japan EAS Japanese adults (hospital and population network) ASIA ~270k https://biobankjp.org/
Born in Guangzhou Cohort Study EAS Han Chinese mothers and infants (Guangzhou birth cohort) ASIA ~50K http://www.bigcs.com.cn/en_index.html
Cebu Longitudinal Health and Nutrition Survey EAS Filipino mothers and offspring (Cebu) ASIA ~3k index offspring cohort https://www.cpc.unc.edu/projects/clhns
China Kadoorie Biobank EAS Chinese adults (10 regions; prospective cohort) ASIA ~512k https://www.ckbiobank.org/
Chinese Millionome Database EAS Chinese individuals (aggregated genome database) ASIA ~141k https://db.cngb.org/cmdb/
Han Chinese Genome Initiative Phase 1 the Han100K Project EAS Han Chinese adults (reference genomes) ASIA ~114k https://www.hanchinesegenomes.org/
IndiGenomes SAS Indian subcontinent individuals (genome resource) ASIA ~10k http://clingen.igib.res.in/indigen/
KoGES EAS Korean adults (population-based sub-cohorts) ASIA ~210k (across sub-cohorts) https://koges.leelabsg.org/about
Korean Genome Project Phase 1 EAS Korean individuals (reference genomes) ASIA ~1K https://koreangenome.org/Main_Page
Korean Genome Project Phase 2 EAS Korean individuals (Korea4K reference) ASIA ~4K http://koreangenome.org/Korea4K_Genomes
National Biobank of Korea EAS Korean participants (linked with KoGES / national program) ASIA ~210K https://nih.go.kr/NIH/cms/content/eng/14/65714_view.html
National Center Biobank Network EAS Japanese patients (national hospital biobank network) ASIA ~120K https://ncbiobank.org/en/home.php
NyuWa genome resource EAS Han Chinese individuals (NyuWa reference genomes) ASIA ~3k http://bigdata.ibp.ac.cn/NyuWa/
Qatar Biobank Middle Eastern (Qatari Arab) Consenting adults in Qatar (national biobank) ASIA ~80K https://www.qatarbiobank.org.qa/
Qatar Genome Program Middle Eastern (Qatari Arab) Qatari individuals (national sequencing; linked with Qatar Biobank) ASIA ~6K https://www.qatargenome.org.qa/
SG10K_Health EAS,SAS Singapore residents (Chinese, Malay, Indian, other) ASIA ~10k https://npm.a-star.edu.sg/
Taiwan Biobank EAS Han Chinese adults (general-population recruitment) ASIA ~150k https://www.twbiobank.org.tw/
Taiwan Precision Medicine Initiative EAS Han Chinese adults (precision medicine initiative) ASIA ~460k https://tpmi.ibms.sinica.edu.tw/
Taizhou Imaging Study EAS Han Chinese adults (Taizhou imaging cohort) ASIA ~1K https://www.fdtzihs.org.cn/dljs
The China Metabolic Analytics Project EAS Chinese adults (metabolic disease cohort) ASIA ~10k http://www.mbiobank.com/
The Hisayama Study EAS Japanese adults (Hisayama town) ASIA ~8K https://www.hisayama.med.kyushu-u.ac.jp/en/
The Japan COVID-19 Task Force study EAS Japanese individuals (COVID-19 host genetics) ASIA ~1.4K https://japan-omics.jp/
The Japan Prospective Studies Collaboration for Aging and Dementia EAS Japanese adults (aging and dementia collaboration) ASIA ~11K https://www.eph.med.kyushu-u.ac.jp/jpsc/en/
The Malaysian Cohort EAS,SAS Malaysian adults (multi-ethnic national cohort) ASIA ~100k https://www.ukm.my/mycohort/ms/
The Nagahama Study EAS Japanese adults (Nagahama City, Shiga) ASIA ~10K https://zeroji-cohort.com/english/
The STROMICS genome study EAS Chinese adults (acute ischemic stroke registry) ASIA ~10k http://www.stromics.org.cn/
Tohoku Medical Megabank EAS Japanese adults (Tōhoku region; megabank) ASIA ~157k https://www.megabank.tohoku.ac.jp/english/
Westlake BioBank for Chinese EAS Han Chinese adults (Westlake biobank) ASIA ~14k https://wbbc.westlake.edu.cn/

BioBank Japan (BBJ)

Biobank / cohort
DESCRIPTION
In 2003, BioBank Japan (BBJ) started developing one of the world’s largest disease biobanks, creating a foundation for research aimed at achieving medical care tailored to the individual traits of each patient. From a total of 260,000 patients representing 440,000 cases of 51 primarily multifactorial (common) diseases, BBJ has collected DNA, serum, medical records (clinical information), etc. with their consent. No less than 5,800 items of screened information are available for research, including the patients’ survival information, with 95% of the patients tracked over an average of 10 years. In addition to large-scale genomic analyses, omics analyses including whole genome sequencing and metabolome/proteome analyses have been performed on the DNA, serum and other biological samples collected, producing significant research findings. The genomic information acquired through the analyses continues to be used as data. The biological samples and data are widely distributed and used by researchers.
URL
https://biobankjp.org/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese adults (hospital and population network)
SAMPLE SIZE
~270k

Born in Guangzhou Cohort Study (BIGCS)

Biobank / cohort
DESCRIPTION
The Born in Guangzhou Cohort Study (BIGCS) is a large-scale prospective observational study investigating the role of social, biological and environmental influences on pregnancy and child health and development in an urban setting in southern China.
URL
http://www.bigcs.com.cn/en_index.html
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese mothers and infants (Guangzhou birth cohort)
SAMPLE SIZE
~50K

Cebu Longitudinal Health and Nutrition Survey (CLHNS)

Biobank / cohort
DESCRIPTION
Long-standing community-based cohort in Metro Cebu with repeated anthropometric, dietary, and health measures from pregnancy/birth through adulthood; genetic and omics data used in developmental and metabolic trait studies.
URL
https://www.cpc.unc.edu/projects/clhns
MAIN ANCESTRY
EAS
PARTICIPANTS
Filipino mothers and offspring (Cebu)
SAMPLE SIZE
~3k index offspring cohort

China Kadoorie Biobank (CKB)

Biobank / cohort
DESCRIPTION
The China Kadoorie Biobank is one of the world’s largest prospective cohort studies. A long-term collaboration between the UK and China, it aims to generate reliable evidence about the lifestyle, environmental and genetic determinants of a wide range of common diseases that can inform disease prevention, risk prediction and treatment worldwide.
URL
https://www.ckbiobank.org/
MAIN ANCESTRY
EAS
PARTICIPANTS
Chinese adults (10 regions; prospective cohort)
SAMPLE SIZE
~512k

Chinese Millionome Database (CMDB)

Biobank / cohort
DESCRIPTION
the largest and the most representative Chinese genome variation database to date. The CMDB database contains 9.04 million single nucleotide variants (SNVs) and the allele frequency information from low-coverage (0.06×–0.1×) WGS data of 141 431 unrelated healthy Chinese individuals.
URL
https://db.cngb.org/cmdb/
MAIN ANCESTRY
EAS
PARTICIPANTS
Chinese individuals (aggregated genome database)
SAMPLE SIZE
~141k

Han Chinese Genome Initiative Phase 1 the Han100K Project (Han100K)

Biobank / cohort
DESCRIPTION
a reference panel of 114 783 Han Chinese individuals (the Han100K), with whole-genome deep-sequenced or high-density genome-wide single-nucleotide variants (SNVs) genotyped or imputed.
URL
https://www.hanchinesegenomes.org/
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese adults (reference genomes)
SAMPLE SIZE
~114k

IndiGenomes

Biobank / cohort
DESCRIPTION
IndiGenomes is a curated resource of genetic variants from 1000+ whole genomes from across India, supporting South Asian reference genomics and downstream association and functional studies.
URL
http://clingen.igib.res.in/indigen/
MAIN ANCESTRY
SAS
PARTICIPANTS
Indian subcontinent individuals (genome resource)
SAMPLE SIZE
~10k

KoGES

Biobank / cohort
DESCRIPTION
Korean Genome and Epidemiology Study: prospective population-based cohorts (Ansan–Ansung, etc.) with biosamples and follow-up; genome-wide data widely used in Korean and trans-ethnic GWAS.
URL
https://koges.leelabsg.org/about
MAIN ANCESTRY
EAS
PARTICIPANTS
Korean adults (population-based sub-cohorts)
SAMPLE SIZE
~210k (across sub-cohorts)

Korean Genome Project Phase 1 (KGP Korea1K)

Biobank / cohort
DESCRIPTION
Reference panel of ~1,000 Korean whole genomes with linked clinical information for population genetics, imputation, and precision-medicine applications.
URL
https://koreangenome.org/Main_Page
MAIN ANCESTRY
EAS
PARTICIPANTS
Korean individuals (reference genomes)
SAMPLE SIZE
~1K

Korean Genome Project Phase 2 (KGP Korea4K)

Biobank / cohort
DESCRIPTION
Korea4K is the second phase data release of the Korean Genome Project (KGP).
URL
http://koreangenome.org/Korea4K_Genomes
MAIN ANCESTRY
EAS
PARTICIPANTS
Korean individuals (Korea4K reference)
SAMPLE SIZE
~4K

National Biobank of Korea (NBK)

Biobank / cohort
DESCRIPTION
The NBK is the national control center for the collection, management, and utilization of human bioresources in Korea. And NBK manages KBN, it contributes to the development of policies related to human bioresources, standardization of human bioresource management, and advancement of domestic biobanks through developing and providing support for human bioresource technologies. For guaranteeing the fairness in bioresource distribution and development of an efficient distribution system, the NBK also serves as the human bioresource supply hub that supports national healthcare and medical R&D.
URL
https://nih.go.kr/NIH/cms/content/eng/14/65714_view.html
MAIN ANCESTRY
EAS
PARTICIPANTS
Korean participants (linked with KoGES / national program)
SAMPLE SIZE
~210K
DATA ACCESS
https://koges.leelabsg.org/ , https://zenodo.org/record/7042518

National Center Biobank Network (NCBN)

Biobank / cohort
DESCRIPTION
Six National Centers in Japan conduct specialized medical research under the coordination of the National Center Biobank Network (NCBN) and develop therapeutics to improve and protect national health. They actively collaborate to establish a shared biobank and are developing a structure to facilitate industry-academia-government cooperation regarding bioresources through broad joint research. NCBN strives to promote the success of the National Centers and to create bright future for health and human life.
URL
https://ncbiobank.org/en/home.php
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese patients (national hospital biobank network)
SAMPLE SIZE
~120K

NyuWa genome resource

Biobank / cohort
DESCRIPTION
NyuWa, or NüWa, is the mother goddess who was the creator of the human population in Chinese mythology. Here we presented the NyuWa genome resource based on high depth (median 26X) WGS of 2,999 Chinese individuals from 23 out of 34 administrative divisions in China. NyuWa Genome Resource present in this website mainly contains two parts as NyuWa Chinese Population Variant Database and NyuWa reference panel server.
URL
http://bigdata.ibp.ac.cn/NyuWa/
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese individuals (NyuWa reference genomes)
SAMPLE SIZE
~3k

Qatar Biobank

Biobank / cohort
DESCRIPTION
Qatar Biobank is a national long-term health research initiative that recruits consenting adults in Qatar, collects biological samples and health data, and supports population and genomic research on diseases relevant to the region (often analyzed together with the Qatar Genome Program).
URL
https://www.qatarbiobank.org.qa/
MAIN ANCESTRY
Middle Eastern (Qatari Arab)
PARTICIPANTS
Consenting adults in Qatar (national biobank)
SAMPLE SIZE
~80K

Qatar Genome Program (QGP)

Biobank / cohort
DESCRIPTION
National sequencing initiative profiling thousands of Qatari genomes (linked with Qatar Biobank) to study migration history and improve imputation and analysis of Arab haplotypes.
URL
https://www.qatargenome.org.qa/
MAIN ANCESTRY
Middle Eastern (Qatari Arab)
PARTICIPANTS
Qatari individuals (national sequencing; linked with Qatar Biobank)
SAMPLE SIZE
~6K

SG10K_Health (SG10K)

Biobank / cohort
DESCRIPTION
SG10K_Health is the headline project of the Singapore National Precision Medicine programme (NPM Phase I). Comprising 10,000 whole-genome sequences from healthy Chinese, Indian, and Malay consented volunteers. SG10K_Health involved a research collaboration across multiple institutions in Singapore, enabling the country to develop the necessary infrastructure and deep capabilities to process, store, and analyse genetic data at the population scale in a safe, secure, and rapid manner. SG10K_Health provides near complete assessment of common genetic variants in Singapore’s three major ethnic groups, which can be used by clinicians to better manage Asian patients with genetic disease and as a control data set to compare against disease studies. Work is ongoing to link the SG10K_Health genomic data to research traits (e.g., height, weight, blood pressure) and clinical records.
URL
https://npm.a-star.edu.sg/
MAIN ANCESTRY
EAS,SAS
PARTICIPANTS
Singapore residents (Chinese, Malay, Indian, other)
SAMPLE SIZE
~10k

Taiwan Biobank (TWB)

Biobank / cohort
DESCRIPTION
The Taiwan Biobank (TWB) is an ongoing prospective study of over 150,000 individuals aged 30-70. A comprehensive list of phenotypes was collected for each consented participant at recruitment and follow-up visits through structured interviews and physical measurements. Biomarkers and genetic data were also generated for all participants from blood and urine samples.
URL
https://www.twbiobank.org.tw/
Main citation
Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2021). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. medRxiv. Feng, Y. C. A., Chen, C. Y., Chen, T. T., Kuo, P. H., Hsu, Y. H., Yang, H. I., ... & Lin, Y. F. (2022). Taiwan Biobank: a rich biomedical research database of the Taiwanese population. Cell Genomics, 100197.
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese adults (general-population recruitment)
SAMPLE SIZE
~150k
DATA ACCESS
https://taiwanview.twbiobank.org.tw/data_appl (application required)

Taiwan Precision Medicine Initiative (TMPI)

Biobank / cohort
DESCRIPTION
The Taiwan Precision Medicine Initiative (TPMI) is a genomic research program designed toadvance precision healthcare. With over 500,000 Taiwanese residents already enrolled, TPMI maintains the mostcomprehensive dataset of genotypes and electronic medicalrecords for Han Chinese populations.
URL
https://tpmi.ibms.sinica.edu.tw/
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese adults (precision medicine initiative)
SAMPLE SIZE
~460k

Taizhou Imaging Study (TIS)

Biobank / cohort
DESCRIPTION
Prospective Chinese cohort (Taizhou) combining lifestyle factors with multi-omics and brain imaging to study aging and preclinical dementia.
URL
https://www.fdtzihs.org.cn/dljs
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese adults (Taizhou imaging cohort)
SAMPLE SIZE
~1K

The China Metabolic Analytics Project (ChinaMAP)

Biobank / cohort
DESCRIPTION
The ChinaMAP is based on three large-scale cohorts: The China Noncommunicable Disease Surveillance 2010, a nationally representative study with 150,000 participants; the Risk Evaluation of cAncers in Chinese diabeTic Individuals: a lONgitudinal (REACTION) study with 250,000 participants15 and the Community-based Cardiovascular Risk During Urbanization in Shanghai with 50,000 participants.
URL
http://www.mbiobank.com/
MAIN ANCESTRY
EAS
PARTICIPANTS
Chinese adults (metabolic disease cohort)
SAMPLE SIZE
~10k

The Hisayama Study

Biobank / cohort
DESCRIPTION
The Hisayama Study is a population-based prospective cohort study that has been conducted in the town of Hisayama, Japan since 1961.
URL
https://www.hisayama.med.kyushu-u.ac.jp/en/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese adults (Hisayama town)
SAMPLE SIZE
~8K

The Japan COVID-19 Task Force study (JCTF)

Biobank / cohort
DESCRIPTION
Multi-institution Japanese consortium formed during the COVID-19 pandemic; contributes multi-omics and GWAS-related data surfaced through resources such as the Japan Omics Browser.
URL
https://japan-omics.jp/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese individuals (COVID-19 host genetics)
SAMPLE SIZE
~1.4K

The Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD)

Biobank / cohort
DESCRIPTION
Japan Prospective Studies Collaboration for Aging and Dementia (JPSC-AD) study is a collaborative prospective cohort study of approximately 10,000 elderly people from 8 newly-established community-based dementia cohort studies in Japan, in which the data is prospectively collected by using the pre-specified standardized protocol. The purpose of this study is to evaluate quantitatively environmental and genomic risk factors for dementia in Japanese and to establish effective preventive strategies for dementia, in order to realize healthy aging society.
URL
https://www.eph.med.kyushu-u.ac.jp/jpsc/en/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese adults (aging and dementia collaboration)
SAMPLE SIZE
~11K

The Malaysian Cohort (TMC)

Biobank / cohort
DESCRIPTION
The Malaysian Cohort (TMC) is a prospective study of non-communicable diseases in a multi-ethnic Malaysian population, recruiting adults across rural and urban settings with biosamples and phenotypes for gene–environment and biomarker research.
URL
https://www.ukm.my/mycohort/ms/
MAIN ANCESTRY
EAS,SAS
PARTICIPANTS
Malaysian adults (multi-ethnic national cohort)
SAMPLE SIZE
~100k

The Nagahama Study

Biobank / cohort
DESCRIPTION
The Nagahama Primary Prevention Cohort Project is a joint project based on an agreement between Kyoto University Graduate School of Medicine and Nagahama City, Shiga Prefecture, with the cooperation of approximately 10,000 Nagahama residents. In addition, the project conducts follow-up surveys on morbidity and mortality, special tests and surveys on sleep, brain imaging, memory, motor function, skin condition, socioeconomic status, etc., during health checkups and periodic surveys conducted every five years after that. Furthermore, we have completed a multi-omics analysis focusing on genome analysis of approximately 9,000 people (including whole genome sequencing of roughly 2,500 people), comprehensive metabolite analysis of 3-time points, and comprehensive protein analysis of 2,000 people (as of August 2021), and based on these rich and diverse data, we have been searching for health risk Based on these abundant and varied data, we aim to search for health risk factors and elucidate their interactions.
URL
https://zeroji-cohort.com/english/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese adults (Nagahama City, Shiga)
SAMPLE SIZE
~10K

The STROMICS genome study (STROMICS)

Biobank / cohort
DESCRIPTION
The Stroke Omics Atlas (STROMICS) is committed to using multi-omics and clinical big data to achieve accurate diagnosis and treatment for stroke patients, reduce treatment costs, and contribute to the health of the people.
Using artificial intelligence and cutting-edge high-throughput omics technologies (genomics, transcriptomics, epigenomics, proteomics, metabolomics, metagenomics, etc.), potential drug targets for stroke can be found on a large scale and with high efficiency, providing strong technical support for clinical transformation.
Relying on the China National Clinical Research Center for Neurological Diseases and Center of excellence for Omics Research (CORe), STROMICS has realized the interdisciplinary integration of clinical medicine, bioinformatics, and multi-omics, creating a new paradigm of drug research and development.
URL
http://www.stromics.org.cn/
MAIN ANCESTRY
EAS
PARTICIPANTS
Chinese adults (acute ischemic stroke registry)
SAMPLE SIZE
~10k

Tohoku Medical Megabank (TMM)

Biobank / cohort
DESCRIPTION
Tohoku University Tohoku Medical Megabank Organization was founded to establish an advanced medical system to foster the reconstruction from the Great East Japan Earthquake. The organization has been developing a biobank that combines medical and genome information during the process of rebuilding the community medical system and supporting health and welfare in the Tohoku area. The information from the brand-new biobank will create a new medical system, and, based on the findings of its analysis, the organization aims to attract more medical practitioners from all over the country to the area, promote industry-academic partnerships, create employment in related fields, and restore the medical system in Tohoku.
URL
https://www.megabank.tohoku.ac.jp/english/
MAIN ANCESTRY
EAS
PARTICIPANTS
Japanese adults (Tōhoku region; megabank)
SAMPLE SIZE
~157k
DATA ACCESS
https://jmorp.megabank.tohoku.ac.jp/

Westlake BioBank for Chinese (WBBC)

Biobank / cohort
DESCRIPTION
The Westlake BioBank for Chinese (WBBC) cohort is a population-based prospective study with its major purpose to better understand the effect of genetic and environmental factors on growth and development from youngster to elderly. The dataset comprises a wide range of demographics and anthropometric measures, serological tests, physical activity, sleep quality, age at menarche and bone mineral density. WBBC is designed as a prospective cohort study and will recruit at least 100,000 Chinese samples. The pilot project of WBBC has recruited a total of 14,726 participants (4,751 males and 9,975 females) and the baseline survey was carried out from 2017 to 2019.
URL
https://wbbc.westlake.edu.cn/
MAIN ANCESTRY
EAS
PARTICIPANTS
Han Chinese adults (Westlake biobank)
SAMPLE SIZE
~14k