Open Access
Catalog entries using this tag (links open the entry card on its page):
Entries
PTB-XL
FULL NAME
PTB-XL: A Large Publicly Available Electrocardiography Dataset
DESCRIPTION
PTB-XL is the largest freely accessible clinical 12-lead ECG-waveform dataset, comprising 21,837 records from 18,885 patients of 10 seconds length. Annotated by up to two cardiologists with 71 SCP-ECG diagnostic, form, and rhythm statements organized into 5 superclasses (NORM, CD, MI, HYP, STTC) and 24 subclasses. Includes raw waveforms at 500Hz and downsampled 100Hz, plus rich metadata: demographics (age, sex, height, weight), signal quality annotations (noise, baseline drift, electrodes), and recommended stratified 10-fold cross-validation splits. Fully open access on PhysioNet — no registration or training required. Widely used as the standard benchmark for automated ECG interpretation, arrhythmia detection, and deep learning in cardiology.
URL
KEYWORDS
ECG, electrocardiography, 12-lead, cardiovascular, waveform, signal processing, PhysioNet
TITLE
PTB-XL, a large publicly available electrocardiography dataset.
Main citation
Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, Schaeffter T. (2020) PTB-XL, a large publicly available electrocardiography dataset. Scientific Data, 7:154. doi:10.1038/s41597-020-0495-6.
ABSTRACT
Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are both the lack of public datasets and well-defined benchmarking procedures to allow comparisons of different algorithms. To address these issues, we put forward PTB-XL, the to-date largest freely accessible clinical 12-lead ECG-waveform dataset comprising 21837 records from 18885 patients of 10 seconds length.
DOI
10.1038/s41597-020-0495-6