ECG-Lens: Benchmarking ML & DL Models on PTB-XL Dataset
Saloni Garg, Ukant Jadia, Amit Sagtani, Kamal Kant Hiran
TLDR
This paper benchmarks ML and DL models on raw 12-lead ECG data from PTB-XL, finding complex CNNs like ECG-Lens significantly outperform traditional methods.
Key contributions
- Compares 3 ML and 3 DL models for ECG classification on the PTB-XL dataset.
- Utilizes raw 12-lead ECG signals, with DL models automatically extracting features.
- Applies Stationary Wavelet Transform (SWT) for data augmentation to boost performance.
- ECG-Lens (complex CNN) achieved superior performance (80% accuracy, 90% ROC-AUC).
Why it matters
This study provides a practical benchmark for automated ECG classification, demonstrating that deep learning, particularly complex CNNs, significantly outperforms traditional machine learning on raw 12-lead ECG data. These findings are crucial for guiding the selection of models and future condition-specific model development in cardiovascular disease diagnosis.
Original Abstract
Automated classification of electrocardiogram (ECG) signals is a useful tool for diagnosing and monitoring cardiovascular diseases. This study compares three traditional machine learning algorithms (Decision Tree Classifier, Random Forest Classifier, and Logistic Regression) and three deep learning models (Simple Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Complex CNN (ECGLens)) for the classification of ECG signals from the PTB-XL dataset, which contains 12-lead recordings from normal patients and patients with various cardiac conditions. The DL models were trained on raw ECG signals, allowing them to automatically extract discriminative features. Data augmentation using the Stationary Wavelet Transform (SWT) was applied to enhance model performance, increase the diversity of training samples, and preserve the essential characteristics of the ECG signals. The models were evaluated using multiple metrics, including accuracy, precision, recall, F1-score, and ROC-AUC. The ECG-Lens model achieved the highest performance, with 80% classification accuracy and a 90% ROC-AUC. These findings demonstrate that deep learning architectures, particularly complex CNNs substantially outperform traditional ML methods on raw 12-lead ECG data, and provide a practical benchmark for selecting automated ECG classification models and identifying directions for condition-specific model development.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.