Journal article 37 views 2 downloads
Quantifying the Impact of Signal Simplification, Data Quantity, and Task Difficulty on Vision Transformer Performance for ECG Rhythm Classification
Computers, Volume: 15, Issue: 5, Start page: 301
Swansea University Authors:
Jarod Hartley, Joe MacInnes
-
PDF | Version of Record
© 2026 by the authors. This is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Download (663.04KB)
DOI (Published version): 10.3390/computers15050301
Abstract
Vision transformers (ViTs) have demonstrated considerable promise for classifying electrocardiogram (ECG) rhythms. However, much of the existing research is conducted in highly controlled, data-sterile settings that fail to reflect the substantial variability present in real-world ECG signals. This...
| Published in: | Computers |
|---|---|
| ISSN: | 2073-431X |
| Published: |
MDPI AG
2026
|
| Online Access: |
Check full text
|
| URI: | https://cronfa.swan.ac.uk/Record/cronfa71894 |
| Abstract: |
Vision transformers (ViTs) have demonstrated considerable promise for classifying electrocardiogram (ECG) rhythms. However, much of the existing research is conducted in highly controlled, data-sterile settings that fail to reflect the substantial variability present in real-world ECG signals. This paper seeks to address this gap by examining how signal simplification, data quantity, and task difficulty influence the performance of the SwinV2 ViT model in ECG rhythm classification. Through systematic analysis, we highlight that classifying highly abstracted signals yields only a limited impact on model performance, with all models achieving over 95% accuracy, while the amount of training data plays a crucial role with an almost 15% accuracy difference between the models trained on the most data and the least data. Finally, our analysis shows the model’s ability to effectively adapt to an increased class count, which is essential due to the varying nature of ECG diagnosis. In summary, these results highlight the importance of carefully balancing data clarity, dataset size, and diagnostic variety when designing ECG classification systems. Achieving this balance is crucial for building reliable and scalable AI solutions for cardiac assessment. |
|---|---|
| Keywords: |
vision transformer; ViT; signal simplification; data quantity; task difficulty; electrocardiogram; ECG; machine learning |
| College: |
Faculty of Science and Engineering |
| Issue: |
5 |
| Start Page: |
301 |

