Efficient pretraining of ECG scalogram images using masked autoencoders for cardiovascular disease diagnosis
- Abstract
- Cardiovascular diseases (CVDs) are the leading cause of mortality worldwide, emphasizing the need for accurate and early diagnosis. Electrocardiograms (ECG) provide a non-invasive means of diagnosing various cardiac conditions. However, traditional methods of interpreting ECG signals require substantial expertise and time, motivating the development of automated deep learning models to enhance diagnostic precision. This study proposes a novel approach that leverages masked autoencoders (MAE) to pretrain a model on ECG scalogram images, thereby enhancing the diagnostic accuracy for seven CVDs. Through extensive experimentation, we demonstrated that pretraining with an 85% masking ratio over 500 epochs yields optimal results. The pretrained ViT-S(MAE-scalo) network demonstrated remarkable performance in detecting CVDs, achieving an AUC of 0.986 and 92.43% accuracy in Lead II. Furthermore, the ensemble learning approach applied across 12 ECG leads enhanced the model’s diagnostic capabilities, resulting in an AUC of 0.994 and 92.72% accuracy. The MAE-based models outperformed traditional models such as ResNet-34 and ViT-S pretrained on ImageNet or random weights, as well as other SSL models such as MoCo-v2 and BYOL. Notably, the MAE-based models demonstrated superior performance even with a significantly smaller dat
- Author(s)
- 강대성; 윤태영
- Issued Date
- 2025-07-08
- Type
- Article
- Keyword
- 인공지능시스템및응용
- DOI
- 10.1038/s41598-025-10773-w
- URI
- http://repository.sungshin.ac.kr/handle/2025.oak/8847
- Publisher
- NATURE PORTFOLIO
- ISSN
- 2045-2322
-
Appears in Collections:
- 바이오헬스융합학부 > 학술논문
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.