{"title":"A deep learning modular ECG approach for cardiologist assisted adjudication of atrial fibrillation and atrial flutter episodes","authors":"Quentin Fleury MSc , Rémi Dubois PhD , Sylvain Christophle-Boulard MSc , Fabrice Extramiana MD, PhD , Pierre Maison-Blanche MD","doi":"10.1016/j.hroo.2024.09.007","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Detection of atrial tachyarrhythmias (ATA) on long-term electrocardiogram (ECG) recordings is a prerequisite to reduce ATA-related adverse events. However, the burden of editing massive ECG data is not sustainable. Deep learning (DL) algorithms provide improved performances on resting ECG databases. However, results on long-term Holter recordings are scarce.</div></div><div><h3>Objective</h3><div>We aimed to build and evaluate a DL modular software using ECG features well known to cardiologists with a user interface that allows cardiologists to adjudicate the results and drive a second DL analysis.</div></div><div><h3>Methods</h3><div>Using a large (n = 187 recordings, 249,419 one-minute samples), beat-to-beat annotated, two-lead Holter database, we built a DL algorithm with a modular structure mimicking expert physician ECG interpretation to classify atrial rhythms. The DL network includes 3 modules (cardiac rhythm regularity, electrical atrial waveform, and raw voltage by time data) followed by a decision network and a long-term weighting factor. The algorithm was validated on an external database.</div></div><div><h3>Results</h3><div>F1 scores of our classifier were 99% for ATA detection, 95% for atrial fibrillation, and 90% for atrial flutter. Using the external Massachusetts Institute of Technology database, the classifier obtains an F1-score of 97% for the normal sinus rhythm class and 96% for the ATA class. Residual errors could be corrected by manual deactivation of 1 module in 7 of 15 of the recordings, with an accuracy < 90%.</div></div><div><h3>Conclusion</h3><div>A DL modular software using ECG features well known to cardiologists provided an excellent overall performance. Clinically significant residual errors were most often related to the classification of the atrial arrhythmia type (fibrillation vs flutter). The modular structure of the algorithm helped to edit and correct the artificial intelligence–based first-pass analysis and will provide a basis for explainability.</div></div>","PeriodicalId":29772,"journal":{"name":"Heart Rhythm O2","volume":"5 12","pages":"Pages 862-872"},"PeriodicalIF":2.5000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11721725/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heart Rhythm O2","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666501824002988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Detection of atrial tachyarrhythmias (ATA) on long-term electrocardiogram (ECG) recordings is a prerequisite to reduce ATA-related adverse events. However, the burden of editing massive ECG data is not sustainable. Deep learning (DL) algorithms provide improved performances on resting ECG databases. However, results on long-term Holter recordings are scarce.
Objective
We aimed to build and evaluate a DL modular software using ECG features well known to cardiologists with a user interface that allows cardiologists to adjudicate the results and drive a second DL analysis.
Methods
Using a large (n = 187 recordings, 249,419 one-minute samples), beat-to-beat annotated, two-lead Holter database, we built a DL algorithm with a modular structure mimicking expert physician ECG interpretation to classify atrial rhythms. The DL network includes 3 modules (cardiac rhythm regularity, electrical atrial waveform, and raw voltage by time data) followed by a decision network and a long-term weighting factor. The algorithm was validated on an external database.
Results
F1 scores of our classifier were 99% for ATA detection, 95% for atrial fibrillation, and 90% for atrial flutter. Using the external Massachusetts Institute of Technology database, the classifier obtains an F1-score of 97% for the normal sinus rhythm class and 96% for the ATA class. Residual errors could be corrected by manual deactivation of 1 module in 7 of 15 of the recordings, with an accuracy < 90%.
Conclusion
A DL modular software using ECG features well known to cardiologists provided an excellent overall performance. Clinically significant residual errors were most often related to the classification of the atrial arrhythmia type (fibrillation vs flutter). The modular structure of the algorithm helped to edit and correct the artificial intelligence–based first-pass analysis and will provide a basis for explainability.