{"title":"CVDLLM: Automated Cardiovascular Disease Diagnosis With Large-Language-Model-Assisted Graph Attentive Feature Interaction","authors":"Xihe Qiu;Haoyu Wang;Xiaoyu Tan;Yaochu Jin","doi":"10.1109/TAI.2025.3527401","DOIUrl":null,"url":null,"abstract":"Electrocardiogram (ECG) measurements are essential for detecting and treating cardiovascular disease (CVD). However, manual evaluation of ECGs is prone to errors due to morphological variations. Although machine learning methods have shown promise in diagnosing diseases, automatic CVD diagnosis based on ECGs is still suffering from low diagnosis accuracy due to the limited usage of time-series information and interlead correlations. In this article, we propose a large language model (LLM)-assisted graph attentive feature interaction learning framework (CVDLLM) for automatic ECG diagnosis. It utilizes ECG data from twelve leads to classify eight heart diseases, including rhythm abnormalities and normal conditions. Our framework combines convolutional and recurrent neural networks for independent time-series feature extraction from 12-lead ECG signals. By incorporating features extracted by heart rate variability (HRV) analysis, we employ graph attention neural networks (GAT) and self-attentive feature interaction mechanism (GSAT) for feature interaction and model learning. Leveraging LLMs with pretrained knowledge bases and advanced language comprehension, we extract and learn semantic embeddings from medical case data. This approach equips our framework with a deep semantic layer, significantly enhancing its capacity to understand complex medical texts. Additionally, by representing the twelve leads as a graph, our framework enables highly accurate disease diagnosis based on spatial and temporal interactions with 12-lead ECG signals. We evaluate the performance of our proposed framework and our framework achieves state-of-the-art performance with accuracy, precision, recall, and F1-score.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 6","pages":"1575-1590"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10835161/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Electrocardiogram (ECG) measurements are essential for detecting and treating cardiovascular disease (CVD). However, manual evaluation of ECGs is prone to errors due to morphological variations. Although machine learning methods have shown promise in diagnosing diseases, automatic CVD diagnosis based on ECGs is still suffering from low diagnosis accuracy due to the limited usage of time-series information and interlead correlations. In this article, we propose a large language model (LLM)-assisted graph attentive feature interaction learning framework (CVDLLM) for automatic ECG diagnosis. It utilizes ECG data from twelve leads to classify eight heart diseases, including rhythm abnormalities and normal conditions. Our framework combines convolutional and recurrent neural networks for independent time-series feature extraction from 12-lead ECG signals. By incorporating features extracted by heart rate variability (HRV) analysis, we employ graph attention neural networks (GAT) and self-attentive feature interaction mechanism (GSAT) for feature interaction and model learning. Leveraging LLMs with pretrained knowledge bases and advanced language comprehension, we extract and learn semantic embeddings from medical case data. This approach equips our framework with a deep semantic layer, significantly enhancing its capacity to understand complex medical texts. Additionally, by representing the twelve leads as a graph, our framework enables highly accurate disease diagnosis based on spatial and temporal interactions with 12-lead ECG signals. We evaluate the performance of our proposed framework and our framework achieves state-of-the-art performance with accuracy, precision, recall, and F1-score.