{"title":"CM-VGG16:从心音图信号自动检测心脏瓣膜疾病的ConvMixer-Enhanced VGG16模型","authors":"Brundavanam Satyasai;Rajeev Sharma","doi":"10.1109/JSEN.2024.3511633","DOIUrl":null,"url":null,"abstract":"The absence of medical resources at remote places prevents many patients from receiving a prompt and accurate diagnosis of cardiovascular disorders. To address this, we proposed a novel deep learning model based on partially fine-tuned VGG16 and ConvMixer for the automatic identification of various heart valve diseases (HVDs) from phonocardiogram (PCG) signals. The method involves preprocessing PCG signals and converting them into gammatone filterbank (GFB) based 2-D time–frequency images. To generate time–frequency images, we used gammatonegram, gammatone cepstral coefficients (GTCCs), and gammatone discrete wavelet coefficients (GDWCs) techniques. These time–frequency images are augmented to reduce overfitting and then fed into a VGG16 model. Partial fine-tuning of the VGG16 model accelerates convergence and further improves performance. By extending the VGG16 model with ConvMixer, Global AveragePooling, Dense, and Softmax layers, we enhance its capacity to capture intricate patterns. The ConvMixer enriches spatial and channelwise features using Depthwise and Pointwise convolutions. We also performed an ablation analysis to highlight the effect of ConvMixer with VGG16. In addition, performance evaluation based on precision, recall, F1-score, test accuracy, and validation accuracy reveals the efficacy of the proposed method. Comparisons between gammatonegram, GTCC, and GDWC show superior performance of gammatonegram, achieving a test accuracy of 99.60% and validation accuracy of 99.75%. Our approach demonstrates significant advances over existing methods, offering a promising solution for remote diagnosis of HVDs.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 2","pages":"3998-4005"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CM-VGG16: ConvMixer-Enhanced VGG16 Model for Automatic Detection of Heart Valve Diseases From Phonocardiogram Signals\",\"authors\":\"Brundavanam Satyasai;Rajeev Sharma\",\"doi\":\"10.1109/JSEN.2024.3511633\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The absence of medical resources at remote places prevents many patients from receiving a prompt and accurate diagnosis of cardiovascular disorders. To address this, we proposed a novel deep learning model based on partially fine-tuned VGG16 and ConvMixer for the automatic identification of various heart valve diseases (HVDs) from phonocardiogram (PCG) signals. The method involves preprocessing PCG signals and converting them into gammatone filterbank (GFB) based 2-D time–frequency images. To generate time–frequency images, we used gammatonegram, gammatone cepstral coefficients (GTCCs), and gammatone discrete wavelet coefficients (GDWCs) techniques. These time–frequency images are augmented to reduce overfitting and then fed into a VGG16 model. Partial fine-tuning of the VGG16 model accelerates convergence and further improves performance. By extending the VGG16 model with ConvMixer, Global AveragePooling, Dense, and Softmax layers, we enhance its capacity to capture intricate patterns. The ConvMixer enriches spatial and channelwise features using Depthwise and Pointwise convolutions. We also performed an ablation analysis to highlight the effect of ConvMixer with VGG16. In addition, performance evaluation based on precision, recall, F1-score, test accuracy, and validation accuracy reveals the efficacy of the proposed method. Comparisons between gammatonegram, GTCC, and GDWC show superior performance of gammatonegram, achieving a test accuracy of 99.60% and validation accuracy of 99.75%. Our approach demonstrates significant advances over existing methods, offering a promising solution for remote diagnosis of HVDs.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 2\",\"pages\":\"3998-4005\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10794609/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10794609/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
由于偏远地区缺乏医疗资源,许多患者无法得到及时和准确的心血管疾病诊断。为了解决这个问题,我们提出了一种基于部分微调的VGG16和ConvMixer的新型深度学习模型,用于从心音图(PCG)信号中自动识别各种心脏瓣膜疾病(hvd)。该方法包括对PCG信号进行预处理,并将其转换成基于伽马素滤波器组(GFB)的二维时频图像。为了生成时频图像,我们使用了伽玛图、伽玛酮倒谱系数(gtcc)和伽玛酮离散小波系数(GDWCs)技术。这些时频图像被增强以减少过拟合,然后输入到VGG16模型中。VGG16模型的局部微调加快了收敛速度,进一步提高了性能。通过使用ConvMixer, Global AveragePooling, Dense和Softmax层扩展VGG16模型,我们增强了其捕获复杂模式的能力。ConvMixer使用深度卷积和点卷积丰富空间和通道特征。我们还进行了消融分析,以突出使用VGG16的ConvMixer的效果。此外,基于精密度、召回率、f1分、测试准确度和验证准确度的性能评估显示了所提方法的有效性。伽玛图与GTCC和GDWC的比较表明,伽玛图具有较好的性能,测试准确率为99.60%,验证准确率为99.75%。我们的方法比现有方法有了显著的进步,为hvd的远程诊断提供了一个有希望的解决方案。
CM-VGG16: ConvMixer-Enhanced VGG16 Model for Automatic Detection of Heart Valve Diseases From Phonocardiogram Signals
The absence of medical resources at remote places prevents many patients from receiving a prompt and accurate diagnosis of cardiovascular disorders. To address this, we proposed a novel deep learning model based on partially fine-tuned VGG16 and ConvMixer for the automatic identification of various heart valve diseases (HVDs) from phonocardiogram (PCG) signals. The method involves preprocessing PCG signals and converting them into gammatone filterbank (GFB) based 2-D time–frequency images. To generate time–frequency images, we used gammatonegram, gammatone cepstral coefficients (GTCCs), and gammatone discrete wavelet coefficients (GDWCs) techniques. These time–frequency images are augmented to reduce overfitting and then fed into a VGG16 model. Partial fine-tuning of the VGG16 model accelerates convergence and further improves performance. By extending the VGG16 model with ConvMixer, Global AveragePooling, Dense, and Softmax layers, we enhance its capacity to capture intricate patterns. The ConvMixer enriches spatial and channelwise features using Depthwise and Pointwise convolutions. We also performed an ablation analysis to highlight the effect of ConvMixer with VGG16. In addition, performance evaluation based on precision, recall, F1-score, test accuracy, and validation accuracy reveals the efficacy of the proposed method. Comparisons between gammatonegram, GTCC, and GDWC show superior performance of gammatonegram, achieving a test accuracy of 99.60% and validation accuracy of 99.75%. Our approach demonstrates significant advances over existing methods, offering a promising solution for remote diagnosis of HVDs.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice