Kamilla A. Bringel , Davi C.M.G. Leone , João Vitor L. de C. Firmino MME , Marcelo C. Rodrigues PhD , Marcelo D.T. de Melo MD, PhD
{"title":"Voice Analysis and Neural Networks as a Clinical Decision Support System for Patients With Lung Diseases","authors":"Kamilla A. Bringel , Davi C.M.G. Leone , João Vitor L. de C. Firmino MME , Marcelo C. Rodrigues PhD , Marcelo D.T. de Melo MD, PhD","doi":"10.1016/j.mcpdig.2024.06.006","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>To analyze the voice of patients with lung diseases, compared with healthy individuals, to detect patterns capable of assessing dyspnea using artificial neural networks (ANNs).</p></div><div><h3>Patients and Methods</h3><p>This research consists of a cross-sectional prospective pilot study performed in a reference tertiary center, which included a group of patients with lung diseases, compared with a group of healthy individuals. Each patient’s voice was recorded in controlled rooms. The following techniques were applied to extract and select signals’ features: statistical analysis, fast Fourier transform, discrete wavelet transform and Mel-Cepstral analysis. In addition, data augmentation was used to avoid overfitting and improve the ANNs accuracy.</p></div><div><h3>Results</h3><p>A total of 195 voices were recorded: 131 from lung disease patients and 64 from healthy individuals, separated according to gender and age. Using data augmentation, 751 additional audio samples were generated: 501 from healthy individuals and 445 from patients with lung disease. Among male participants, 133 samples were related to lung diseases and 197 were related to healthy ones. From them, 264 audios were used for ANNs training, obtaining an accuracy of 89%. In the female group, 312 had lung diseases and 304 were healthy. Among them, 492 audios were used for training, resulting in an accuracy of 87.6%.</p></div><div><h3>Conclusion</h3><p>Spectral analysis techniques applied to voice recordings using ANNs have reported high accuracy in the efficient diagnosis of lung diseases when compared with healthy individuals.</p></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"2 3","pages":"Pages 367-374"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949761224000671/pdfft?md5=1ce8698311c7e33369926c62a773de89&pid=1-s2.0-S2949761224000671-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mayo Clinic Proceedings. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949761224000671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To analyze the voice of patients with lung diseases, compared with healthy individuals, to detect patterns capable of assessing dyspnea using artificial neural networks (ANNs).
Patients and Methods
This research consists of a cross-sectional prospective pilot study performed in a reference tertiary center, which included a group of patients with lung diseases, compared with a group of healthy individuals. Each patient’s voice was recorded in controlled rooms. The following techniques were applied to extract and select signals’ features: statistical analysis, fast Fourier transform, discrete wavelet transform and Mel-Cepstral analysis. In addition, data augmentation was used to avoid overfitting and improve the ANNs accuracy.
Results
A total of 195 voices were recorded: 131 from lung disease patients and 64 from healthy individuals, separated according to gender and age. Using data augmentation, 751 additional audio samples were generated: 501 from healthy individuals and 445 from patients with lung disease. Among male participants, 133 samples were related to lung diseases and 197 were related to healthy ones. From them, 264 audios were used for ANNs training, obtaining an accuracy of 89%. In the female group, 312 had lung diseases and 304 were healthy. Among them, 492 audios were used for training, resulting in an accuracy of 87.6%.
Conclusion
Spectral analysis techniques applied to voice recordings using ANNs have reported high accuracy in the efficient diagnosis of lung diseases when compared with healthy individuals.