基于语音标记分析和机器学习的帕金森病IT诊断

Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki Pub Date : 2023-06-22 DOI:10.35596/1729-7648-2023-21-3-102-110

U. A. Vishniakou, Yiwei Xia

{"title":"基于语音标记分析和机器学习的帕金森病IT诊断","authors":"U. A. Vishniakou, Yiwei Xia","doi":"10.35596/1729-7648-2023-21-3-102-110","DOIUrl":null,"url":null,"abstract":"The results of studying the parameters of the spectra of speech signals by machine learning with the use of neural networks are presented. This study was carried out in order to confirm experimentally the possibility of performing an assessment of these parameters for the detection of Parkinson’s disease in the early stages (IT diagnostics). During the study, the public database was used, which systematized the spectra of vowel sounds uttered by patients with Parkinson’s disease. The applied method is binary data classification. In the course of the study, the speech data spectrum was first preprocessed, which consisted of filtering it in order to remove its noise components and eliminate bursts and gaps in it. Then the parameters of the processed spectrum of speech data were determined: average value, maximum and minimum, peak, wavelet coefficients, MFCC and TQWT. After that, the object was selected using the PCA algorithm. The model was trained using the Knn and Random Forest algorithms, as well as the Bayesian neural network. The Bayesian optimization algorithm and the GridSearch method were used to find the best model hyperparameters. It has been established that when using Knn, Random Forest and Bayesian neural network, it is possible to increase the accuracy of recognition of Parkinson’s disease by 94.7; 88.16 and 74.74 %, respectively. A similar study by other scientists showed that the recognition accuracy of data sets was only 86 %.","PeriodicalId":33565,"journal":{"name":"Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"IT Diagnostics of Parkinson’s Disease Based on the Analysis of Voice Markers and Machine Learning\",\"authors\":\"U. A. Vishniakou, Yiwei Xia\",\"doi\":\"10.35596/1729-7648-2023-21-3-102-110\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The results of studying the parameters of the spectra of speech signals by machine learning with the use of neural networks are presented. This study was carried out in order to confirm experimentally the possibility of performing an assessment of these parameters for the detection of Parkinson’s disease in the early stages (IT diagnostics). During the study, the public database was used, which systematized the spectra of vowel sounds uttered by patients with Parkinson’s disease. The applied method is binary data classification. In the course of the study, the speech data spectrum was first preprocessed, which consisted of filtering it in order to remove its noise components and eliminate bursts and gaps in it. Then the parameters of the processed spectrum of speech data were determined: average value, maximum and minimum, peak, wavelet coefficients, MFCC and TQWT. After that, the object was selected using the PCA algorithm. The model was trained using the Knn and Random Forest algorithms, as well as the Bayesian neural network. The Bayesian optimization algorithm and the GridSearch method were used to find the best model hyperparameters. It has been established that when using Knn, Random Forest and Bayesian neural network, it is possible to increase the accuracy of recognition of Parkinson’s disease by 94.7; 88.16 and 74.74 %, respectively. A similar study by other scientists showed that the recognition accuracy of data sets was only 86 %.\",\"PeriodicalId\":33565,\"journal\":{\"name\":\"Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35596/1729-7648-2023-21-3-102-110\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35596/1729-7648-2023-21-3-102-110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

介绍了利用神经网络进行机器学习研究语音信号频谱参数的结果。进行这项研究是为了从实验上证实对这些参数进行评估以在早期阶段发现帕金森病(IT诊断)的可能性。在研究过程中，使用了公共数据库，该数据库将帕金森病患者发出的元音频谱系统化。应用的方法是二进制数据分类。在研究过程中，首先对语音数据频谱进行预处理，对其进行滤波，去除噪声成分，消除其中的突发和间隙。然后确定语音数据处理后的频谱参数:平均值、最大值和最小值、峰值、小波系数、MFCC和TQWT。然后，使用PCA算法对目标进行选择。该模型使用Knn和随机森林算法以及贝叶斯神经网络进行训练。采用贝叶斯优化算法和GridSearch方法寻找最佳模型超参数。研究表明，当使用Knn、随机森林和贝叶斯神经网络时，帕金森病的识别准确率可提高94.7;分别为88.16%和74.74%。其他科学家的一项类似研究表明，数据集的识别准确率只有86%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

IT Diagnostics of Parkinson’s Disease Based on the Analysis of Voice Markers and Machine Learning

The results of studying the parameters of the spectra of speech signals by machine learning with the use of neural networks are presented. This study was carried out in order to confirm experimentally the possibility of performing an assessment of these parameters for the detection of Parkinson’s disease in the early stages (IT diagnostics). During the study, the public database was used, which systematized the spectra of vowel sounds uttered by patients with Parkinson’s disease. The applied method is binary data classification. In the course of the study, the speech data spectrum was first preprocessed, which consisted of filtering it in order to remove its noise components and eliminate bursts and gaps in it. Then the parameters of the processed spectrum of speech data were determined: average value, maximum and minimum, peak, wavelet coefficients, MFCC and TQWT. After that, the object was selected using the PCA algorithm. The model was trained using the Knn and Random Forest algorithms, as well as the Bayesian neural network. The Bayesian optimization algorithm and the GridSearch method were used to find the best model hyperparameters. It has been established that when using Knn, Random Forest and Bayesian neural network, it is possible to increase the accuracy of recognition of Parkinson’s disease by 94.7; 88.16 and 74.74 %, respectively. A similar study by other scientists showed that the recognition accuracy of data sets was only 86 %.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioelektroniki

自引率

0.00%

发文量

审稿时长

8 weeks