Classification-based screening of Parkinson’s disease patients through voice signal

Fulvio Cordella, A. Paffi, A. Pallotti
{"title":"Classification-based screening of Parkinson’s disease patients through voice signal","authors":"Fulvio Cordella, A. Paffi, A. Pallotti","doi":"10.1109/MeMeA52024.2021.9478683","DOIUrl":null,"url":null,"abstract":"In this paper a classification algorithm for Parkinson’s Disease screening is proposed. Code executes the processing of specific voice signals recorded by healthy and ill subjects. In the direction of a future implementation and validation in a home telemonitoring system, the algorithm has been built with the objective to serve as a screening tool for the precocious directing of subjects with high risk of neurological diseases to instrumental exams. In fact, in several neurological disorders, such as Parkinson’s disease, motor impairments of vocal apparatus arise earlier than postural and ambulatory symptoms. In a home telemonitoring system, in which hardware would consist in a voice recorder (that could be a simple smartphone) and a server for the web platform, data would be acquired and instantly stored on a platform for their processing through machine learning algorithms and to be viewed by specialists. For this purpose, a fully automatic process is needed. Therefore, in this work, audio-preprocessing and features computation are completely performed automatically, using Matlab. Final models have been trained in Matlab environments from Weka’s libraries. The family of developed models are trained with different type of phonations, from simple vowels to complex sounds, for a wider and more efficient analysis of vocal apparatus motor impairments. Moreover, dataset was 612 observation large, that is significantly above the mean size of similar works using simple phonations only. For a deeper analysis, different groups of parameters have been tested and cepstral features have been found to be optimal for classification and made up the big part of final algorithm. Developed models are part of the K-Nearest Neighbor family, thus, available for implementation in web platform. Finally, obtained models have shown high accuracies on the whole dataset, reaching values comparable with the literature but with more stability (standard deviation less than 1%). These results have been confirmed in the last validation session in which models have been exported and validated with 25% of data, reaching a best performance with a true positive rate of 98% and a true negative rate of 87%.","PeriodicalId":429222,"journal":{"name":"2021 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MeMeA52024.2021.9478683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

In this paper a classification algorithm for Parkinson’s Disease screening is proposed. Code executes the processing of specific voice signals recorded by healthy and ill subjects. In the direction of a future implementation and validation in a home telemonitoring system, the algorithm has been built with the objective to serve as a screening tool for the precocious directing of subjects with high risk of neurological diseases to instrumental exams. In fact, in several neurological disorders, such as Parkinson’s disease, motor impairments of vocal apparatus arise earlier than postural and ambulatory symptoms. In a home telemonitoring system, in which hardware would consist in a voice recorder (that could be a simple smartphone) and a server for the web platform, data would be acquired and instantly stored on a platform for their processing through machine learning algorithms and to be viewed by specialists. For this purpose, a fully automatic process is needed. Therefore, in this work, audio-preprocessing and features computation are completely performed automatically, using Matlab. Final models have been trained in Matlab environments from Weka’s libraries. The family of developed models are trained with different type of phonations, from simple vowels to complex sounds, for a wider and more efficient analysis of vocal apparatus motor impairments. Moreover, dataset was 612 observation large, that is significantly above the mean size of similar works using simple phonations only. For a deeper analysis, different groups of parameters have been tested and cepstral features have been found to be optimal for classification and made up the big part of final algorithm. Developed models are part of the K-Nearest Neighbor family, thus, available for implementation in web platform. Finally, obtained models have shown high accuracies on the whole dataset, reaching values comparable with the literature but with more stability (standard deviation less than 1%). These results have been confirmed in the last validation session in which models have been exported and validated with 25% of data, reaching a best performance with a true positive rate of 98% and a true negative rate of 87%.
基于语音信号的帕金森病患者分类筛查
本文提出了一种用于帕金森病筛查的分类算法。代码执行对健康和患病受试者记录的特定语音信号的处理。为了在未来的家庭远程监控系统中实现和验证,该算法的目标是作为一种筛选工具,用于过早指导具有神经系统疾病高风险的受试者进行仪器检查。事实上,在一些神经系统疾病中,如帕金森氏病,发声器官的运动损伤比姿势和运动症状出现得更早。在家庭远程监控系统中,硬件将由录音机(可以是一个简单的智能手机)和网络平台的服务器组成,数据将被获取并立即存储在平台上,以便通过机器学习算法进行处理,并供专家查看。为此,需要一个全自动的过程。因此,在本工作中,音频预处理和特征计算完全是自动完成的,使用Matlab。最终的模型已经在来自Weka库的Matlab环境中进行了训练。该系列开发的模型使用不同类型的发音进行训练,从简单的元音到复杂的声音,以便更广泛、更有效地分析发声器官运动障碍。此外,数据集的大小为612个观测值,明显高于仅使用简单发音的同类作品的平均大小。为了进行更深入的分析,我们测试了不同的参数组,发现倒谱特征是最适合分类的,并构成了最终算法的大部分。开发的模型是k近邻系列的一部分,因此可以在web平台上实现。最后,获得的模型在整个数据集上显示出很高的精度,达到与文献相当的值,但具有更高的稳定性(标准差小于1%)。这些结果在最后一次验证会话中得到了证实,其中导出模型并使用25%的数据进行验证,达到了真阳性率为98%和真阴性率为87%的最佳性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信