{"title":"Automatic detection of Parkinsonian speech using wavelet scattering features.","authors":"Mittapalle Kiran Reddy, Paavo Alku","doi":"10.1121/10.0036660","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we study the automatic detection of Parkinson's disease (PD) from speech using features computed by a two-layer wavelet scattering network, which generates locally stable and translation-invariant features at each layer. The scattering features are encoded using Fisher vectors to obtain a single fixed-size feature vector per utterance. Support vector machine and feed-forward neural network classifiers are trained using the utterance-level features to perform the detection task (healthy vs PD). The results obtained with the PC-GITA database revealed that the proposed approach shows better results in comparison to the state-of-the-art techniques. The best classification accuracy of 87% was achieved with the proposed approach using speech from a text reading task.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 5","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0036660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we study the automatic detection of Parkinson's disease (PD) from speech using features computed by a two-layer wavelet scattering network, which generates locally stable and translation-invariant features at each layer. The scattering features are encoded using Fisher vectors to obtain a single fixed-size feature vector per utterance. Support vector machine and feed-forward neural network classifiers are trained using the utterance-level features to perform the detection task (healthy vs PD). The results obtained with the PC-GITA database revealed that the proposed approach shows better results in comparison to the state-of-the-art techniques. The best classification accuracy of 87% was achieved with the proposed approach using speech from a text reading task.