{"title":"基于小波散射特征的帕金森语音自动检测。","authors":"Mittapalle Kiran Reddy, Paavo Alku","doi":"10.1121/10.0036660","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we study the automatic detection of Parkinson's disease (PD) from speech using features computed by a two-layer wavelet scattering network, which generates locally stable and translation-invariant features at each layer. The scattering features are encoded using Fisher vectors to obtain a single fixed-size feature vector per utterance. Support vector machine and feed-forward neural network classifiers are trained using the utterance-level features to perform the detection task (healthy vs PD). The results obtained with the PC-GITA database revealed that the proposed approach shows better results in comparison to the state-of-the-art techniques. The best classification accuracy of 87% was achieved with the proposed approach using speech from a text reading task.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 5","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic detection of Parkinsonian speech using wavelet scattering features.\",\"authors\":\"Mittapalle Kiran Reddy, Paavo Alku\",\"doi\":\"10.1121/10.0036660\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In this paper, we study the automatic detection of Parkinson's disease (PD) from speech using features computed by a two-layer wavelet scattering network, which generates locally stable and translation-invariant features at each layer. The scattering features are encoded using Fisher vectors to obtain a single fixed-size feature vector per utterance. Support vector machine and feed-forward neural network classifiers are trained using the utterance-level features to perform the detection task (healthy vs PD). The results obtained with the PC-GITA database revealed that the proposed approach shows better results in comparison to the state-of-the-art techniques. The best classification accuracy of 87% was achieved with the proposed approach using speech from a text reading task.</p>\",\"PeriodicalId\":73538,\"journal\":{\"name\":\"JASA express letters\",\"volume\":\"5 5\",\"pages\":\"\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JASA express letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1121/10.0036660\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0036660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
Automatic detection of Parkinsonian speech using wavelet scattering features.
In this paper, we study the automatic detection of Parkinson's disease (PD) from speech using features computed by a two-layer wavelet scattering network, which generates locally stable and translation-invariant features at each layer. The scattering features are encoded using Fisher vectors to obtain a single fixed-size feature vector per utterance. Support vector machine and feed-forward neural network classifiers are trained using the utterance-level features to perform the detection task (healthy vs PD). The results obtained with the PC-GITA database revealed that the proposed approach shows better results in comparison to the state-of-the-art techniques. The best classification accuracy of 87% was achieved with the proposed approach using speech from a text reading task.