{"title":"Automatic identification of pathological voice quality based on the GRBAS categorization","authors":"A. Sasou","doi":"10.1109/APSIPA.2017.8282229","DOIUrl":null,"url":null,"abstract":"Acoustic analysis-based automatic detection of voice pathologies enables non-invasive, low-cost and objective assessments of the presence of disorders, which might assist in accelerating and improving the diagnosis and clinical treatment given to patients. In this paper, we focus on the automatic assessment of pathological voice quality by identifying the four attributes of Roughness, Breathiness, Asthenia, and Strain based on the GRBAS categorization. The proposed method adopts higher-order local auto-correlation (HLAC) features, which are calculated from the excitation source signal obtained by an automatic topology-generated AR-HMM analysis, and identifies the four attributes using a feed-forward neural network (FFNN)-based classifier. In the experiments, an average F-measure of 87.25% was obtained for a speaker- based identification task, which confirms the feasibility of the proposed method.","PeriodicalId":142091,"journal":{"name":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"320 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2017.8282229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Acoustic analysis-based automatic detection of voice pathologies enables non-invasive, low-cost and objective assessments of the presence of disorders, which might assist in accelerating and improving the diagnosis and clinical treatment given to patients. In this paper, we focus on the automatic assessment of pathological voice quality by identifying the four attributes of Roughness, Breathiness, Asthenia, and Strain based on the GRBAS categorization. The proposed method adopts higher-order local auto-correlation (HLAC) features, which are calculated from the excitation source signal obtained by an automatic topology-generated AR-HMM analysis, and identifies the four attributes using a feed-forward neural network (FFNN)-based classifier. In the experiments, an average F-measure of 87.25% was obtained for a speaker- based identification task, which confirms the feasibility of the proposed method.