{"title":"Deep Learning Based Pathological Voice Detection Algorithm Using Speech and Electroglottographic (EGG) Signals","authors":"R. Islam, E. Abdel-Raheem, M. Tarique","doi":"10.1109/ICECTA57148.2022.9990262","DOIUrl":null,"url":null,"abstract":"This paper presents a convolutional neural network-based pathological voice detection system using speech and electroglottographic (EGG) signals. Speech signals have been popularly used to detect voice pathology. Recently, the EGG signals have drawn considerable attention from researchers in this field. They argued that the EGG signals could detect the vocal fold vibration more accurately than speech signals and hence can be considered more appropriate for voice pathology detection. This work investigates the effectiveness of the EGG and speech signals in detecting pathological voices using sustained vowel (“/a/”) samples collected from the Saarbrücken Voice Database (SVD). The Mel frequency cepstral coefficients (MFCCs) extracted from the speech and EGG samples are employed as discerning features for this investigation. The results show that the proposed system achieves a higher accuracy (more than 23%) in identifying dysphonic voices from healthy ones with speech signals.","PeriodicalId":337798,"journal":{"name":"2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical and Computing Technologies and Applications (ICECTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECTA57148.2022.9990262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a convolutional neural network-based pathological voice detection system using speech and electroglottographic (EGG) signals. Speech signals have been popularly used to detect voice pathology. Recently, the EGG signals have drawn considerable attention from researchers in this field. They argued that the EGG signals could detect the vocal fold vibration more accurately than speech signals and hence can be considered more appropriate for voice pathology detection. This work investigates the effectiveness of the EGG and speech signals in detecting pathological voices using sustained vowel (“/a/”) samples collected from the Saarbrücken Voice Database (SVD). The Mel frequency cepstral coefficients (MFCCs) extracted from the speech and EGG samples are employed as discerning features for this investigation. The results show that the proposed system achieves a higher accuracy (more than 23%) in identifying dysphonic voices from healthy ones with speech signals.