N. Sripriya, S. Poornima, R. Shivaranjani, P. Thangaraju
{"title":"基于抖动和闪烁的非侵入性病理语音分类技术","authors":"N. Sripriya, S. Poornima, R. Shivaranjani, P. Thangaraju","doi":"10.1109/ICCCSP.2017.7944104","DOIUrl":null,"url":null,"abstract":"Speech signal contains two characteristics, system and source. When there is disturbance in vocal cord function, there is notable change in source characteristic. Despite the technological advances in the medical field, the voice pathologists use endoscopic methods to view the vocal cord flap movements for patients with infections and disturbances in vocal cords which are painful. This work is an alternative for classifying pathological voice from normal voice by evaluating the jitter and shimmer variations in the speech signal of an affected person. When there is any distortion in voice, it is reflected in the source characteristics. Pitch being the fundamental source characteristic, analyzing pitch helps us classify pathological voice from normal voice. Jitter and shimmer are derived characteristics of pitch. The glottal closure instants are better representatives of source compared to pitch. In this work, we have explored using the glottal closure instants to calculate the jitter, shimmer and other speech parameters instead of the pitch period. Analyzing these jitter and shimmer parameters for various pathological voices and normal voices help us to classify them. Experiments were carried out using a database containing normal and pathological voices. An accuracy of 85% was achieved for normal-pathological voice classification.","PeriodicalId":269595,"journal":{"name":"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Non-intrusive technique for pathological voice classification using jitter and shimmer\",\"authors\":\"N. Sripriya, S. Poornima, R. Shivaranjani, P. Thangaraju\",\"doi\":\"10.1109/ICCCSP.2017.7944104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech signal contains two characteristics, system and source. When there is disturbance in vocal cord function, there is notable change in source characteristic. Despite the technological advances in the medical field, the voice pathologists use endoscopic methods to view the vocal cord flap movements for patients with infections and disturbances in vocal cords which are painful. This work is an alternative for classifying pathological voice from normal voice by evaluating the jitter and shimmer variations in the speech signal of an affected person. When there is any distortion in voice, it is reflected in the source characteristics. Pitch being the fundamental source characteristic, analyzing pitch helps us classify pathological voice from normal voice. Jitter and shimmer are derived characteristics of pitch. The glottal closure instants are better representatives of source compared to pitch. In this work, we have explored using the glottal closure instants to calculate the jitter, shimmer and other speech parameters instead of the pitch period. Analyzing these jitter and shimmer parameters for various pathological voices and normal voices help us to classify them. Experiments were carried out using a database containing normal and pathological voices. An accuracy of 85% was achieved for normal-pathological voice classification.\",\"PeriodicalId\":269595,\"journal\":{\"name\":\"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCSP.2017.7944104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computer, Communication and Signal Processing (ICCCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCSP.2017.7944104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Non-intrusive technique for pathological voice classification using jitter and shimmer
Speech signal contains two characteristics, system and source. When there is disturbance in vocal cord function, there is notable change in source characteristic. Despite the technological advances in the medical field, the voice pathologists use endoscopic methods to view the vocal cord flap movements for patients with infections and disturbances in vocal cords which are painful. This work is an alternative for classifying pathological voice from normal voice by evaluating the jitter and shimmer variations in the speech signal of an affected person. When there is any distortion in voice, it is reflected in the source characteristics. Pitch being the fundamental source characteristic, analyzing pitch helps us classify pathological voice from normal voice. Jitter and shimmer are derived characteristics of pitch. The glottal closure instants are better representatives of source compared to pitch. In this work, we have explored using the glottal closure instants to calculate the jitter, shimmer and other speech parameters instead of the pitch period. Analyzing these jitter and shimmer parameters for various pathological voices and normal voices help us to classify them. Experiments were carried out using a database containing normal and pathological voices. An accuracy of 85% was achieved for normal-pathological voice classification.