Novel speech features of disturbances in laryngeal muscle coordination
Branimir Dropuljić, D. Petrinović, K. Ćosić
2016 7th IEEE International Conference on Cognitive Infocommunications (CogInfoCom), 2016-10-01
DOI: 10.1109/COGINFOCOM.2016.7804545
This paper proposes speech features derived from a decomposition of the fundamental frequency (F0) contour. The decomposition is designed to separate, as far as possible, the simultaneous neurobiological effects on vocal fold vibration. The focus is on involuntary disturbances of these vibrations, analyzed in the context of emotional stress. The proposed features are compared with the conventional perturbation measures, jitter and shimmer, on two datasets: synthetic perturbations and the Roller-coaster subset of SUSAS (Speech Under Simulated and Actual Stress). The features are also assessed for their potential to eliminate voluntary effects, such as F0 contour changes during natural pronunciation. Results of the initial synthetic perturbation analysis indicate that the proposed features could be less affected by voluntary control and more closely related to disturbances in laryngeal muscle coordination. In the analysis of speech under stress, the proposed features generally outperform the conventional perturbation features.
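For readers unfamiliar with the baseline measures named in the abstract, the following is a minimal sketch of the commonly used "local" definitions of jitter and shimmer: the mean absolute difference between consecutive cycles, divided by the mean value. It does not implement the paper's proposed features or its contour decomposition, which the abstract does not specify. The function names, the linear pitch glide, and the Gaussian perturbation model are assumptions made purely for illustration.

```python
import random


def _relative_perturbation(values):
    """Mean absolute difference of consecutive values, divided by their mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


def local_jitter(periods):
    """Local jitter: cycle-to-cycle variation of the glottal period (seconds)."""
    return _relative_perturbation(periods)


def local_shimmer(amplitudes):
    """Local shimmer: cycle-to-cycle variation of the peak amplitude."""
    return _relative_perturbation(amplitudes)


def synthetic_periods(n_cycles=200, f0_start=120.0, f0_end=180.0,
                      perturbation=0.0, seed=0):
    """Hypothetical test signal: a linear pitch glide (a stand-in for a
    voluntary change) with optional random cycle-to-cycle disturbance
    (a stand-in for an involuntary one). Returns periods in seconds."""
    rng = random.Random(seed)
    periods = []
    for i in range(n_cycles):
        f0 = f0_start + (f0_end - f0_start) * i / (n_cycles - 1)
        f0 *= 1.0 + perturbation * rng.gauss(0.0, 1.0)
        periods.append(1.0 / f0)
    return periods


if __name__ == "__main__":
    clean = local_jitter(synthetic_periods(perturbation=0.0))
    noisy = local_jitter(synthetic_periods(perturbation=0.01))
    print(f"jitter, glide only:          {clean:.5f}")
    print(f"jitter, glide + disturbance: {noisy:.5f}")
```

In this toy setup the glide alone already yields a nonzero jitter value, even though no random disturbance is present. That is one way to see why a measure that is less sensitive to voluntary pitch movement, as the abstract argues for, might be preferable.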