{"title":"基于自然谱图统计的非侵入式语音质量评估","authors":"Shakeel Zafar, I. Nizami, Muhammad Majid","doi":"10.1109/iCoMET48670.2020.9074140","DOIUrl":null,"url":null,"abstract":"Speech quality assessment is one of the active research area in the field of communication and signal processing. In this paper, we proposed a new method to predict the quality of non-intrusive speech signals. This work uses the natural spectro-gram statistical (NSS) properties of speech signals. Undistorted speech follows a natural pattern, which is changed in the presence of distortion. The deviation of NSS in the presence of distortion is used to assess the quality of speech signals by extracting features using the generalized Gaussian distribution and mean subtracted contrast normalized coefficients of the spectrogram. The proposed methodology assess the quality of speech signals without the use of reference speech signal. Experimental results show that the proposed methodology gives high correlation of 0.92 and 0.89, and lowest root-mean-squared error of 0.16 and 0.21 on NOIZEUS-930 and CSTR VCTK Corpus datasets respectively when compared with state-of-the-art speech quality assessment techniques.","PeriodicalId":431051,"journal":{"name":"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Non-intrusive Speech Quality Assessment using Natural Spectrogram Statistics\",\"authors\":\"Shakeel Zafar, I. Nizami, Muhammad Majid\",\"doi\":\"10.1109/iCoMET48670.2020.9074140\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech quality assessment is one of the active research area in the field of communication and signal processing. In this paper, we proposed a new method to predict the quality of non-intrusive speech signals. This work uses the natural spectro-gram statistical (NSS) properties of speech signals. Undistorted speech follows a natural pattern, which is changed in the presence of distortion. The deviation of NSS in the presence of distortion is used to assess the quality of speech signals by extracting features using the generalized Gaussian distribution and mean subtracted contrast normalized coefficients of the spectrogram. The proposed methodology assess the quality of speech signals without the use of reference speech signal. Experimental results show that the proposed methodology gives high correlation of 0.92 and 0.89, and lowest root-mean-squared error of 0.16 and 0.21 on NOIZEUS-930 and CSTR VCTK Corpus datasets respectively when compared with state-of-the-art speech quality assessment techniques.\",\"PeriodicalId\":431051,\"journal\":{\"name\":\"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iCoMET48670.2020.9074140\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iCoMET48670.2020.9074140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Non-intrusive Speech Quality Assessment using Natural Spectrogram Statistics
Speech quality assessment is one of the active research area in the field of communication and signal processing. In this paper, we proposed a new method to predict the quality of non-intrusive speech signals. This work uses the natural spectro-gram statistical (NSS) properties of speech signals. Undistorted speech follows a natural pattern, which is changed in the presence of distortion. The deviation of NSS in the presence of distortion is used to assess the quality of speech signals by extracting features using the generalized Gaussian distribution and mean subtracted contrast normalized coefficients of the spectrogram. The proposed methodology assess the quality of speech signals without the use of reference speech signal. Experimental results show that the proposed methodology gives high correlation of 0.92 and 0.89, and lowest root-mean-squared error of 0.16 and 0.21 on NOIZEUS-930 and CSTR VCTK Corpus datasets respectively when compared with state-of-the-art speech quality assessment techniques.