{"title":"语音信号Mel频谱图的语音质量评估","authors":"Shakeel Zafar, I. Nizami, Muhammad Majid","doi":"10.1109/ICoDT252288.2021.9441536","DOIUrl":null,"url":null,"abstract":"Non-intrusive speech quality assessment (NI-SQA) has gained importance, due to recent advancements in multimedia, signal processing, machine learning, speech communication, and automatic speech recognition. The performance of NI-SQA techniques highly dependent on the extracted features to predict speech quality. In this article, a new machine learning-based method is proposed for predicting speech quality, without using reference signals is proposed. Traditional techniques used in literature cannot be implemented in practical application scenarios due to less correlation accuracy between subjective and objective scores. In this work, we used Mel-frequency cepstral coefficients (MFCCs) for predicting speech quality that is degraded in different noise conditions. We have computed the proposed work results on two independent databases. Experimental results show significant improvement in the performance when compared with current approaches for assessment of speech quality.","PeriodicalId":207832,"journal":{"name":"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals\",\"authors\":\"Shakeel Zafar, I. Nizami, Muhammad Majid\",\"doi\":\"10.1109/ICoDT252288.2021.9441536\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Non-intrusive speech quality assessment (NI-SQA) has gained importance, due to recent advancements in multimedia, signal processing, machine learning, speech communication, and automatic speech recognition. The performance of NI-SQA techniques highly dependent on the extracted features to predict speech quality. In this article, a new machine learning-based method is proposed for predicting speech quality, without using reference signals is proposed. Traditional techniques used in literature cannot be implemented in practical application scenarios due to less correlation accuracy between subjective and objective scores. In this work, we used Mel-frequency cepstral coefficients (MFCCs) for predicting speech quality that is degraded in different noise conditions. We have computed the proposed work results on two independent databases. Experimental results show significant improvement in the performance when compared with current approaches for assessment of speech quality.\",\"PeriodicalId\":207832,\"journal\":{\"name\":\"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICoDT252288.2021.9441536\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoDT252288.2021.9441536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Quality Assessment using Mel Frequency Spectrograms of Speech Signals
Non-intrusive speech quality assessment (NI-SQA) has gained importance, due to recent advancements in multimedia, signal processing, machine learning, speech communication, and automatic speech recognition. The performance of NI-SQA techniques highly dependent on the extracted features to predict speech quality. In this article, a new machine learning-based method is proposed for predicting speech quality, without using reference signals is proposed. Traditional techniques used in literature cannot be implemented in practical application scenarios due to less correlation accuracy between subjective and objective scores. In this work, we used Mel-frequency cepstral coefficients (MFCCs) for predicting speech quality that is degraded in different noise conditions. We have computed the proposed work results on two independent databases. Experimental results show significant improvement in the performance when compared with current approaches for assessment of speech quality.