On peculiarities of evaluating the quality of speech and music signals subjected to phase distortion

2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO). Pub Date: 2017-04-01. DOI: 10.1109/ELNANO.2017.7939796

A. Prodeus, Vitalii Didkovskyi, M. Didkovska, I. Kotvytskyi


Citations: 3

Abstract

This paper compares subjective and objective estimators of the quality of speech and music signals subjected to phase distortion, and constructs a mapping between the objective and subjective quality estimates. Phase distortion was found to be perceived more strongly in speech signals than in music signals. Two types of phase distortion are considered: 1) low-frequency signal components lag behind high-frequency components by 30–90 ms; 2) high-frequency signal components lag behind low-frequency components by the same 30–90 ms. The human auditory system is shown to be more sensitive to the second type. The objective quality measures, by contrast, were found to be useless for distinguishing between the two types. The paper also establishes the maximum acceptable difference between the group delay times at low (125 Hz) and high (8 kHz) frequencies: 40 ms for speech and 80 ms for music.
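The abstract does not say how the phase distortion was generated, so the following is only an illustrative sketch, not the authors' procedure. It assumes an all-pass filter applied in the frequency domain, with a group delay that varies linearly in frequency between 125 Hz and 8 kHz and is held constant outside that band. The function name `apply_group_delay` and all of its parameters are hypothetical. The phase is obtained from the group delay through the standard relation phase(f) = -2π ∫ τ(u) du, integrated from 0 to f.

```python
import numpy as np


def apply_group_delay(x, fs, tau_low, tau_high, f_low=125.0, f_high=8000.0):
    """All-pass filter x so that the group delay is tau_low seconds at f_low
    and tau_high seconds at f_high (illustrative assumption: linear in
    frequency in between, constant outside the band)."""
    x = np.asarray(x, dtype=float)
    # Zero-pad by the largest delay so delayed components do not wrap around.
    pad = int(np.ceil(max(tau_low, tau_high) * fs))
    n = len(x) + pad
    spectrum = np.fft.rfft(x, n)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    # Target group delay at every FFT bin (np.interp clamps outside the band).
    tau = np.interp(freqs, [f_low, f_high], [tau_low, tau_high])
    # phase(f) = -2*pi * integral of tau from 0 to f (trapezoidal rule).
    df = freqs[1] - freqs[0]
    integral = np.concatenate(([0.0], np.cumsum(0.5 * (tau[1:] + tau[:-1]) * df)))
    phase = -2.0 * np.pi * integral
    # Unit-magnitude multiplier: only the phase of each component changes.
    return np.fft.irfft(spectrum * np.exp(1j * phase), n)


if __name__ == "__main__":
    fs = 44100
    rng = np.random.default_rng(0)
    signal = rng.standard_normal(fs)  # one second of noise as a stand-in signal
    # Type 1: low frequencies lag high frequencies by 40 ms.
    type1 = apply_group_delay(signal, fs, tau_low=0.040, tau_high=0.0)
    # Type 2: high frequencies lag low frequencies by 40 ms.
    type2 = apply_group_delay(signal, fs, tau_low=0.0, tau_high=0.040)
    print(len(type1), len(type2))
```

Under these assumptions, setting `tau_low - tau_high` to 0.040 or 0.080 would correspond to the 40 ms (speech) and 80 ms (music) group delay differences quoted in the abstract, and swapping the two arguments switches between the two distortion types.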
