On peculiarities of evaluating the quality of speech and music signals subjected to phase distortion

A. Prodeus, Vitalii Didkovskyi, M. Didkovska, I. Kotvytskyi
Published in: 2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)
Publication date: 2017-04-01
DOI: 10.1109/ELNANO.2017.7939796 (https://doi.org/10.1109/ELNANO.2017.7939796)
Citations: 3

Abstract

In this paper, subjective and objective estimators of the quality of speech and music signals subjected to phase distortion are compared, and a mapping between objective and subjective quality estimates is constructed. Phase distortion was found to be perceived more strongly in speech signals than in music signals. Two types of phase distortion are considered: 1) low-frequency signal components lag behind high-frequency components by 30–90 ms; 2) high-frequency signal components lag behind low-frequency components by the same amount. The human auditory system is shown to be more sensitive to the second type. The objective quality measures, by contrast, were found to be useless for distinguishing between these two types of phase distortion. The maximum acceptable differences between the group delay times at low (125 Hz) and high (8 kHz) frequencies were established: 40 ms for speech and 80 ms for music.
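The two distortion types described in the abstract can be pictured as an all-pass filter whose group delay changes with frequency. What follows is a minimal sketch, not the authors' method: the abstract does not say how the delay varies between 125 Hz and 8 kHz, so the linear-in-log-frequency profile, the function names `group_delay_profile` and `apply_phase_distortion`, and the FFT-based implementation are all assumptions made for illustration.

```python
import numpy as np


def group_delay_profile(freqs, max_diff_s, low_lags=True, f_lo=125.0, f_hi=8000.0):
    """Hypothetical group delay (seconds) at each frequency.

    Assumed shape: linear in log-frequency between f_lo and f_hi and
    constant outside that band. The difference between the delays at
    f_lo and f_hi equals max_diff_s.
    """
    f = np.clip(np.asarray(freqs, dtype=float), f_lo, f_hi)
    x = np.log(f / f_lo) / np.log(f_hi / f_lo)  # 0 at f_lo, 1 at f_hi
    if low_lags:
        return max_diff_s * (1.0 - x)  # type 1: low frequencies arrive later
    return max_diff_s * x              # type 2: high frequencies arrive later


def apply_phase_distortion(signal, fs, max_diff_s, low_lags=True):
    """Apply the assumed group-delay profile, leaving magnitudes unchanged."""
    signal = np.asarray(signal, dtype=float)
    # Zero-pad by the largest delay to limit circular wrap-around.
    pad = int(round(max_diff_s * fs))
    padded = np.concatenate([signal, np.zeros(pad)])
    n = len(padded)

    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    tau = group_delay_profile(freqs, max_diff_s, low_lags)

    # Group delay is tau(f) = -(1 / (2*pi)) * d(phase)/df, so the phase is
    # -2*pi times the running integral of tau (trapezoidal rule), with
    # phase(0) = 0.
    df = fs / n
    integral = np.concatenate(([0.0], np.cumsum(0.5 * (tau[1:] + tau[:-1]) * df)))
    phase = -2.0 * np.pi * integral

    spectrum = np.fft.rfft(padded) * np.exp(1j * phase)
    return np.fft.irfft(spectrum, n=n)
```

Under these assumptions, `apply_phase_distortion(x, fs, 0.040, low_lags=False)` would produce the second distortion type with a 40 ms difference, the value the abstract reports as the acceptability threshold for speech. Because only the phase is altered, the magnitude spectrum stays the same.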