A. Prodeus, Vitalii Didkovskyi, M. Didkovska, I. Kotvytskyi
{"title":"相位失真下语音和音乐信号质量评价的特点","authors":"A. Prodeus, Vitalii Didkovskyi, M. Didkovska, I. Kotvytskyi","doi":"10.1109/ELNANO.2017.7939796","DOIUrl":null,"url":null,"abstract":"In this paper, subjective and objective estimators of the quality of speech and music signals subjected to phase distortion are compared, and mapping between objective and subjective quality estimates is realized. It was found that the phase distortion of speech signals is perceived stronger than ones for musical signals. Two types of phase distortion are considered: 1) low-frequency signal components lag behind high-frequency components in the 30–90 ms; 2) high-frequency signal components lag behind low frequency components in the same time. It is shown that the human auditory system is more sensitive to the latter type of phase distortion referenced previously. As far as objective quality measures, it was found they are useless for distinguishing these types of phase distortion. Maximum differences between group delay times for low (125 Hz) and high (8 kHz) frequencies, for which distortion is acceptable, were established. These threshold values are 40 ms for speech and 80 ms for music.","PeriodicalId":333746,"journal":{"name":"2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"On peculiarities of evaluating the quality of speech and music signals subjected to phase distortion\",\"authors\":\"A. Prodeus, Vitalii Didkovskyi, M. Didkovska, I. Kotvytskyi\",\"doi\":\"10.1109/ELNANO.2017.7939796\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, subjective and objective estimators of the quality of speech and music signals subjected to phase distortion are compared, and mapping between objective and subjective quality estimates is realized. It was found that the phase distortion of speech signals is perceived stronger than ones for musical signals. Two types of phase distortion are considered: 1) low-frequency signal components lag behind high-frequency components in the 30–90 ms; 2) high-frequency signal components lag behind low frequency components in the same time. It is shown that the human auditory system is more sensitive to the latter type of phase distortion referenced previously. As far as objective quality measures, it was found they are useless for distinguishing these types of phase distortion. Maximum differences between group delay times for low (125 Hz) and high (8 kHz) frequencies, for which distortion is acceptable, were established. These threshold values are 40 ms for speech and 80 ms for music.\",\"PeriodicalId\":333746,\"journal\":{\"name\":\"2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELNANO.2017.7939796\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELNANO.2017.7939796","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On peculiarities of evaluating the quality of speech and music signals subjected to phase distortion
In this paper, subjective and objective estimators of the quality of speech and music signals subjected to phase distortion are compared, and mapping between objective and subjective quality estimates is realized. It was found that the phase distortion of speech signals is perceived stronger than ones for musical signals. Two types of phase distortion are considered: 1) low-frequency signal components lag behind high-frequency components in the 30–90 ms; 2) high-frequency signal components lag behind low frequency components in the same time. It is shown that the human auditory system is more sensitive to the latter type of phase distortion referenced previously. As far as objective quality measures, it was found they are useless for distinguishing these types of phase distortion. Maximum differences between group delay times for low (125 Hz) and high (8 kHz) frequencies, for which distortion is acceptable, were established. These threshold values are 40 ms for speech and 80 ms for music.