语音变时标修正的谱变函数

2021 National Conference on Communications (NCC) Pub Date : 2021-07-27 DOI:10.1109/NCC52529.2021.9530088

P. Kachare, P. C. Pandey

{"title":"语音变时标修正的谱变函数","authors":"P. Kachare, P. C. Pandey","doi":"10.1109/NCC52529.2021.9530088","DOIUrl":null,"url":null,"abstract":"Spectral variation function is used to detect salient segments (segments with sharp spectral transitions). It is calculated from cosine of the angle between the averaged feature vectors of the adjacent segments. A modified version of this function is presented for variable time-scale modification of the speech signal. It uses the magnitude spectrum smoothed by auditory critical band filters and a small offset in the normalization for the angle cosine. Test results showed that the modified function detects spectral saliencies and does not have spurious peaks. It is applied for variable time-scale modification without altering the overall duration. Listening tests showed significantly better speech quality for processing using the modified function.","PeriodicalId":414087,"journal":{"name":"2021 National Conference on Communications (NCC)","volume":"2006 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Spectral Variation Function for Variable Time-Scale Modification of Speech\",\"authors\":\"P. Kachare, P. C. Pandey\",\"doi\":\"10.1109/NCC52529.2021.9530088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spectral variation function is used to detect salient segments (segments with sharp spectral transitions). It is calculated from cosine of the angle between the averaged feature vectors of the adjacent segments. A modified version of this function is presented for variable time-scale modification of the speech signal. It uses the magnitude spectrum smoothed by auditory critical band filters and a small offset in the normalization for the angle cosine. Test results showed that the modified function detects spectral saliencies and does not have spurious peaks. It is applied for variable time-scale modification without altering the overall duration. Listening tests showed significantly better speech quality for processing using the modified function.\",\"PeriodicalId\":414087,\"journal\":{\"name\":\"2021 National Conference on Communications (NCC)\",\"volume\":\"2006 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC52529.2021.9530088\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC52529.2021.9530088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

光谱变化函数用于检测显著段(具有尖锐光谱过渡的段)。它是由相邻段的平均特征向量之间的夹角的余弦计算得到的。针对语音信号的可变时间尺度修改，提出了该函数的改进版本。它使用由听觉临界带滤波器平滑的幅度谱和角余弦归一化的小偏移。测试结果表明，改进后的函数可以检测到光谱的显著性，并且没有假峰。它适用于不改变总持续时间的可变时间尺度修改。听力测试显示，使用修改后的功能处理的语音质量明显更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Spectral Variation Function for Variable Time-Scale Modification of Speech

Spectral variation function is used to detect salient segments (segments with sharp spectral transitions). It is calculated from cosine of the angle between the averaged feature vectors of the adjacent segments. A modified version of this function is presented for variable time-scale modification of the speech signal. It uses the magnitude spectrum smoothed by auditory critical band filters and a small offset in the normalization for the angle cosine. Test results showed that the modified function detects spectral saliencies and does not have spurious peaks. It is applied for variable time-scale modification without altering the overall duration. Listening tests showed significantly better speech quality for processing using the modified function.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 National Conference on Communications (NCC)

自引率

0.00%

发文量